Search CORE

1,721,042 research outputs found

Shields, Denis C

Author: Shields Denis C
Publication venue
Publication date: 17/03/2016
Field of study

Banwa Publications (University of the Philippines Mindanao)

Shields, Denis C.

Author: Shields Denis C.
Publication venue
Publication date: 17/03/2016
Field of study

Banwa Publications (University of the Philippines Mindanao)

GASP: gapped ancestral sequence prediction for proteins

Author: Denis C. Shields (18564)
Shields Denis C
Richard J. Edwards (7949342)
Edwards Richard
Richard J Edwards
Denis C Shields
Shields Denis C.
Edwards Richard J
Publication venue
Publication date: 01/01/2004
Field of study

Background: the prediction of ancestral protein sequences from multiple sequence alignments is useful for many bioinformatics analyses. Predicting ancestral sequences is not a simple procedure and relies on accurate alignments and phylogenies. Several algorithms exist based on Maximum Parsimony or Maximum Likelihood methods but many current implementations are unable to process residues with gaps, which may represent insertion/deletion (indel) events or sequence fragments.Results: here we present a new algorithm, GASP (Gapped Ancestral Sequence Prediction), for predicting ancestral sequences from phylogenetic trees and the corresponding multiple sequence alignments. Alignments may be of any size and contain gaps. GASP first assigns the positions of gaps in the phylogeny before using a likelihood-based approach centred on amino acid substitution matrices to assign ancestral amino acids. Important outgroup information is used by first working down from the tips of the tree to the root, using descendant data only to assign probabilities, and then working back up from the root to the tips using descendant and outgroup data to make predictions. GASP was tested on a number of simulated datasets based on real phylogenies. Prediction accuracy for ungapped data was similar to three alternative algorithms tested, with GASP performing better in some cases and worse in others. Adding simple insertions and deletions to the simulated data did not have a detrimental effect on GASP accuracy.Conclusions: GASP (Gapped Ancestral Sequence Prediction) will predict ancestral sequences from multiple protein alignments of any size. Although not as accurate in all cases as some of the more sophisticated maximum likelihood approaches, it can process a wide range of input phylogenies and will predict ancestral sequences for gapped and ungapped residues alik

Southampton (e-Prints Soton)

Springer - Publisher Connector

Directory of Open Access Journals

RCSI Repository

CompariMotif: quick and easy comparisons of sequence motifs

Author: Davey Norman E.
Edwards Richard J.
Shields Denis C.
Publication venue
Publication date: 28/03/2008
Field of study

CompariMotif is a novel tool for making motif–motif comparisons, identifying and describing similarities between regular expression motifs. CompariMotif can identify a number of different relationships between motifs, including exact matches, variants of degenerate motifs and complex overlapping motifs. Motif relationships are scored using shared information content, allowing the best matches to be easily identified in large comparisons. Many input and search options are available, enabling a list of motifs to be compared to itself (to identify recurring motifs) or to datasets of known motifs

Southampton (e-Prints Soton)

Masking residues using context-specific evolutionary conservation significantly improves short linear motif discovery

Author: Davey Norman E.
Edwards Richard J.
Shields Denis C.
Publication venue
Publication date: 01/01/2009
Field of study

Motivation: Short linear motifs (SLiMs) are important mediators of protein–protein interactions. Their short and degenerate nature presents a challenge for computational discovery. We sought to improve SLiM discovery by incorporating evolutionary information, since SLiMs are more conserved than surrounding residues.Results: We have developed a new method that assesses the evolutionary signal of a residue in its sequence and structural context. Under-conserved residues are masked out prior to SLiM discovery, allowing incorporation into the existing statistical model employed by SLiMFinder. The method shows considerable robustness in terms of both the conservation score used for individual residues and the size of the sequence neighbourhood. Optimal parameters significantly improve return of known functional motifs from benchmarking data, raising the return of significant validated SLiMs from typical human interaction datasets from 20% to 60%, while retaining the high level of stringency needed for application to real biological data. The success of this regime indicates that it could be of general benefit to computational annotation and prediction of protein function at the sequence level

Southampton (e-Prints Soton)

Crossref

Computational identification and analysis of protein short linear motifs

Author: Davey Norman E.
Edwards Richard J.
Shields Denis C.
Publication venue
Publication date: 01/06/2010
Field of study

Short linear motifs (SLiMs) in proteins can act as targets for proteolytic cleavage, sites of post-translational modification, determinants of sub-cellular localization, and mediators of protein-protein interactions. Computational discovery of SLiMs involves assembling a group of proteins postulated to share a potential motif, masking out residues less likely to contain such a motif, down-weighting shared motifs arising through common evolutionary descent, and calculation of statistical probabilities allowing for the multiple testing of all possible motifs. Much of the challenge for motif discovery lies in the assembly and masking of datasets of proteins likely to share motifs, since the motifs are typically short (between 3 and 10 amino acids in length), so that potential signals can be easily swamped by the noise of stochastically recurring motifs. Focusing on disordered regions of proteins, where SLiMs are predominantly found, and masking out non-conserved residues can reduce the level of noise but more work is required to improve the quality of high-throughput experimental datasets (e.g. of physical protein interactions) as input for computational discovery

Southampton (e-Prints Soton)

SLiMDisc: short, linear motif discovery, correcting for common evolutionary descent

Author: Davey Norman E.
Edwards Richard J.
Shields Denis C.
Publication venue
Publication date: 20/07/2006
Field of study

Many important interactions of proteins are facilitated by short, linear motifs (SLiMs) within a protein's primary sequence. Our aim was to establish robust methods for discovering putative functional motifs. The strongest evidence for such motifs is obtained when the same motifs occur in unrelated proteins, evolving by convergence. In practise, searches for such motifs are often swamped by motifs shared in related proteins that are identical by descent. Prediction of motifs among sets of biologically related proteins, including those both with and without detectable similarity, were made using the TEIRESIAS algorithm. The number of motif occurrences arising through common evolutionary descent were normalized based on treatment of BLAST local alignments. Motifs were ranked according to a score derived from the product of the normalized number of occurrences and the information content. The method was shown to significantly outperform methods that do not discount evolutionary relatedness, when applied to known SLiMs from a subset of the eukaryotic linear motif (ELM) database. An implementation of Multiple Spanning Tree weighting outperformed two other weighting schemes, in a variety of settings

Southampton (e-Prints Soton)

Crossref

Coding of pointers in the segregation analysis program POINTER

Author: Denis C. Shields
Marlow Angela
Andrew Collins
Angela Marlow
Shields Denis C.
Collins Andrew
Publication venue
Publication date: 01/01/1994
Field of study

Southampton (e-Prints Soton)

Crossref

Estimation and efficient computation of the true probability of recurrence of short linear protein sequence motifs in unrelated proteins.

Author: Davey Norman E.
Davey Norman E
Shields Denis C
Richard J Edwards
Denis C Shields
Edwards Richard J.
Shields Denis C.
Edwards Richard J
Norman E Davey
Publication venue
Publication date: 01/01/2010
Field of study

Background: large datasets of protein interactions provide a rich resource for the discovery of Short Linear Motifs (SLiMs) that recur in unrelated proteins. However, existing methods for estimating the probability of motif recurrence may be biased by the size and composition of the search dataset, such that p-value estimates from different datasets, or from motifs containing different numbers of non-wildcard positions, are not strictly comparable. Here, we develop more exact methods and explore the potential biases of computationally efficient approximations. Results: a widely used heuristic for the calculation of motif over-representation approximates motif probability by assuming that all proteins have the same length and composition. We introduce pv, which calculates the probability exactly. Secondly, the recently introduced SLiMFinder statistic Sig, accounts for multiple testing (across all possible motifs) in motif discovery. However, it approximates the probability of all other possible motifs, occurring with a score of p or less, as being equal to p. Here, we show that the exhaustive calculation of the probability of all possible motif occurrences that are as rare or rarer than the motif of interest, Sig', may be carried out efficiently by grouping motifs of a common probability (i.e. those which have permuted orders of the same residues). Sig'v, which corrects both approximations, is shown to be uniformly distributed in a random dataset when searching for non-ambiguous motifs, indicating that it is a robust significance measure. Conclusions: a method is presented to compute exactly the true probability of a non-ambiguous short protein sequence motif, and the utility of an approximate approach for novel motif discovery across a large number of datasets is demonstrated

Southampton (e-Prints Soton)

Springer - Publisher Connector

Directory of Open Access Journals

A family with X-linked deafness showing linkage to the proximal Xq region of the X chromosome

Author: Lamont Margaret
Curtis Greta
Robinson David
Phelps Peter
Shields Denis C.
Publication venue
Publication date: 1992
Field of study

Linkage analysis has been carried out in a family with severe congenital sensorineural deafness with a structural abnormality of the inner ear. Recombinations show the gene responsible for deafness in this family to lie between the loci DXS255 (Xp11.22) and DXS94 (Xq22). Close linkage was found to locus DXS159 (cpX289) in Xq12, with a LOD score of 3.155 and 0 recombination. This location is consistent with other linkage studies of X-linked deafnes

Southampton (e-Prints Soton)