Search CORE

1,720,978 research outputs found

Structure-guided alignment constructed with homologous sequences using Cn3D (top) and neighbor-joining tree based on the score of aligned residues from homologous sequences using CDTree (bottom).

Author: Raja Mazumder (3204)
Sona Vasudevan (3211)
Publication venue
Publication date: 2013
Field of study

Structure-guided alignment constructed with homologous sequences using Cn3D (top) and neighbor-joining tree based on the score of aligned residues from homologous sequences using CDTree (bottom).</p

The Francis Crick Institute

Percent-identity scale.

Author: Raja Mazumder (3204)
Sona Vasudevan (3211)
Publication venue
Publication date: 2013
Field of study

The horizontal line gives the percent identity between query and subject sequences, and the boxes gives the resources and tools that can be used for functional inference.</p

The Francis Crick Institute

PIRSF (A,B), COG (C,D), and Pfam (E,F) input and results.

Author: Raja Mazumder (3204)
Sona Vasudevan (3211)
Publication venue
Publication date: 2013
Field of study

(A) The fasta sequence of query protein with UniProt accession O67940 from Aquifex aeolicus is scanned against PIR's curated family database. (The query is searched against the full-length and domain hidden Markov models for manually curated PIRSFs. If a match is found, the matched regions and statistics are displayed). (B) The query hits the PIRSF family PIRSF006779. The output provides family details; statistical data for full-length proteins, composite domains, and a pairwise alignment of query with the consensus sequence of the PIRSF. (C) The fasta sequence of query protein with UniProt accession O67940 from Aquifex aeolicus is scanned against the database of clusters of orthologous groups. COG compares protein sequences encoded in complete genomes, representing major phylogenetic lineages. Each COG consists of orthologous/co-orthologous proteins from at least three lineages. (D) The query hits COG1912. The output provides the family details: statistical score, reciprocal best hits, and members of the family. (E) The fasta sequence of query protein with UniProt accession O67940 from Aquifex aeolicus is scanned against the Pfam domain database. The Pfam database is a large collection of domain families, each represented by multiple sequence alignments and hidden Markov models (HMMs). (F) The query hits Pfam family PF01887.</p

The Francis Crick Institute

URLs used for this tutorial

Author: Raja Mazumder (3204)
Sona Vasudevan (3211)
Publication venue
Publication date: 2013
Field of study

URLs used for this tutorial</p

The Francis Crick Institute

Ten-step procedure for comparative analysis of protein structures and sequences to infer biological function.

Author: Raja Mazumder (3204)
Sona Vasudevan (3211)
Publication venue
Publication date: 2013
Field of study

Ten-step procedure for comparative analysis of protein structures and sequences to infer biological function.</p

The Francis Crick Institute

SCOP output.

Author: Raja Mazumder (3204)
Sona Vasudevan (3211)
Publication venue
Publication date: 2013
Field of study

1RQP is used since our query protein O67940 from Aquifex aeolicus does not have a solved structure. The results indicate that the N-terminal and C-terminal domains of 1RQP belong to two SCOP superfamilies. (The SCOP database provides a detailed and comprehensive description of the structural and evolutionary relationships between all proteins whose structure is known).</p

The Francis Crick Institute

Ligplot for 1RQP.

Author: Raja Mazumder (3204)
Sona Vasudevan (3211)
Publication venue
Publication date: 2013
Field of study

SAM-binding residues. Dashed green lines indicate hydrogen bonds, and the half-moon indicates van der Waals interactions. (Ligplot is a program for automatically plotting protein–ligand interactions provided as part of the PDBsum database, which is a Web-based database of summaries and analyses of all PDB structures).</p

The Francis Crick Institute

Pairwise alignment between query sequence O67940_ AQUAE and 2Q6O (top) and 1RQP (bottom).

Author: Raja Mazumder (3204)
Sona Vasudevan (3211)
Publication venue
Publication date: 2013
Field of study

(Top) Query aligns end-to-end without any long gaps with a sequence identity of 32%. (Bottom) Query aligns end-to-end but with three regions of gaps, the most significant being a 23-residue region in 1RQP residues 92–116. The sequence identity of query with 1RQP is 26%.</p

The Francis Crick Institute

Additional file 1: of Ridge regression estimated linear probability model predictions of O-glycosylation in proteins with structural and sequence data

Author: Rajaram Gana (6895718)
Sona Vasudevan (3211)
Publication venue
Publication date: 2019
Field of study

Estimation dataset. (XLSX 1187 kb

The Francis Crick Institute

VAST output.

Author: Raja Mazumder (3204)
Sona Vasudevan (3211)
Publication venue
Publication date: 2013
Field of study

Since our query protein O67940 from Aquifex aeolicus does not have a solved structure, 1RQP is used as a query. The only non-redundant structural neighbor that provides functional annotation is 2Q6O, indicated by a pink box.</p

The Francis Crick Institute