Search CORE

1,721,025 research outputs found

An ENSEMBLE machine learning approach for the prediction of all-alpha membrane proteins

Author: Casadio R.
Martelli Pl
Fariselli Piero
Publication venue
Publication date: 01/01/2003
Field of study

Archivio istituzionale della ricerca - Università di Padova

Integrating ELIXIR Italy with ELIXIR Interoperability platform activities

Author: Martelli PL
Casadio R
Profiti G
Savojardo C
Publication venue
Publication date: 01/01/2018
Field of study

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

Tryptophanyl fluorescence lifetime distribution of hyperthermophilic beta-glycosidase from molecular dynamics simulation: a comparison with the experimental data

Author: CASADIO R
BISMUTO E
IRACE Gaetano
MARTELLI PL
Publication venue
Publication date: 01/01/2000
Field of study

Archivio Istituzionale della Ricerca - Università degli Studi della Campania "Luigi Vanvitelli"

The effect of tryptophanyl substitution on folding and structure of myoglobin

Author: SIRANGELO Ivana
CASADIO R
TAVASSI S
IRACE Gaetano
MARTELLI PL
Publication venue
Publication date: 01/01/2000
Field of study

Archivio Istituzionale della Ricerca - Università degli Studi della Campania "Luigi Vanvitelli"

The prediction of membrane protein structure and genome structural annotation

Author: Tasco G
Casadio R.
Martelli Pl
Fariselli Piero
Publication venue
Publication date: 01/01/2003
Field of study

Crossref

Archivio istituzionale della ricerca - Università di Padova

Prediction of disulfide connectivity in proteins with machine-learning methods and correlated mutations.

Author: Martelli PL
Savojardo C
Casadio R.
CASADIO RITA
Fariselli Piero
Fariselli P
SAVOJARDO CASTRENSE
MARTELLI PIER LUIGI
Publication venue
Publication date: 01/01/2013
Field of study

BACKGROUND: Recently, information derived by correlated mutations in proteins has regained relevance for predicting protein contacts. This is due to new forms of mutual information analysis that have been proven to be more suitable to highlight direct coupling between pairs of residues in protein structures and to the large number of protein chains that are currently available for statistical validation. It was previously discussed that disulfide bond topology in proteins is also constrained by correlated mutations. RESULTS: In this paper we exploit information derived from a corrected mutual information analysis and from the inverse of the covariance matrix to address the problem of the prediction of the topology of disulfide bonds in Eukaryotes. Recently, we have shown that Support Vector Regression (SVR) can improve the prediction for the disulfide connectivity patterns. Here we show that the inclusion of the correlated mutation information increases of 5 percentage points the SVR performance (from 54% to 59%). When this approach is used in combination with a method previously developed by us and scoring at the state of art in predicting both location and topology of disulfide bonds in Eukaryotes (DisLocate), the per-protein accuracy is 38%, 2 percentage points higher than that previously obtained. CONCLUSIONS: In this paper we show that the inclusion of information derived from correlated mutations can improve the performance of the state of the art methods for predicting disulfide connectivity patterns in Eukaryotic proteins. Our analysis also provides support to the notion that improving methods to extract evolutionary information from multiple sequence alignments greatly contributes to the scoring performance of predictors suited to detect relevant features from protein chains

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

Archivio istituzionale della ricerca - Università di Padova

Institutional Research Information System University of Turin

Predicting cancer-associated germline variations in proteins.

Author: Martelli PL
Eva Balzani
Piero Fariselli
Balzani E
Pier Luigi Martelli
Casadio R.
CASADIO RITA
Rita Casadio
Fariselli Piero
Fariselli P
MARTELLI PIER LUIGI
Publication venue
Publication date: 01/01/2012
Field of study

BACKGROUND: Various computational methods are presently available to classify whether a protein variation is disease-associated or not. However data derived from recent technological advancements make it feasible to extend the annotation of disease-associated variations in order to include specific phenotypes. Here we tackle the problem of distinguishing between genetic variations associated to cancer and variations associated to other genetic diseases. RESULTS: We implement a new method based on Support Vector Machines that takes as input the protein variant and the protein function, as described by its associated Gene Ontology terms. Our approach succeeds in discriminating between germline variants that are likely to be cancer-associated from those that are related to other genetic disorders. The method performs with values of 90% accuracy and 0.61 Matthews correlation coefficient on a set comprising 6478 germline variations (16% are cancer-associated) in 592 proteins. The sensitivity and the specificity on the cancer class are 69% and 66%, respectively. Furthermore the method is capable of correctly excluding some 96% of 3392 somatic cancer-associated variations in 1983 proteins not included in the training/testing set. CONCLUSIONS: Here we prove feasible that a large set of cancer associated germline protein variations can be successfully discriminated from those associated to other genetic disorders. This is a step further in the process of protein variant annotation. Scoring largely improves when protein function as encoded by Gene Ontology terms is considered, corroborating the role of protein function as a key feature for a correct annotation of its variations

Crossref

Springer - Publisher Connector

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

Archivio istituzionale della ricerca - Università di Padova

Institutional Research Information System University of Turin

Fishing new proteins in the twilight zone of genomes

Author: Piero Fariselli
Martelli Pl
Pier Luigi Martelli
Finocchiaro L
Casadio R
Rita Casadio
Fariselli Piero
Giacomo Finocchiaro
Publication venue
Publication date: 01/01/2003
Field of study

We address the problem of clustering the whole protein content of genomes into three different categories globular, all-alpha, and all-beta membrane proteins - with the aim of fishing new membrane proteins in the pool of nonannotated proteins (twilight zone). The focus is then mainly on outer membrane proteins. This is performed by using an integrated suite of programs (Hunter) specifically developed for predicting the occurrence of signal peptides in proteins of Gram-negative bacteria and the topography of all-Î± and all-Î2 membrane proteins. Hunter is tested on the well and partially annotated proteins (2160 and 760, respectively) of Escherichia coli K 12 scoring as high as 95.6% in the correct assignment of each chain to the category. Of the remaining 1253 nonannotated sequences, 1099 are predicted globular, 136 are all-Î±, and 18 are all-Î2 membrane proteins. In Escherichia coli O157:H7 we filtered 1901 nonannotated proteins. Our analysis classifies 1564 globular chains, 327 inner membrane proteins, and 10 outer membrane proteins. With Hunter, new membrane proteins are added to the list of putative membrane proteins of Gram-negative bacteria. The content of outer membrane proteins per genome (nine are analyzed) ranges from 1.5% to 2.4%, and it is one order of magnitude lower than that of inner membrane proteins. The finding is particularly relevant when it is considered that this is the first large-scale analysis based on validated tools that can predict the content of outer membrane proteins in a genome and can allow cross-comparison of the same protein type between different species

Crossref

Archivio istituzionale della ricerca - Università di Padova

Author Instructions

Author: Instructions Author
Publication venue
Publication date: 04/11/2013
Field of study

Crossref

Cartographic Perspectives (E-Journal - North American Cartographic Information Society, NACIS)

Going Beyond Counting First Authors in Author Co-citation Analysis

Author: Zhao Dangzhi
Publication venue
Publication date: 01/01/2005
Field of study

The present study examines one of the fundamental aspects of author co-citation analysis (ACA) - the way co-citation counts are defined. Co-citation counting provides the data on which all subsequent statistical analyses and mappings are based, and we compare ACA results based on two different types of co-citation counting - the traditional type that only counts the first one among a cited work's authors on the one hand and a non-traditional type that takes into account the first 5 authors of a cited work on the other hand. Results indicate that the picture produced through this non-traditional author co-citation counting contains more coherent author groups and is therefore considerably clearer. However, this picture represents fewer specialties in the research field being studied than that produced through the traditional first-author co-citation counting when the same number of top-ranked authors is selected and analyzed. Reasons for these effects are discussed

E-LIS