Document Detail

Accuracy of sequence alignment and fold assessment using reduced amino acid alphabets.
MedLine Citation:
PMID:  16506243     Owner:  NLM     Status:  MEDLINE    
Reduced or simplified amino acid alphabets group the 20 naturally occurring amino acids into a smaller number of representative protein residues. To date, several reduced amino acid alphabets have been proposed, which have been derived and optimized by a variety of methods. The resulting reduced amino acid alphabets have been applied to pattern recognition, generation of consensus sequences from multiple alignments, protein folding, and protein structure prediction. In this work, amino acid substitution matrices and statistical potentials were derived based on several reduced amino acid alphabets and their performance assessed in a large benchmark for the tasks of sequence alignment and fold assessment of protein structure models, using as a reference frame the standard alphabet of 20 amino acids. The results showed that a large reduction in the total number of residue types does not necessarily translate into a significant loss of discriminative power for sequence alignment and fold assessment. Therefore, some definitions of a few residue types are able to encode most of the relevant sequence/structure information that is present in the 20 standard amino acids. Based on these results, we suggest that the use of reduced amino acid alphabets may allow to increasing the accuracy of current substitution matrices and statistical potentials for the prediction of protein structure of remote homologs.
Francisco Melo; Marc A Marti-Renom
Related Documents :
6847613 - Carbohydrate is linked through ethanolamine to the c-terminal amino acid of trypanosoma...
15653803 - Ultraviolet-b sensitivities in japanese lowland rice cultivars: cyclobutane pyrimidine ...
3082323 - Regulation of macrophage eicosanoid production by hydroperoxy-and hydroxy-eicosatetraen...
Publication Detail:
Type:  Journal Article; Research Support, Non-U.S. Gov't    
Journal Detail:
Title:  Proteins     Volume:  63     ISSN:  1097-0134     ISO Abbreviation:  Proteins     Publication Date:  2006 Jun 
Date Detail:
Created Date:  2006-05-16     Completed Date:  2006-07-20     Revised Date:  2006-11-15    
Medline Journal Info:
Nlm Unique ID:  8700181     Medline TA:  Proteins     Country:  United States    
Other Details:
Languages:  eng     Pagination:  986-95     Citation Subset:  IM    
Copyright Information:
2006 Wiley-Liss, Inc.
Departamento de Genética Molecular y Microbiología, Facultad de Ciencias Biológicas, Pontificia Universidad Católica de Chile, Santiago, Chile.
Export Citation:
APA/MLA Format     Download EndNote     Download BibTex
MeSH Terms
Amino Acid Sequence
Amino Acids / chemistry*,  classification,  metabolism*
Consensus Sequence
Molecular Sequence Data
Protein Folding*
Proteins / chemistry*,  metabolism*
Sequence Alignment / methods*
Structural Homology, Protein
Reg. No./Substance:
0/Amino Acids; 0/Proteins

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine

Previous Document:  Scoring a diverse set of high-quality docked conformations: a metascore based on electrostatic and d...
Next Document:  Spatial patterns of protein expression in focal infections of human cytomegalovirus.