
Grouping of amino acid types and extraction of amino acid properties from multiple sequence alignments using variance maximization.
MedLine Citation:
PMID:  16184599     Owner:  NLM     Status:  MEDLINE    
Understanding of amino acid type co-occurrence in trusted multiple sequence alignments is a prerequisite for improved sequence alignment and remote homology detection algorithms. Two objective approaches were used to investigate co-occurrence, both based on variance maximization of the weighted residue frequencies in columns taken from a large alignment database. The first approach discretely grouped amino acid types, and the second approach extracted orthogonal properties of amino acids using principal components analysis. The grouping results corresponded to amino acid physical properties such as side chain hydrophobicity, size, or backbone flexibility, and an optimal arrangement of approximately eight groups was observed. However, interpretation of the orthogonal properties was more complex. Although the principal components accounting for the largest variances exhibited modest correlations with hydrophobicity and conservation of glycine, in general principal components did not correspond to physical properties of amino acids. Although not intuitive, these amino acid mathematical properties were demonstrated to be robust and to improve local pairwise alignment accuracy, relative to 20 amino acid frequencies alone, for a simple test case.
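The second approach in the abstract, principal components analysis of residue frequencies in alignment columns, can be illustrated with a minimal sketch. This is not the authors' implementation: the toy columns are hypothetical, the counts are unweighted (the paper uses weighted residue frequencies), and the helper names `column_frequencies` and `principal_components` are assumptions introduced only for illustration.

```python
import numpy as np

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
INDEX = {aa: i for i, aa in enumerate(AMINO_ACIDS)}

def column_frequencies(column):
    """Return the 20 residue frequencies of one alignment column (gaps ignored)."""
    counts = np.zeros(len(AMINO_ACIDS))
    for residue in column:
        if residue in INDEX:
            counts[INDEX[residue]] += 1.0
    total = counts.sum()
    return counts / total if total > 0 else counts

def principal_components(freqs):
    """Eigen-decompose the covariance of the frequency vectors, largest variance first."""
    centered = freqs - freqs.mean(axis=0)
    cov = centered.T @ centered / max(len(freqs) - 1, 1)
    eigvals, eigvecs = np.linalg.eigh(cov)  # symmetric matrix, ascending eigenvalues
    order = np.argsort(eigvals)[::-1]
    return eigvals[order], eigvecs[:, order]

# Hypothetical toy columns (one string per alignment column), not data from the paper.
toy_columns = ["LLIVL", "GGGGG", "KRKRK", "DEDEE", "VIVIL", "GGAGG"]
freqs = np.array([column_frequencies(c) for c in toy_columns])
variances, components = principal_components(freqs)
```

Each column of `components` is one orthogonal "mathematical property" of the 20 amino acid types in this simplified setting, and `variances` gives the variance captured along it. Projecting a column's frequency vector onto the leading components yields a reduced description of that column.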
James O Wrabl; Nick V Grishin
Publication Detail:
Type:  Journal Article; Research Support, N.I.H., Extramural; Research Support, U.S. Gov't, P.H.S.    
Journal Detail:
Title:  Proteins     Volume:  61     ISSN:  1097-0134     ISO Abbreviation:  Proteins     Publication Date:  2005 Nov 
Date Detail:
Created Date:  2005-10-27     Completed Date:  2006-04-17     Revised Date:  2007-11-14    
Medline Journal Info:
Nlm Unique ID:  8700181     Medline TA:  Proteins     Country:  United States    
Other Details:
Languages:  eng     Pagination:  523-34     Citation Subset:  IM    
Copyright Information:
(c) 2005 Wiley-Liss, Inc.
Affiliation:
Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas 75390-9050, USA.
MeSH Terms
Amino Acids / chemistry*
Models, Chemical
Principal Component Analysis
Proteins / chemistry*
Sequence Alignment*
Reg. No./Substance:
0/Amino Acids; 0/Proteins

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine
