Document Detail


A protein map and its application.
MedLine Citation:
PMID:  18348704     Owner:  NLM     Status:  MEDLINE    
Abstract/OtherAbstract:
Graphical representation of gene sequences provides a simple way of viewing, sorting, and comparing various gene structures. Here we first report a two-dimensional graphical representation for protein sequences. With this method, we constructed the moment vectors for protein sequences, and mathematically proved that the correspondence between moment vectors and protein sequences is one-to-one. Therefore, each protein sequence can be represented as a point in a map, which we call protein map, and cluster analysis can be used for comparison between the points. Sixty-six proteins from five protein families were analyzed using this method. Our data showed that for proteins in the same family, their corresponding points in the map are close to each other. We also illustrate the efficiency of this approach by performing an extensive cluster analysis of the protein kinase C family. These results indicate that this protein map could be used to mathematically specify the similarity of two proteins and predict properties of an unknown protein based on its amino acid sequence.
Authors:
Stephen S-T Yau; Chenglong Yu; Rong He
Related Documents :
15048834 - Progressive combinatorial algorithm for multiple structural alignments: application to ...
19386484 - Evolution of protein modularity.
17270854 - Order independent structural alignment of circularly permuted proteins.
24458544 - Old proteins and the achilles heel of mass spectrometry. the role of proteomics in the ...
12853464 - Diversity of protein-protein interactions.
3957864 - Treponema phagedenis has at least two proteins residing together on its periplasmic fla...
Publication Detail:
Type:  Journal Article; Review    
Journal Detail:
Title:  DNA and cell biology     Volume:  27     ISSN:  1044-5498     ISO Abbreviation:  DNA Cell Biol.     Publication Date:  2008 May 
Date Detail:
Created Date:  2008-05-08     Completed Date:  2008-06-26     Revised Date:  -    
Medline Journal Info:
Nlm Unique ID:  9004522     Medline TA:  DNA Cell Biol     Country:  United States    
Other Details:
Languages:  eng     Pagination:  241-50     Citation Subset:  IM    
Affiliation:
Department of Mathematics, Statistics, and Computer Science, University of Illinois at Chicago, Chicago, Illinois 60607-7045, USA. yau@uic.edu
Export Citation:
APA/MLA Format     Download EndNote     Download BibTex
MeSH Terms
Descriptor/Qualifier:
Algorithms*
Cluster Analysis
Databases, Factual
Phylogeny
Proteins / chemistry*,  classification*
Chemical
Reg. No./Substance:
0/Proteins

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine


Previous Document:  Comparison of skeletal effects of ovariectomy versus chemically induced ovarian failure in mice.
Next Document:  The descriptive epidemiology of commonly occurring mental disorders in the United States.