Document Detail


Strategies for the effective identification of remotely related sequences in multiple PSSM search approach.
MedLine Citation:
PMID:  17380509     Owner:  NLM     Status:  MEDLINE    
Abstract/OtherAbstract:
Searches using position specific scoring matrices (PSSMs) have been commonly used in remote homology detection procedures such as PSI-BLAST and RPS-BLAST. A PSSM is generated typically using one of the sequences of a family as the reference sequence. In the case of PSI-BLAST searches the reference sequence is same as the query. Recently we have shown that searches against the database of multiple family-profiles, with each one of the members of the family used as a reference sequence, are more effective than searches against the classical database of single family-profiles. Despite relatively a better overall performance when compared with common sequence-profile matching procedures, searches against the multiple family-profiles database result in a few false positives and false negatives. Here we show that profile length and divergence of sequences used in the construction of a PSSM have major influence on the performance of multiple profile based search approach. We also identify that a simple parameter defined by the number of PSSMs corresponding to a family that is hit, for a query, divided by the total number of PSSMs in the family can distinguish effectively the true positives from the false positives in the multiple profiles search approach.
Authors:
V S Gowri; K G Tina; O Krishnadev; N Srinivasan
Related Documents :
8026859 - Mhcdb--database of the human mhc.
8019859 - Seqsee: a comprehensive program suite for protein sequence analysis.
16246909 - Expansion of the biocyc collection of pathway/genome databases to 160 genomes.
19527749 - Genomic structure of the whole d-j-c clusters and the upstream region coding v segments...
20011109 - Annotation error in public databases: misannotation of molecular function in enzyme sup...
15133159 - Accessibility of introduced cysteines in chemoreceptor transmembrane helices reveals bo...
22138989 - Hydrazine synthase, a unique phylomarker with which to study the presence and biodivers...
9441739 - Characterization and mutation analysis of goosecoid-like (gscl), a homeodomain-containi...
22647039 - Metamobilomics - expanding our knowledge on the pool of plasmid encoded traits in natur...
Publication Detail:
Type:  Journal Article; Research Support, Non-U.S. Gov't    
Journal Detail:
Title:  Proteins     Volume:  67     ISSN:  1097-0134     ISO Abbreviation:  Proteins     Publication Date:  2007 Jun 
Date Detail:
Created Date:  2007-05-11     Completed Date:  2007-06-07     Revised Date:  2007-08-13    
Medline Journal Info:
Nlm Unique ID:  8700181     Medline TA:  Proteins     Country:  United States    
Other Details:
Languages:  eng     Pagination:  789-94     Citation Subset:  IM    
Copyright Information:
2007 Wiley-Liss, Inc.
Affiliation:
Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560 012, India.
Export Citation:
APA/MLA Format     Download EndNote     Download BibTex
MeSH Terms
Descriptor/Qualifier:
Amino Acid Sequence
Databases, Protein* / classification
Sensitivity and Specificity
Sequence Homology, Amino Acid
Grant Support
ID/Acronym/Agency:
//Wellcome Trust

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine


Previous Document:  Computational protocol for predicting the binding affinities of zinc containing metalloprotein-ligan...
Next Document:  Solution structures, dynamics, and lipid-binding of the sterile alpha-motif domain of the deleted in...