Document Detail

Measuring the phylogenetic randomness of biological data sets.
MedLine Citation:
PMID:  12066305     Owner:  NLM     Status:  MEDLINE    
Two qualitative taxonomic characters are potentially compatible if the states of each can be ordered into a character state tree in such a way that the two resulting character state trees are compatible. The number of potentially compatible pairs (NPCP) of qualitative characters from a data set may be considered to be a measure of its phylogenetic randomness. The value of NPCP depends on the number of evolutionary units (EUs), the number of characters, the number of states in the characters, the distributions of EUs among these states, and the amount and distribution of missing information and so does not directly indicate degree of phylogenetic randomness. Thus, for an observed data set, we used Monte Carlo methods to estimate the probability that a data set chosen equiprobably from among those identical (with respect to all the other above determining features) to the observed data set would have as high (or low) an NPCP as the observed data set. This probability, the realized significance of the observed NPCP, is attractive as an indication of phylogenetic randomness because it does not require the assumptions made by other such methods: No character state trees are assumed and consequently, only potential compatibility can be determined; no particular method of phylogenetic estimation is assumed; and no phylogenetic trees are constructed. We determined the values and significances of NPCP for analyses of 57 data sets taken from 53 published sources. All data sets from 37 of those sources exhibited realized significances of < 0.01, indicating high levels of phylogenetic nonrandomness. From each of the remaining 16 sources, at least one data set was more phylogenetically random. Inclusion of outgroups changed significance in some cases, but not always in the same direction. Data sets with significantly low NPCP may be consistent with an ancient hybrid origin (or other ancient polyphyletic gene exchange, crossing over, viral transfer, etc.) of the study group.
W H Day; G F Estabrook; F R McMorris
Related Documents :
12513355 - Numerical study of the development of bulk scale-free structures upon growth of self-af...
15797905 - Identification of transcription factor binding sites with variable-order bayesian netwo...
17839855 - Molluscan phylogeny: the paleontological viewpoint.
12831905 - New approaches to analyzing microbial biodiversity data.
21600745 - Conversational case-based reasoning in medical decision making.
10723875 - Selection of measurement frequencies for optimal extraction of tissue impedance model p...
Publication Detail:
Type:  Journal Article; Research Support, Non-U.S. Gov't; Research Support, U.S. Gov't, Non-P.H.S.    
Journal Detail:
Title:  Systematic biology     Volume:  47     ISSN:  1063-5157     ISO Abbreviation:  Syst. Biol.     Publication Date:  1998 Dec 
Date Detail:
Created Date:  2002-06-17     Completed Date:  2002-07-10     Revised Date:  2006-11-15    
Medline Journal Info:
Nlm Unique ID:  9302532     Medline TA:  Syst Biol     Country:  England    
Other Details:
Languages:  eng     Pagination:  604-16     Citation Subset:  IM    
Department of Biology, University of Michigan, Ann Arbor, Michigan 48109-1048, USA.
Export Citation:
APA/MLA Format     Download EndNote     Download BibTex
MeSH Terms
Monte Carlo Method
Random Allocation*

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine

Previous Document:  Can weighting improve bushy trees? Models of cytochrome b evolution and the molecular systematics of...
Next Document:  Using a nonrecursive formula to determine cladogram probabilities.