Document Detail

Resolving discrepancy between nucleotides and amino acids in deep-level arthropod phylogenomics: differentiating serine codons in 21-amino-acid models.
MedLine Citation:
PMID:  23185239     Owner:  NLM     Status:  MEDLINE    
BACKGROUND: In a previous study of higher-level arthropod phylogeny, analyses of nucleotide sequences from 62 protein-coding nuclear genes for 80 panarthopod species yielded significantly higher bootstrap support for selected nodes than did amino acids. This study investigates the cause of that discrepancy.
METHODOLOGY/PRINCIPAL FINDINGS: The hypothesis is tested that failure to distinguish the serine residues encoded by two disjunct clusters of codons (TCN, AGY) in amino acid analyses leads to this discrepancy. In one test, the two clusters of serine codons (Ser1, Ser2) are conceptually translated as separate amino acids. Analysis of the resulting 21-amino-acid data matrix shows striking increases in bootstrap support, in some cases matching that in nucleotide analyses. In a second approach, nucleotide and 20-amino-acid data sets are artificially altered through targeted deletions, modifications, and replacements, revealing the pivotal contributions of distinct Ser1 and Ser2 codons. We confirm that previous methods of coding nonsynonymous nucleotide change are robust and computationally efficient by introducing two new degeneracy coding methods. We demonstrate for degeneracy coding that neither compositional heterogeneity at the level of nucleotides nor codon usage bias between Ser1 and Ser2 clusters of codons (or their separately coded amino acids) is a major source of non-phylogenetic signal.
CONCLUSIONS: The incongruity in support between amino-acid and nucleotide analyses of the forementioned arthropod data set is resolved by showing that "standard" 20-amino-acid analyses yield lower node support specifically when serine provides crucial signal. Separate coding of Ser1 and Ser2 residues yields support commensurate with that found by degenerated nucleotides, without introducing phylogenetic artifacts. While exclusion of all serine data leads to reduced support for serine-sensitive nodes, these nodes are still recovered in the ML topology, indicating that the enhanced signal from Ser1 and Ser2 is not qualitatively different from that of the other amino acids.
Andreas Zwick; Jerome C Regier; Derrick J Zwickl
Related Documents :
9056179 - Quantitative determination of sialic acid in the monosialoganglioside, gm1, by the thio...
3944269 - Studies on the defect underlying the lysosomal storage of sialic acid in salla disease....
3416319 - An azidoaryl thioglycoside of sialic acid. a potential photoaffinity probe of sialidase...
17551639 - Sialic acid and n-acyl sialic acid analog production by fermentation of metabolically a...
11275259 - Investigation on interaction of achatinin, a 9-o-acetyl sialic acid-binding lectin, wit...
3516259 - Invasion of erythrocytes by plasmodium falciparum malaria parasites: evidence for recep...
6115579 - Influence of free fatty acids on myocardial oxygen consumption and ischemic injury.
17096349 - Familial adenomatous polyposis patients have high levels of arachidonic acid and docosa...
10960919 - Aminomethylphosphonate and 2-aminoethylphosphonate as (31)p-nmr ph markers for extracel...
Publication Detail:
Type:  Journal Article; Research Support, U.S. Gov't, Non-P.H.S.     Date:  2012-11-20
Journal Detail:
Title:  PloS one     Volume:  7     ISSN:  1932-6203     ISO Abbreviation:  PLoS ONE     Publication Date:  2012  
Date Detail:
Created Date:  2012-11-27     Completed Date:  2013-05-14     Revised Date:  2013-07-11    
Medline Journal Info:
Nlm Unique ID:  101285081     Medline TA:  PLoS One     Country:  United States    
Other Details:
Languages:  eng     Pagination:  e47450     Citation Subset:  IM    
Department of Entomology, State Museum of Natural History, Stuttgart, Germany.
Export Citation:
APA/MLA Format     Download EndNote     Download BibTex
MeSH Terms
Amino Acids / genetics*
Arthropods / genetics*
Codon / genetics*
Databases, Genetic
Genomics / methods*
Likelihood Functions
Models, Genetic
Nucleotides / genetics*
Serine / genetics*
Terminology as Topic
Reg. No./Substance:
0/Amino Acids; 0/Codon; 0/Nucleotides; 56-45-1/Serine

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine

Previous Document:  Transcriptome and gene expression analysis of the rice leaf folder, Cnaphalocrosis medinalis.
Next Document:  Pan-Bcl-2 inhibitor AT-101 enhances tumor cell killing by EGFR targeted T cells.