Document Detail

Statistical analysis of the DNA sequence of human chromosome 22.
MedLine Citation:
PMID:  11690062     Owner:  NLM     Status:  MEDLINE    
We study statistical patterns in the DNA sequence of human chromosome 22, the first completely sequenced human chromosome. We find that (i). the 33.4 x 10(6) nucleotide long human chromosome exhibits long-range power-law correlations over more than four orders of magnitude, (ii). the entropies H(n) of the frequency distribution of oligonucleotides of length n (n-mers) grow sublinearly with increasing n, indicating the presence of higher-order correlations for all of the studied lengths 1<or=n<or=10, and (iii). the generalized entropies H(n)(q) of n-mers decrease monotonically with increasing q and the decay of H(n)(q) with q becomes steeper with increasing n<or=10, indicating that the frequency distribution of oligonucleotides becomes increasingly nonuniform as the length n increases. We investigate to what degree known biological features may explain the observed statistical patterns. We find that (iv). the presence of interspersed repeats may cause the sublinear increase of H(n) with n, and that (v). the presence of monomeric tandem repeats as well as the suppression of CG dinucleotides may cause the observed decay of H(n)(q) with q.
D Holste; I Grosse; H Herzel
Publication Detail:
Type:  Journal Article; Research Support, Non-U.S. Gov't     Date:  2001-09-26
Journal Detail:
Title:  Physical review. E, Statistical, nonlinear, and soft matter physics     Volume:  64     ISSN:  1539-3755     ISO Abbreviation:  Phys Rev E Stat Nonlin Soft Matter Phys     Publication Date:  2001 Oct 
Date Detail:
Created Date:  2001-11-05     Completed Date:  2004-12-16     Revised Date:  2006-11-15    
Medline Journal Info:
Nlm Unique ID:  101136452     Medline TA:  Phys Rev E Stat Nonlin Soft Matter Phys     Country:  United States    
Other Details:
Languages:  eng     Pagination:  041917     Citation Subset:  IM    
Department of Theoretical Biophysics, Humboldt University Berlin, Invalidenstrasse 42, D-10115, Berlin, Germany.
Export Citation:
APA/MLA Format     Download EndNote     Download BibTex
MeSH Terms
Alu Elements
Chromosomes, Human, Pair 22 / ultrastructure*
DNA / ultrastructure*
Genome, Human
Models, Statistical
Oligonucleotides / chemistry
Repetitive Sequences, Nucleic Acid
Reg. No./Substance:
0/Oligonucleotides; 9007-49-2/DNA

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine

Previous Document:  Neural representation of alpha-oriented moving light bars in the cortex: a neural network study.
Next Document:  Epidemic spread and bifurcation effects in two-dimensional network models with viral dynamics.