Document Detail


Genome characterization through dichotomic classes: An analysis of the whole chromosome 1 of A. thaliana.
MedLine Citation:
PMID:  23311369     Owner:  NLM     Status:  In-Data-Review    
Abstract/OtherAbstract:
In this article we show how dichotomic classes, binary variables naturally derived from a new mathematical model of the genetic code, can be used in order to characterize different parts of the genome. In particular, we analyze and compare different parts of whole chromosome 1 of Arabidopsis thaliana: genes, exons, introns, coding sequences (CDS), intergenes, untranslated regions (UTR) and regulatory sequences. In order to accomplish the task we encode each sequence in the 3 possible reading frames according to the definitions of the dichotomic classes (parity, Rumer and hidden). Then, we perform a statistical analysis on the binary sequences. Interestingly, the results show that coding and non-coding sequences have different patterns and proportions of dichotomic classes. This suggests that the frame is important only for coding sequences and that dichotomic classes can be useful to recognize them. Moreover, such patterns seem to be more enhanced in CDS than in exons. Also, we derive an independence test in order to assess whether the percentages observed could be considered as an expression of independent random processes. The results confirm that only genes, exons and CDS seem to possess a dependence structure that distinguishes them from i.i.d sequences. Such informational content is independent from the global proportion of nucleotides of a sequence. The present work confirms that the recent mathematical model of the genetic code is a new paradigm for understanding the management and the organization of genetic information and is an innovative tool for investigating informational aspects of error detection/correction mechanisms acting at the level of DNA replication.
Authors:
Enrico Properzi; Simone Giannerini; Diego Luis Gonzalez; Rodolfo Rosa
Related Documents :
18221489 - Identification of the mycobacterial glucosyl-3-phosphoglycerate synthase.
9501229 - Multilocus sequence typing: a portable approach to the identification of clones within ...
23091369 - Genome-wide survey of ds exonization to enrich transcriptomes and proteomes in plants.
11532219 - The genome of mycobacterium leprae: a minimal mycobacterial gene set.
23453329 - Molecular cloning and protein characterization of swine 4-1bb.
24177999 - Nucleotide sequence of the clustered genes for the 44 kd chlorophyll a apoprotein and t...
15772689 - Sirna targeting the leader sequence of sars-cov inhibits virus replication.
11768539 - Mutational analysis of beta-glucanase genes from the plant-pathogenic fungus cochliobol...
9724769 - A general procedure for locating and analyzing protein-binding sequence motifs in nucle...
Publication Detail:
Type:  Journal Article    
Journal Detail:
Title:  Mathematical biosciences and engineering : MBE     Volume:  10     ISSN:  1551-0018     ISO Abbreviation:  Math Biosci Eng     Publication Date:  2013 Feb 
Date Detail:
Created Date:  2013-01-14     Completed Date:  -     Revised Date:  -    
Medline Journal Info:
Nlm Unique ID:  101197794     Medline TA:  Math Biosci Eng     Country:  United States    
Other Details:
Languages:  eng     Pagination:  199-219     Citation Subset:  IM    
Affiliation:
Dipartimento di Scienze Statistiche, Universita di Bologna, Via delle Belle Arti 41, 40126, Bologna, Italy. enrico.properzi3@unibo.it.
Export Citation:
APA/MLA Format     Download EndNote     Download BibTex
MeSH Terms
Descriptor/Qualifier:

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine


Previous Document:  A therapy inactivating the tumor angiogenic factors.
Next Document:  Mathematical analysis and simulations involving chemotherapy and surgery on large human tumours unde...