Document Detail


Genome organization of the SARS-CoV.
MedLine Citation:
PMID:  15629035     Owner:  NLM     Status:  MEDLINE    
Abstract/OtherAbstract:
Annotation of the genome sequence of the SARS-CoV (severe acute respiratory syndrome-associated coronavirus) is indispensable to understand its evolution and pathogenesis. We have performed a full annotation of the SARS-CoV genome sequences by using annotation programs publicly available or developed by ourselves. Totally, 21 open reading frames (ORFs) of genes or putative uncharacterized proteins (PUPs) were predicted. Seven PUPs had not been reported previously, and two of them were predicted to contain transmembrane regions. Eight ORFs partially overlapped with or embedded into those of known genes, revealing that the SARS-CoV genome is a small and compact one with overlapped coding regions. The most striking discovery is that an ORF locates on the minus strand. We have also annotated non-coding regions and identified the transcription regulating sequences (TRS) in the intergenic regions. The analysis of TRS supports the minus strand extending transcription mechanism of coronavirus. The SNP analysis of different isolates reveals that mutations of the sequences do not affect the prediction results of ORFs.
Authors:
Jing Xu; Jianfei Hu; Jing Wang; Yujun Han; Yongwu Hu; Jie Wen; Yan Li; Jia Ji; Jia Ye; Zizhang Zhang; Wei Wei; Songgang Li; Jun Wang; Jian Wang; Jun Yu; Huanming Yang
Related Documents :
18062665 - Whole genome searching with shotgun proteomic data: applications for genome annotation.
15193305 - Recent advances in gene structure prediction.
21070285 - Structural annotation of equine protein-coding genes determined by mrna sequencing.
15184545 - Peergad: a peer-review-based and community-centric web application for viewing and anno...
7668045 - Sequence, map position and genome organization of the rpl17b gene, encoding ribosomal p...
2428685 - Undermethylation of structural gene sequences in extraembryonic lineages of the mouse.
Publication Detail:
Type:  Journal Article; Research Support, Non-U.S. Gov't    
Journal Detail:
Title:  Genomics, proteomics & bioinformatics     Volume:  1     ISSN:  1672-0229     ISO Abbreviation:  Genomics Proteomics Bioinformatics     Publication Date:  2003 Aug 
Date Detail:
Created Date:  2005-01-04     Completed Date:  2005-02-11     Revised Date:  2012-09-10    
Medline Journal Info:
Nlm Unique ID:  101197608     Medline TA:  Genomics Proteomics Bioinformatics     Country:  China    
Other Details:
Languages:  eng     Pagination:  226-35     Citation Subset:  IM    
Affiliation:
Beijing Genomics Institute, Chinese Academy of Sciences, Beijing 101300, China.
Export Citation:
APA/MLA Format     Download EndNote     Download BibTex
MeSH Terms
Descriptor/Qualifier:
Amino Acid Substitution
Base Composition
Base Sequence
Computational Biology / methods
Genome, Viral*
Isoelectric Point
Models, Genetic
Molecular Sequence Data
Molecular Weight
Open Reading Frames
SARS Virus / genetics*
Sequence Analysis
Transcription, Genetic

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine


Previous Document:  Evolution and variation of the SARS-CoV genome.
Next Document:  EST pipeline system: detailed and automated EST data processing and mining.