Document Detail

A proteogenomic analysis of Anopheles gambiae using high-resolution Fourier transform mass spectrometry.
MedLine Citation:
PMID:  21795387     Owner:  NLM     Status:  MEDLINE    
Anopheles gambiae is a major mosquito vector responsible for malaria transmission, whose genome sequence was reported in 2002. Genome annotation is a continuing effort, and many of the approximately 13,000 genes listed in VectorBase for Anopheles gambiae are predictions that have still not been validated by any other method. To identify protein-coding genes of An. gambiae based on its genomic sequence, we carried out a deep proteomic analysis using high-resolution Fourier transform mass spectrometry for both precursor and fragment ions. Based on peptide evidence, we were able to support or correct more than 6000 gene annotations including 80 novel gene structures and about 500 translational start sites. An additional validation by RT-PCR and cDNA sequencing was successfully performed for 105 selected genes. Our proteogenomic analysis led to the identification of 2682 genome search-specific peptides. Numerous cases of encoded proteins were documented in regions annotated as intergenic, introns, or untranslated regions. Using a database created to contain potential splice sites, we also identified 35 novel splice junctions. This is a first report to annotate the An. gambiae genome using high-accuracy mass spectrometry data as a complementary technology for genome annotation.
Raghothama Chaerkady; Dhanashree S Kelkar; Babylakshmi Muthusamy; Kumaran Kandasamy; Sutopa B Dwivedi; Nandini A Sahasrabuddhe; Min-Sik Kim; Santosh Renuse; Sneha M Pinto; Rakesh Sharma; Harsh Pawar; Nirujogi Raja Sekhar; Ajeet Kumar Mohanty; Derese Getnet; Yi Yang; Jun Zhong; Aditya P Dash; Robert M MacCallum; Bernard Delanghe; Godfree Mlambo; Ashwani Kumar; T S Keshava Prasad; Mobolaji Okulate; Nirbhay Kumar; Akhilesh Pandey
Related Documents :
22731987 - Toward almost closed genomes with gapfiller.
22235347 - First description of natural and experimental conjugation between mycobacteria mediated...
2122457 - Characterization of the gene encoding the protective paracrystalline-surface-layer prot...
21925617 - Identification of hornet silk gene with a characteristic repetitive sequence in vespa s...
7545667 - Silencer elements modulate the expression of the gene for the neuron-glia cell adhesion...
22149497 - Partial genome sequence of murine gammaherpesvirus 72 and its analysis.
Publication Detail:
Type:  Journal Article; Research Support, N.I.H., Extramural; Research Support, Non-U.S. Gov't     Date:  2011-07-27
Journal Detail:
Title:  Genome research     Volume:  21     ISSN:  1549-5469     ISO Abbreviation:  Genome Res.     Publication Date:  2011 Nov 
Date Detail:
Created Date:  2011-11-02     Completed Date:  2012-03-05     Revised Date:  2013-06-28    
Medline Journal Info:
Nlm Unique ID:  9518021     Medline TA:  Genome Res     Country:  United States    
Other Details:
Languages:  eng     Pagination:  1872-81     Citation Subset:  IM    
McKusick-Nathans Institute of Genetic Medicine and Department of Biological Chemistry, Johns Hopkins University, Baltimore, Maryland 21205, USA.
Data Bank Information
Bank Name/Acc. No.:
GENBANK/GO935137;  GO935138
Export Citation:
APA/MLA Format     Download EndNote     Download BibTex
MeSH Terms
Alternative Splicing
Anopheles gambiae / genetics*,  metabolism*
Chromosome Mapping
Codon, Initiator
Genes, Insect
Mass Spectrometry
Molecular Sequence Annotation
Molecular Sequence Data
Open Reading Frames
Peptides / genetics
RNA Splice Sites
Reproducibility of Results
Untranslated Regions / genetics
Grant Support
Reg. No./Substance:
0/Codon, Initiator; 0/Peptides; 0/RNA Splice Sites; 0/Untranslated Regions

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine

Previous Document:  Dynamics of the epigenetic landscape during erythroid differentiation after GATA1 restoration.
Next Document:  Kinesin molecular motor Eg5 functions during polypeptide synthesis.