Document Detail


Large Disclosing the Nature of Computational Tools for the Analysis of Next Generation Sequencing Data.
MedLine Citation:
PMID:  22690679     Owner:  NLM     Status:  Publisher    
Abstract/OtherAbstract:
Next-generation sequencing (NGS) technologies are rapidly changing the approach to complex genomic studies, opening the way to personalized drugs development and personalized medicine. NGS technologies are characterized by a massive throughput for relatively short-sequences (30-100), and they are currently the most reliable and accurate method for grouping individuals on the basis of their genetic profiles. The first and crucial step in sequence analysis is the conversion of millions of short sequences (reads) into valuable genetic information by their mapping to a known (reference) genome. New computational methods, specifically designed for the type and the amount of data generated by NGS technologies, are replacing earlier widespread genome alignment algorithms which are unable to cope with such massive amount of data. This review provides an overview of the bioinformatics techniques that have been developed for the mapping of NGS data onto a reference genome, with a special focus on polymorphism rate and sequence error detection. The different techniques have been experimented on an appropriately defined dataset, to investigate their relative computational costs and usability, as seen from an user perspective. Since NGS platforms interrogate the genome using either the conventional nucleotide space or the more recent color space, this review does consider techniques both in nucleotide and color space, emphasizing similarities and diversities.
Authors:
Francesca Cordero; Marco Beccuti; Susanna Donatelli; Raffaele A Calogero
Related Documents :
20877579 - Evolution and distribution of the ospc gene, a transferable serotype determinant of bor...
21994619 - Towards viral genome annotation standards, report from the 2010 ncbi annotation workshop.
21901739 - Using the saccharomyces genome database (sgd) for analysis of genomic information.
12853379 - Characterization of the major antigenic protein 2 of ehrlichia canis and ehrlichia chaf...
19485769 - Identification of spotted fever group rickettsia species by polymerase chain reaction-r...
21790699 - Babesia canis rossi infection in a texas dog.
6328279 - Drosophila melanogaster has only one myosin alkali light-chain gene which encodes a pro...
24323929 - Endogenous microrna clusters outperform chimeric sequence clusters in chinese hamster o...
1843569 - Selection of lacz operon fusions in genes of gluconate metabolism in e. coli. character...
Publication Detail:
Type:  JOURNAL ARTICLE     Date:  2012-6-07
Journal Detail:
Title:  Current topics in medicinal chemistry     Volume:  -     ISSN:  1873-4294     ISO Abbreviation:  -     Publication Date:  2012 Jun 
Date Detail:
Created Date:  2012-6-13     Completed Date:  -     Revised Date:  -    
Medline Journal Info:
Nlm Unique ID:  101119673     Medline TA:  Curr Top Med Chem     Country:  -    
Other Details:
Languages:  ENG     Pagination:  -     Citation Subset:  -    
Affiliation:
Department of Computer Science, University of Torino, Torino, Italy. fra.cordero@gmail.com.
Export Citation:
APA/MLA Format     Download EndNote     Download BibTex
MeSH Terms
Descriptor/Qualifier:

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine


Previous Document:  A Systems Biology Perspective on Rational Design of Peptide Vaccine against Virus Infections.
Next Document:  Chemical Genomics and Emerging DNA Technologies in the Identification of Drug Mechanisms and Drug Ta...