Document Detail


Fulcrum: condensing redundant reads from high-throughput sequencing studies.
MedLine Citation:
PMID:  22419786     Owner:  NLM     Status:  MEDLINE    
Abstract/OtherAbstract:
MOTIVATION: Ultra-high-throughput sequencing produces duplicate and near-duplicate reads, which can consume computational resources in downstream applications. A tool that collapses such reads should reduce storage and assembly complications and costs.
RESULTS: We developed Fulcrum to collapse identical and near-identical Illumina and 454 reads (such as those from PCR clones) into single error-corrected sequences; it can process paired-end as well as single-end reads. Fulcrum is customizable and can be deployed on a single machine, a local network or a commercially available MapReduce cluster, and it has been optimized to maximize ease-of-use, cross-platform compatibility and future scalability. Sequence datasets have been collapsed by up to 71%, and the reduced number and improved quality of the resulting sequences allow assemblers to produce longer contigs while using less memory.
Authors:
Matthew S Burriesci; Erik M Lehnert; John R Pringle
Related Documents :
21652306 - Biology and systematics of heterokont and haptophyte algae.
8968916 - Characterization of a spotted fever group rickettsia from ixodes ricinus ticks in sweden.
21796436 - Phylogenetic analysis of rice tungro bacilliform virus orfs revealed strong correlation...
21819656 - Detection of banned ruminant-derived material in industrial feedstuffs by taqman real-t...
10760176 - The aspergillus niger transcriptional activator xlnr, which is involved in the degradat...
14661116 - Functional classification of the microbial feruloyl esterases.
Publication Detail:
Type:  Journal Article; Research Support, N.I.H., Extramural; Research Support, Non-U.S. Gov't; Research Support, U.S. Gov't, Non-P.H.S.     Date:  2012-03-13
Journal Detail:
Title:  Bioinformatics (Oxford, England)     Volume:  28     ISSN:  1367-4811     ISO Abbreviation:  Bioinformatics     Publication Date:  2012 May 
Date Detail:
Created Date:  2012-05-10     Completed Date:  2012-12-05     Revised Date:  2014-06-06    
Medline Journal Info:
Nlm Unique ID:  9808944     Medline TA:  Bioinformatics     Country:  England    
Other Details:
Languages:  eng     Pagination:  1324-7     Citation Subset:  IM    
Export Citation:
APA/MLA Format     Download EndNote     Download BibTex
MeSH Terms
Descriptor/Qualifier:
Algorithms
Gene Expression Profiling
High-Throughput Nucleotide Sequencing / methods*
Humans
Pseudomonas / genetics
Sequence Analysis, DNA / methods
Software*
Grant Support
ID/Acronym/Agency:
5 T32 HG000044/HG/NHGRI NIH HHS; T32 HG000044/HG/NHGRI NIH HHS
Comments/Corrections

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine


Previous Document:  SAMSCOPE: an OpenGL-based real-time interactive scale-free SAM viewer.
Next Document:  TachoSil for postinfarction ventricular free wall rupture.