Document Detail


Noise reduction in genome-wide perturbation screens using linear mixed-effect models.
MedLine Citation:
PMID:  21685046     Owner:  NLM     Status:  MEDLINE    
Abstract/OtherAbstract:
MOTIVATION: High-throughput perturbation screens measure the phenotypes of thousands of biological samples under various conditions. The phenotypes measured in the screens are subject to substantial biological and technical variation. At the same time, in order to enable high throughput, it is often impossible to include a large number of replicates, and to randomize their order throughout the screens. Distinguishing true changes in the phenotype from stochastic variation in such experimental designs is extremely challenging, and requires adequate statistical methodology.
RESULTS: We propose a statistical modeling framework that is based on experimental designs with at least two controls profiled throughout the experiment, and a normalization and variance estimation procedure with linear mixed-effects models. We evaluate the framework using three comprehensive screens of Saccharomyces cerevisiae, which involve 4940 single-gene knock-out haploid mutants, 1127 single-gene knock-out diploid mutants and 5798 single-gene overexpression haploid strains. We show that the proposed approach (i) can be used in conjunction with practical experimental designs; (ii) allows extensions to alternative experimental workflows; (iii) enables a sensitive discovery of biologically meaningful changes; and (iv) strongly outperforms the existing noise reduction procedures.
AVAILABILITY: All experimental datasets are publicly available at www.ionomicshub.org. The R package HTSmix is available at http://www.stat.purdue.edu/~ovitek/HTSmix.html.
CONTACT: ovitek@stat.purdue.edu
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors:
Danni Yu; John Danku; Ivan Baxter; Sungjin Kim; Olena K Vatamaniuk; David E Salt; Olga Vitek
Related Documents :
19931006 - Management interventions in dairy herds: exploring within herd uncertainty using an int...
9039426 - The lipids in human milk.
7592076 - Maximizing selection efficiency for categorical traits.
9684176 - Covariance functions across herd production levels for test day records on milk, fat, a...
19594876 - Comparing 2-nt 3' overhangs against blunt-ended sirnas: a systems biology based study.
12199136 - Sensitivity and specificity of the autism diagnostic inventory-telephone screening in s...
Publication Detail:
Type:  Journal Article; Research Support, N.I.H., Extramural; Research Support, U.S. Gov't, Non-P.H.S.     Date:  2011-06-17
Journal Detail:
Title:  Bioinformatics (Oxford, England)     Volume:  27     ISSN:  1367-4811     ISO Abbreviation:  Bioinformatics     Publication Date:  2011 Aug 
Date Detail:
Created Date:  2011-08-05     Completed Date:  2012-02-09     Revised Date:  2013-06-28    
Medline Journal Info:
Nlm Unique ID:  9808944     Medline TA:  Bioinformatics     Country:  England    
Other Details:
Languages:  eng     Pagination:  2173-80     Citation Subset:  IM    
Affiliation:
Department of Statistics, Purdue University, West Lafayette, IN 47907, USA.
Export Citation:
APA/MLA Format     Download EndNote     Download BibTex
MeSH Terms
Descriptor/Qualifier:
Gene Knockout Techniques
Genomics
High-Throughput Screening Assays*
Linear Models
Phenotype*
Saccharomyces cerevisiae / genetics
Grant Support
ID/Acronym/Agency:
4R33DK070290-02/DK/NIDDK NIH HHS
Comments/Corrections

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine


Previous Document:  Immune System Simulation Online.
Next Document:  AMIGO, a toolbox for advanced model identification in systems biology using global optimization.