Document Detail

Statistically invalid classification of high throughput gene expression data.
MedLine Citation:
PMID:  23346359     Owner:  NLM     Status:  In-Data-Review    
Classification analysis based on high throughput data is a common feature in neuroscience and other fields of science, with a rapidly increasing impact on both basic biology and disease-related studies. The outcome of such classifications often serves to delineate novel biochemical mechanisms in health and disease states, identify new targets for therapeutic interference, and develop innovative diagnostic approaches. Given the importance of this type of studies, we screened 111 recently-published high-impact manuscripts involving classification analysis of gene expression, and found that 58 of them (53%) based their conclusions on a statistically invalid method which can lead to bias in a statistical sense (lower true classification accuracy then the reported classification accuracy). In this report we characterize the potential methodological error and its scope, investigate how it is influenced by different experimental parameters, and describe statistically valid methods for avoiding such classification mistakes.
Shahar Barbash; Hermona Soreq
Related Documents :
22930479 - Blood biomarkers of methylation in down syndrome and metabolic simulations using a math...
24335339 - Towards a greater understanding of the illicit tobacco trade in europe: a review of the...
25407879 - Assessing trade-offs to inform ecosystem-based fisheries management of forage fish.
25336179 - Retina-v1 model of detectability across the visual field.
24954019 - An effective haplotype assembly algorithm based on hypergraph partitioning.
22750569 - Dynamic causal modelling of precision and synaptic gain in visual perception - an eeg s...
21929079 - Global modeling of complex data series using the term-ranking approach and its applicat...
21590789 - Fitting dynamic models with forcing functions: application to continuous glucose monito...
25189459 - A primer on marginal effects-part i: theory and formulae.
Publication Detail:
Type:  Journal Article     Date:  2013-01-22
Journal Detail:
Title:  Scientific reports     Volume:  3     ISSN:  2045-2322     ISO Abbreviation:  Sci Rep     Publication Date:  2013  
Date Detail:
Created Date:  2013-01-24     Completed Date:  -     Revised Date:  -    
Medline Journal Info:
Nlm Unique ID:  101563288     Medline TA:  Sci Rep     Country:  England    
Other Details:
Languages:  eng     Pagination:  1102     Citation Subset:  IM    
The Edmond & Lily Safra Center for Brain Sciences and the Department of Biological Chemistry at the Hebrew University of Jerusalem.
Export Citation:
APA/MLA Format     Download EndNote     Download BibTex
MeSH Terms

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine

Previous Document:  Radial arrangement of Janus-like setae permits friction control in spiders.
Next Document:  Feedback mechanism in depolarization-induced sustained activation of extracellular signal-regulated ...