Balancing Type One and Two Errors in Multiple Testing for Differential Expression of Genes.
MedLine Citation:
PMID:  20161303     Owner:  NLM     Status:  Publisher    
A new procedure is proposed to balance type I and II errors in significance testing for differential expression of individual genes. Suppose that a collection, F(k), of k lists of selected genes is available, each of them approximating by their content the true set of differentially expressed genes. For example, such sets can be generated by a subsampling counterpart of the delete-d-jackknife method controlling the per-comparison error rate for each subsample. A final list of candidate genes, denoted by S(*), is composed in such a way that its contents be closest in some sense to all the sets thus generated. To measure "closeness" of gene lists, we introduce an asymmetric distance between sets with its asymmetry arising from a generally unequal assignment of the relative costs of type I and type II errors committed in the course of gene selection. The optimal set S(*) is defined as a minimizer of the average asymmetric distance from an arbitrary set S to all sets in the collection F(k). The minimization problem can be solved explicitly, leading to a frequency criterion for the inclusion of each gene in the final set. The proposed method is tested by resampling from real microarray gene expression data with artificially introduced shifts in expression levels of pre-defined genes, thereby mimicking their differential expression.
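The frequency criterion mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact form of the asymmetric distance, so the sketch assumes one plausible choice: the distance from a candidate set S to a list T is cost_fp times the number of genes in S but not in T, plus cost_fn times the number of genes in T but not in S. The names cost_fp, cost_fn, asymmetric_distance, average_distance and consensus_set are hypothetical and do not come from the paper. Under this assumption the average distance splits gene by gene, and a gene belongs in the minimizing set when its selection frequency across the k lists exceeds cost_fp / (cost_fp + cost_fn).

```python
from collections import Counter
from typing import Hashable, List, Set

Gene = Hashable


def asymmetric_distance(s: Set[Gene], t: Set[Gene],
                        cost_fp: float, cost_fn: float) -> float:
    # ASSUMED form of the distance (not taken from the abstract):
    # genes in s but not in t are charged cost_fp (false-positive side),
    # genes in t but not in s are charged cost_fn (false-negative side).
    return cost_fp * len(s - t) + cost_fn * len(t - s)


def average_distance(s: Set[Gene], lists: List[Set[Gene]],
                     cost_fp: float, cost_fn: float) -> float:
    # Mean asymmetric distance from s to every list in the collection.
    return sum(asymmetric_distance(s, t, cost_fp, cost_fn)
               for t in lists) / len(lists)


def consensus_set(lists: List[Set[Gene]],
                  cost_fp: float = 1.0, cost_fn: float = 1.0) -> Set[Gene]:
    # Selection frequency of each gene across the k lists.
    k = len(lists)
    counts = Counter(g for t in lists for g in t)
    # Including a gene with frequency f costs cost_fp * (1 - f);
    # excluding it costs cost_fn * f. Inclusion is cheaper exactly
    # when f > cost_fp / (cost_fp + cost_fn).
    threshold = cost_fp / (cost_fp + cost_fn)
    return {g for g, c in counts.items() if c / k > threshold}


if __name__ == "__main__":
    # Toy gene lists standing in for lists obtained from subsamples.
    toy_lists = [
        {"g1", "g2", "g3"},
        {"g1", "g2"},
        {"g1", "g4"},
        {"g1", "g2", "g5"},
    ]
    print(sorted(consensus_set(toy_lists, cost_fp=1.0, cost_fn=1.0)))
    print(sorted(consensus_set(toy_lists, cost_fp=4.0, cost_fn=1.0)))
```

With equal costs the rule reduces to a majority vote over the lists. Raising cost_fp relative to cost_fn raises the frequency a gene must reach before it is included, which trades type II errors for fewer type I errors.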
Alexander Gordon; Linlin Chen; Galina Glazko; Andrei Yakovlev
Publication Detail:
Journal Detail:
Title:  Computational statistics & data analysis     Volume:  53     ISSN:  0167-9473     ISO Abbreviation:  -     Publication Date:  2009 Mar 
Date Detail:
Created Date:  2010-7-13     Completed Date:  -     Revised Date:  -    
Medline Journal Info:
Nlm Unique ID:  100960938     Medline TA:  Comput Stat Data Anal     Country:  -    
Other Details:
Languages:  ENG     Pagination:  1622-1629     Citation Subset:  -    
Department of Mathematics and Statistics, University of North Carolina at Charlotte, 9201 University City Boulevard, Charlotte, North Carolina, U.S.A.
Grant Support
R01 GM075299-01//NIGMS NIH HHS

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine
