Document Detail

Exploiting Genome Structure in Association Analysis.
MedLine Citation:
PMID:  21548809     Owner:  NLM     Status:  Publisher    
Abstract A genome-wide association study involves examining a large number of single-nucleotide polymorphisms (SNPs) to identify SNPs that are significantly associated with the given phenotype, while trying to reduce the false positive rate. Although haplotype-based association methods have been proposed to accommodate correlation information across nearby SNPs that are in linkage disequilibrium, none of these methods directly incorporated the structural information such as recombination events along chromosome. In this paper, we propose a new approach called stochastic block lasso for association mapping that exploits prior knowledge on linkage disequilibrium structure in the genome such as recombination rates and distances between adjacent SNPs in order to increase the power of detecting true associations while reducing false positives. Following a typical linear regression framework with the genotypes as inputs and the phenotype as output, our proposed method employs a sparsity-enforcing Laplacian prior for the regression coefficients, augmented by a first-order Markov process along the sequence of SNPs that incorporates the prior information on the linkage disequilibrium structure. The Markov-chain prior models the structural dependencies between a pair of adjacent SNPs, and allows us to look for association SNPs in a coupled manner, combining strength from multiple nearby SNPs. Our results on HapMap-simulated datasets and mouse datasets show that there is a significant advantage in incorporating the prior knowledge on linkage disequilibrium structure for marker identification under whole-genome association.
Seyoung Kim; Eric P Xing
Related Documents :
21138759 - No association between histamine n-methyltransferase functional polymorphism thr105ile ...
21080949 - Common genetic variation in the estrogen receptor beta (esr2) gene and osteoarthritis: ...
16392639 - Matrix gamma-carboxyglutamic acid protein (mgp) g-7a and t-138c gene polymorphisms in i...
21475449 - Association of cytokine gene polymorphism with susceptibility and clinical types of lep...
21178099 - Spp1 genotype is a determinant of disease severity in duchenne muscular dystrophy.
9686479 - Two ethnic-specific polymorphisms in the human beta pseudogene of hemoglobin.
10430599 - Estimation of pairwise relatedness with molecular markers.
2575899 - Novel restriction fragment length polymorphism of the growth hormone gene in inbred rats.
23559409 - Identification of 99 novel mutations in a worldwide cohort of 1,056 patients with a nep...
Publication Detail:
Type:  JOURNAL ARTICLE     Date:  2011-5-6
Journal Detail:
Title:  Journal of computational biology : a journal of computational molecular cell biology     Volume:  -     ISSN:  1557-8666     ISO Abbreviation:  -     Publication Date:  2011 May 
Date Detail:
Created Date:  2011-5-9     Completed Date:  -     Revised Date:  -    
Medline Journal Info:
Nlm Unique ID:  9433358     Medline TA:  J Comput Biol     Country:  -    
Other Details:
Languages:  ENG     Pagination:  -     Citation Subset:  -    
School of Computer Science, Carnegie Mellon University , Pittsburgh, Pennsylvania.
Export Citation:
APA/MLA Format     Download EndNote     Download BibTex
MeSH Terms

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine

Previous Document:  Counting RNA pseudoknotted structures.
Next Document:  Gene Expression Complex Networks: Synthesis, Identification, and Analysis.