GEL: a novel genotype calling algorithm using empirical likelihood.
MedLine Citation:
PMID:  16809396     Owner:  NLM     Status:  MEDLINE    
Abstract/OtherAbstract:
MOTIVATION: Preliminary results on data produced with the Affymetrix large-scale genotyping platforms show that improved genotype calling algorithms are needed. There is evidence that some of the existing algorithms lead to an increased error rate in heterozygous genotypes and to a disproportionately large rate of missing genotypes among heterozygotes. Non-random errors and missing data can increase the number of false discoveries in genetic association studies. Therefore, assessing the performance of an algorithm requires evaluating not only the missing data (call) and error rates, but also the heterozygous proportions in missing data and errors. RESULTS: We introduce a novel genotype calling algorithm (GEL) for the Affymetrix GeneChip arrays. The algorithm uses likelihood calculations that are based on distributions inferred from the observed data. A key ingredient in accurate genotype calling is weighting the information that comes from each probe quartet according to the quality/reliability of the data in the quartet and to prior information on the performance of the quartet. AVAILABILITY: The GEL software is implemented in R and is available by request from the corresponding author at nicolae@galton.uchicago.edu.
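Illustrative sketch (not part of the MEDLINE record, and not the GEL implementation, which is written in R): the abstract describes likelihood-based calling in which each probe quartet is weighted by its reliability. The Python below shows one generic way such a weighted likelihood call could look. Everything in it is an assumption made for illustration: the one-number summary per quartet, the Gaussian form of the per-genotype distributions, the parameter values, the weights, the `min_posterior` threshold, and the function names.

```python
import math

GENOTYPES = ("AA", "AB", "BB")


def normal_logpdf(x, mean, sd):
    """Log density of a normal distribution (assumed form, for illustration)."""
    return -0.5 * math.log(2 * math.pi * sd * sd) - (x - mean) ** 2 / (2 * sd * sd)


def call_genotype(quartet_values, quartet_weights, params, min_posterior=0.9):
    """Call a genotype from per-quartet summaries using a weighted log-likelihood.

    quartet_values  -- one hypothetical summary statistic per probe quartet
    quartet_weights -- one reliability weight per quartet (larger = more trusted)
    params          -- {genotype: (mean, sd)}, assumed to be estimated from data
    Returns (call, posterior); call is None when no genotype is confident enough.
    """
    # Weighted log-likelihood: unreliable quartets contribute less evidence.
    loglik = {}
    for g in GENOTYPES:
        mean, sd = params[g]
        loglik[g] = sum(
            w * normal_logpdf(x, mean, sd)
            for x, w in zip(quartet_values, quartet_weights)
        )

    # Normalize to posterior-like scores under a flat prior over genotypes,
    # subtracting the maximum first for numerical stability.
    peak = max(loglik.values())
    unnorm = {g: math.exp(v - peak) for g, v in loglik.items()}
    total = sum(unnorm.values())
    posterior = {g: u / total for g, u in unnorm.items()}

    best = max(posterior, key=posterior.get)
    call = best if posterior[best] >= min_posterior else None  # None = no call
    return call, posterior


# Hypothetical parameters: well-separated clusters for the three genotypes.
example_params = {"AA": (1.0, 0.3), "AB": (0.0, 0.3), "BB": (-1.0, 0.3)}

# Three quartets agree on AA; the third quartet is an outlier with low weight.
confident_call, confident_post = call_genotype(
    [0.9, 1.1, 0.2, 1.0], [1.0, 1.0, 0.1, 1.0], example_params
)

# A single quartet halfway between AA and AB cannot be called confidently.
ambiguous_call, ambiguous_post = call_genotype([0.5], [1.0], example_params)
```

In this sketch a missing call (`None`) arises when no genotype reaches the threshold, which is the kind of outcome whose heterozygous proportion the abstract argues should be tracked alongside error rates.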
Authors:
Dan L Nicolae; Xiaolin Wu; Kazuaki Miyake; Nancy J Cox
Publication Detail:
Type:  Journal Article; Research Support, N.I.H., Extramural; Research Support, Non-U.S. Gov't     Date:  2006-06-29
Journal Detail:
Title:  Bioinformatics (Oxford, England)     Volume:  22     ISSN:  1367-4811     ISO Abbreviation:  Bioinformatics     Publication Date:  2006 Aug 
Date Detail:
Created Date:  2006-08-10     Completed Date:  2006-09-28     Revised Date:  2009-11-04    
Medline Journal Info:
Nlm Unique ID:  9808944     Medline TA:  Bioinformatics     Country:  England    
Other Details:
Languages:  eng     Pagination:  1942-7     Citation Subset:  IM    
Affiliation:
Department of Statistics, The University of Chicago. nicolae@galton.uchicago.edu
MeSH Terms
Descriptor/Qualifier:
Algorithms
Computational Biology / methods*
Genotype*
Likelihood Functions
Oligonucleotide Array Sequence Analysis
Programming Languages
Reproducibility of Results
Software
Grant Support
ID/Acronym/Agency:
DK-20595/DK/NIDDK NIH HHS; DK-47486/DK/NIDDK NIH HHS; DK-55889/DK/NIDDK NIH HHS

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine