|Genome-wide association study of intracranial aneurysm identifies three new risk loci.|
|PMID: 20364137 Owner: NLM Status: MEDLINE|
|Saccular intracranial aneurysms are balloon-like dilations of the intracranial arterial wall; their hemorrhage commonly results in severe neurologic impairment and death. We report a second genome-wide association study with discovery and replication cohorts from Europe and Japan comprising 5,891 cases and 14,181 controls with approximately 832,000 genotyped and imputed SNPs across discovery cohorts. We identified three new loci showing strong evidence for association with intracranial aneurysms in the combined dataset, including intervals near RBBP8 on 18q11.2 (odds ratio (OR) = 1.22, P = 1.1×10^-12), STARD13-KL on 13q13.1 (OR = 1.20, P = 2.5×10^-9) and a gene-rich region on 10q24.32 (OR = 1.29, P = 1.2×10^-9). We also confirmed prior associations near SOX17 (8q11.23-q12.1; OR = 1.28, P = 1.3×10^-12) and CDKN2A-CDKN2B (9p21.3; OR = 1.31, P = 1.5×10^-22). It is noteworthy that several putative risk genes play a role in cell-cycle progression, potentially affecting the proliferation and senescence of progenitor-cell populations that are responsible for vascular formation and repair.|
|Katsuhito Yasuno; Kaya Bilguvar; Philippe Bijlenga; Siew-Kee Low; Boris Krischek; Georg Auburger; Matthias Simon; Dietmar Krex; Zulfikar Arlier; Nikhil Nayak; Ynte M Ruigrok; Mika Niemelä; Atsushi Tajima; Mikael von und zu Fraunberg; Tamás Dóczi; Florentina Wirjatijasa; Akira Hata; Jordi Blasco; Agi Oszvald; Hidetoshi Kasuya; Gulam Zilani; Beate Schoch; Pankaj Singh; Carsten Stüer; Roelof Risselada; Jürgen Beck; Teresa Sola; Filomena Ricciardi; Arpo Aromaa; Thomas Illig; Stefan Schreiber; Cornelia M van Duijn; Leonard H van den Berg; Claire Perret; Carole Proust; Constantin Roder; Ali K Ozturk; Emília Gaál; Daniela Berg; Christof Geisen; Christoph M Friedrich; Paul Summers; Alejandro F Frangi; Matthew W State; H Erich Wichmann; Monique M B Breteler; Cisca Wijmenga; Shrikant Mane; Leena Peltonen; Vivas Elio; Miriam C J M Sturkenboom; Patricia Lawford; James Byrne; Juan Macho; Erol I Sandalcioglu; Bernhard Meyer; Andreas Raabe; Helmuth Steinmetz; Daniel Rüfenacht; Juha E Jääskeläinen; Juha Hernesniemi; Gabriel J E Rinkel; Hitoshi Zembutsu; Ituro Inoue; Aarno Palotie; François Cambien; Yusuke Nakamura; Richard P Lifton; Murat Günel|
|Type: Journal Article; Research Support, N.I.H., Extramural; Research Support, Non-U.S. Gov't Date: 2010-04-04|
|Title: Nature genetics Volume: 42 ISSN: 1546-1718 ISO Abbreviation: Nat. Genet. Publication Date: 2010 May|
|Created Date: 2010-04-29 Completed Date: 2010-05-27 Revised Date: 2014-09-24|
Medline Journal Info:
|Nlm Unique ID: 9216904 Medline TA: Nat Genet Country: United States|
|Languages: eng Pagination: 420-5 Citation Subset: IM|
Genome-Wide Association Study*
Hemorrhage / genetics
Intracranial Aneurysm / genetics*
Polymorphism, Single Nucleotide
|089061//Wellcome Trust; 089062//Wellcome Trust; R01 NS 057756/NS/NINDS NIH HHS; R01 NS057756-03/NS/NINDS NIH HHS; RR19895/RR/NCRR NIH HHS; U24 NS 051869/NS/NINDS NIH HHS; U24 NS051869/NS/NINDS NIH HHS; U24 NS051869-05/NS/NINDS NIH HHS; UL1 RR024139/RR/NCRR NIH HHS; //Howard Hughes Medical Institute; //Howard Hughes Medical Institute|
Journal ID (nlm-journal-id): 9216904
Journal ID (pubmed-jr-id): 2419
Journal ID (nlm-ta): Nat Genet
License: Users may view, print, copy, download, and text- and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use: http://www.nature.com/authors/editorial_policies/license.html#terms
nihms-submitted publication date: Day: 16 Month: 3 Year: 2010
Electronic publication date: Day: 4 Month: 4 Year: 2010
Print publication date: Month: 5 Year: 2010
pmc-release publication date: Day: 1 Month: 11 Year: 2010
Volume: 42 Issue: 5
First Page: 420 Last Page: 425
PubMed Id: 20364137
U24 NS051869-05 ||NS
R01 NS057756-03 ||NS
National Institute of Neurological Disorders and Stroke : NINDS
Howard Hughes Medical Institute
|Genome-wide association study of intracranial aneurysm identifies three new risk loci|
|Siew Kee Low 4|
|Ynte M Ruigrok 9|
|Mikael von und zu Fraunberg 12|
|Cornelia M van Duijn 27|
|Leonard H van den Berg 9|
|Ali K Ozturk 1,2|
|Christoph M Friedrich 31|
|Alejandro F Frangi 32|
|Matthew W State 2,33|
|Monique M B Breteler 27|
|Miriam CJM Sturkenboom 22|
|Erol I Sandalcioglu 19|
|Juha E Jääskeläinen 12|
|Gabriel J E Rinkel 9|
|Richard P Lifton 2,39 *|
1 Departments of Neurosurgery and Neurobiology, Yale University School of Medicine, New Haven, Connecticut 06510, USA
2 Department of Genetics, Yale Program on Neurogenetics, Yale Center for Human Genetics and Genomics, Yale University School of Medicine, New Haven, Connecticut 06510, USA
3 Service de Neurochirurgie, Department of Clinical Neurosciences, Geneva University Hospital, 1211 Geneva 4, Switzerland
4 Human Genome Center, Institute of Medical Science, University of Tokyo, Tokyo, Japan
5 Department of Neurosurgery, University of Tuebingen, Germany
6 Department of Neurology, Goethe University, Frankfurt am Main, Germany
7 Department of Neurosurgery, University of Bonn, Bonn, Germany
8 Klinik und Poliklinik für Neurochirurgie Universitätsklinikum Carl Gustav Carus der Technischen Universität Dresden Fetscherstraße 74 01307 Dresden, Germany
9 Department of Neurology, Rudolf Magnus Institute of Neuroscience, University Medical Center Utrecht, 3584 CX Utrecht, The Netherlands
10 Department of Neurosurgery, Helsinki University Central Hospital, Helsinki, P.O. Box 266, FI-00029 HUS, Finland
11 Division of Molecular Life Science, School of Medicine, Tokai University, Shimokasuya 143, Isehara, Kanagawa 259-1193, Japan
12 Department of Neurosurgery, Kuopio University Hospital, Kuopio FI-70211, Finland
13 Neurosurgery, University of Pécs Medical School, Pécs, Hungary
14 Department of Public Health, School of Medicine, Chiba University, Chiba 260-8670, Japan
15 Department of Vascular Radiology, Hospital Clinic, Barcelona, Spain
16 Department of Neurosurgery, Goethe University, Frankfurt am Main, Germany
17 Department of Neurosurgery, Medical Center East, Tokyo Women's University, Tokyo 116-8567, Japan
18 Nuffield Department of Surgery, John Radcliffe Hospital, University of Oxford, Oxford, UK
19 Department of Neurosurgery, University Hospital, Essen, Germany
20 Departments of Medical Physics and Neurosurgery, Royal Hallamshire Hospital, Sheffield, UK
21 Department of Neurosurgery, Technical University of Munich, Germany
22 Department of Medical Informatics, Erasmus University Medical Center, 3000CA Rotterdam, The Netherlands
23 Therapeutic Neuroangiography, Hospital General de Catalunya, San Cugat del Valles, Spain
24 Department of Health and Functional Capacity, National Public Health Institute, Helsinki, Finland
25 Institute of Epidemiology, German Research Center for Environmental Health, Helmholtz Zentrum München, Munich, Germany
26 Institute for Clinical Molecular Biology, Christian-Albrechts-University, Kiel, Germany
27 Genetic Epidemiology Unit, Department of Epidemiology and Biostatistics and Department of Clinical Genetics, Erasmus Medical Center, 2040, 3000 CA Rotterdam, The Netherlands
28 UMR INSERM S937 - University Pierre and Marie Curie, Paris 06, France
29 Center of Neurology, Department of Neurodegeneration and Hertie Institute for Clinical Brain Research, University of Tuebingen, Germany
30 Institute of Transfusion Medicine and Immunohaematology, Department of Molecular Haemostasis, DRK Blood Donor Service Baden Wuerttemberg and Hessen, Frankfurt am Main, Germany
31 Fraunhofer-Institut for Algorithms and Scientific Computing, 53754 Sankt Augustin, Germany
32 Center for Computational Imaging & Simulation Technologies in Biomedicine, Universitat Pompeu Fabra, Barcelona, Spain
33 Department of Psychiatry and Child Study Center, Yale University School of Medicine, New Haven, Connecticut 06510, USA
34 Department of Genetics, University Medical Center Groningen and University of Groningen, 9700 RR Groningen, The Netherlands
35 Keck Foundation Biotechnology Resource Laboratory, Yale University, 300 George Street, New Haven, Connecticut 06510, USA
36 Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1HH, UK
37 Academic Unit of Medical Physics, School of Medicine and Biomedical Sciences, University of Sheffield, Sheffield, UK
38 Neuroradiology - SNI - Clinic Hirslanden, 8032 Zürich, Switzerland
39 Howard Hughes Medical Institute and Department of Internal Medicine, Yale University School of Medicine, New Haven, Connecticut 06510, USA
|* Correspondence should be addressed to R.P.L. (email@example.com) or M.G. (firstname.lastname@example.org)
Intracranial aneurysm (IA) affects approximately 2% of the general population and arises from the action of multiple genetic and environmental risk factors [1]. We previously reported the first genome-wide association study (GWAS) of IA [2], which identified three IA risk loci on chromosomes 8q11.23-q12.1, 9p21.3 and 2q33.1 with P < 5×10^-8. This previous study had limited power to detect loci imparting genotypic relative risk (GRR) < 1.35 (Supplementary Table 1).
To increase the power to detect additional loci of similar or smaller effect, we ascertained and whole-genome genotyped 2 new European case cohorts (n = 1,616) and collected genotyping data from 5 additional European control cohorts (Supplementary Note, n = 11,955). We also increased the size of the original Japanese replication cohort and added a new Japanese replication cohort (2,282 cases and 905 controls) (Table 1). The new combined cohort has nearly 3-fold more cases than the original cohort and increased our power to detect variants with modest effect sizes. For example, this study had 89% and 64% average power to detect common variants (minor allele frequencies ≥ 10%) with GRR of 1.25 and 1.20, respectively (Supplementary Table 1).
All subjects were genotyped using the Illumina platform. The new as well as the previously analyzed genotyping data were subjected to well-established quality control (QC) measures (Supplementary Table 2). We sought to eliminate potential confounding due to population stratification and gender [1,3] by matching cases and controls of the same gender based on inferred genetic ancestry. As previous studies [4,5] demonstrated that the Finnish population forms an ancestry cluster distinct from other European populations like those included in this study, we analyzed our Finnish cohort independently from the others. To maximize opportunities for genetic matching and analytic power, we analyzed all subjects in the remaining European cohorts together. The resulting matched case-control data consisted of 808 cases and 4,393 controls in the Finnish (FI) cohort and 1,972 cases and 8,122 controls in the rest of the combined European (CE) cohort (Supplementary Table 3). We used the QC-passed genotype data and phased chromosomes from the HapMap CEU sample to impute missing genotypes [6]. We based our further analyses on 831,534 SNPs that passed the QC filters both in the FI and CE samples (Table 1 and Supplementary Table 2).
We tested for association of each QC-passed SNP with IA using conditional logistic regression, assuming a log-additive effect of allele dosage. We corrected each cohort for residual overdispersion (Table 1) using genomic control [7], and combined the results from FI and CE to obtain P-values, odds ratios (ORs) and confidence intervals (CIs) for the discovery cohort of 2,780 cases and 12,515 controls using a fixed-effects model.
To evaluate the strength of association, in addition to using P-values, we employed a Bayesian approach [8]. We used the Bayes factor (BF), which represents the fold-change of the odds of association before and after observing the data [9], and the posterior probability of association (PPA), calculated through the BF, which provides a simple probabilistic measure of the evidence of association [8,10]. For every SNP, we assumed a uniform prior probability of association of 1/10,000 and set the prior of the logarithm of per-allele OR as a normal distribution with a 95% probability of the OR to be between 0.67 and 1.5, with larger weights for smaller effect sizes [9,11].
From the discovery results, we eliminated 2 imputed SNPs that showed PPAs of 0.97 and 0.94 as their association signals were not supported by surrounding genotyped SNPs and their genotypes were not confirmed by direct genotyping results (data not shown). This resulted in 831,532 QC-passed SNPs (Supplementary Table 2).
We observed 3 regions that showed very high PPA (> 0.995; Fig. 1a) and also a substantial excess of SNPs with P < 1×10^-3 (1,295 SNPs versus 831 SNPs expected by chance) even after excluding those within previously identified associated regions [2] (Fig. 1b). Moreover, we observed a strong correlation between the P-values and BFs for the upper tail of the distribution (Fig. 1c).
We focused on 5 genomic regions (Fig. 1a) that contained at least one SNP with PPA > 0.5, for which the hypothesis of association with IA is more likely than the null hypothesis of no association. The PPAs and P-values of the most highly associated SNPs in these intervals ranged from 0.6621 to > 0.9999 and 7.9×10^-7 to 2.2×10^-16, respectively (Supplementary Table 4). The 5 chromosomal segments included 3 newly identified SNP clusters at 10q24.32, 13q13.1 and 18q11.2. The remaining 2 regions were previously identified loci at 8q11.23-q12.1 and 9p21.3 [2] (Fig. 2). The third locus identified in our previous study at 2q33 did not contain any SNPs with PPA > 0.5. Furthermore, consistent with our previous results [2], detailed analysis of the 8q11.23-q12.1 region detected two independent association signals within a < 100 kb interval that spans the SOX17 locus (Fig. 2 and Supplementary Fig. 1); hereafter these two signals are referred to as 5′-SOX17 and 3′-SOX17. Thus the 5 chromosomal segments comprised 6 independent association signals for follow-up.
We performed replication genotyping in 2 Japanese cohorts including 3,111 cases and 1,666 controls (JP1 and JP2, see Table 1). For each independent signal, we selected for replication the genotyped SNP with the highest PPA, and added up to 2 additional SNPs per locus. For the 5′-SOX17 region, we selected 2 SNPs analyzed previously, as they tag the best SNP in the current study (Supplementary Fig. 1).
All but one of the SNPs (rs12411886 on 10q24.32 in JP1) were successfully genotyped and passed QC filters. We tested for association of each SNP with IA using logistic regression stratified by gender, specifying the same model as for the discovery cohort (Supplementary Table 5). We combined results from JP1 and JP2 using a fixed-effects model (Table 2 and Supplementary Table 4). We considered an association to be replicated if the BF increased the odds of association > 10-fold after observing the replication data.
Of the 6 candidate loci, all but the 5′-SOX17 interval were replicated, with replication P-values ranging from 0.0019 to 1.0×10^-7 and the odds of association with IA increasing by 22.9- to 1.5×10^5-fold, yielding robust evidence for replication for each interval (Table 2).
We combined the discovery and replication results using a fixed-effects model. All of the 5 loci that replicated in the Japanese cohort surpassed the conventional threshold for genome-wide significance (P < 5×10^-8), with P-values ranging from 2.5×10^-9 to 1.5×10^-22, and all also had PPAs ≥ 0.998 (Table 2).
To determine each cohort's contribution to the observed association and to assess the consistency of the effect size across cohorts, we analyzed each ascertained cohort separately (Table 1 and Supplementary Table 5) and then combined the results from the 6 cohorts using a random-effects model. The association results remained highly significant (Fig. 3). For the 5 loci that were replicated in the Japanese cohorts, we found no evidence of significant heterogeneity across cohorts (P > 0.1). Every cohort had the same risk allele and provided support for association, with the exception of the JP1 cohort for the 3′-SOX17 locus, consistent with our previous study [2] (Fig. 3).
The most significant association was detected in the previously reported [2] 9p21.3 region near CDKN2A and CDKN2B with P = 1.5×10^-22 (OR = 1.32; PPA > 0.9999). All of the newly studied cohorts strongly supported this association with IA (Fig. 3). These same alleles are also associated with coronary artery disease, but not with type 2 diabetes [12]. Similarly, the previously reported 8q11.23-q12.1 region showed significant association. The 3′-SOX17 interval (rs9298506) showed robust association with P = 1.3×10^-12 (OR = 1.28; PPA > 0.9999) and all new cohorts supported association of this SNP with IA (Fig. 3). For the 5′-SOX17 region (rs10958409), the new cohorts introduced substantial heterogeneity across cohorts, lowering the PPA to 0.016 (Fig. 3).
Among the newly identified loci, the strongest association was found at rs11661542 on 18q11.2 (OR = 1.22; P = 1.1×10^-12; PPA > 0.9999). A cluster of SNPs associated with IA spans the interval between 18.400 Mb and 18.509 Mb and is strongly correlated with rs11661542 (Fig. 2). A single gene, RBBP8 (retinoblastoma binding protein 8), is located within an extended linkage disequilibrium (LD) interval (Fig. 2).
The second strongest new association was at rs12413409 on 10q24.32 (OR = 1.29; P = 1.2×10^-9; PPA = 0.9990), which maps to intron 1 of CNNM2 (cyclin M2) (Fig. 2). A cluster of SNPs strongly correlated with rs12413409 and located within a ∼247 kb interval in the same LD block supported the association (Fig. 2).
The third new locus is defined by rs9315204 at 13q13.1 (OR = 1.20; P = 2.5×10^-9; PPA = 0.9981) in intron 7 of STARD13 (StAR-related lipid transfer (START) domain containing 13) (Fig. 2). Two SNPs, rs1980781 and rs3742321, that are strongly correlated with rs9315204 (r^2 > 0.9) also showed significant association with IA (Fig. 2 and Supplementary Table 4). These two SNPs are missense (lysine to arginine) and synonymous coding variants of STARD13, respectively. Another gene that has been implicated in aging phenotypes, KL (klotho), is located nearby [13].
A search of the gene-expression database (eQTL browser, http://eqtl.uchicago.edu/) for all the IA-risk loci did not reveal any consistent pattern of association of IA SNPs with variation in gene expression levels.
In this second GWAS of IA, which included nearly 3 times as many cases as the initial study, we detected 3 novel risk loci and obtained strong independent evidence for association of 2 previously identified loci. The evidence that these are bona fide risk loci for IA is very strong from both Bayesian measures and conventional P-values.
Given our power (∼90%) to detect variants that confer risk of IA with GRR = 1.25 and MAFs ≥ 10%, we expect that we have identified most of these variants, limited principally by potential gaps in SNP coverage. Indeed, across the rest of the genome, there was no locus with PPA > 0.22 and MAF ≥ 10%, while there were 14 loci with PPAs between 0.1 and 0.22 and ORs between 1.16 and 1.25 (data not shown). We expect that a fraction of these loci are genuine IA risk loci, as suggested by the excess of SNPs with P < 1×10^-3 (Fig. 1b); exploring this possibility will require analysis of still larger IA cohorts and/or genotyping of alleles with lower MAF.
Based on the results of the first GWAS of IA and the role of the implicated gene products, Sox17 and p15INK4b/p16INK4a, we previously hypothesized [2] that the IA genes implicated might play a role in determining cell cycle progression, affecting proliferation [14] and senescence of progenitor cell populations and/or the balance between production of progenitor cells versus cells committed to differentiation. Genes located within the newly identified regions support this idea. RBBP8, located within the 18q11.2 region, influences progression through the cell cycle by interacting with BRCA1 [15]. Similarly, of the two genes located within the 13q13.1 interval, STARD13 contains Rho-GAP and C-terminal STAR-related lipid transfer (START) domains and its overexpression results in suppression of cell proliferation [16]. The other gene, KL, encodes a transmembrane protein that modulates FGF receptor specificity [17]; KL-deficient mice display accelerated aging in diverse organ systems [13].
On the assumption that there is a four-fold increase in the risk of IA among siblings of cases [18,19] and that the SNPs combine to increase log-odds of disease in an additive fashion, the 5 IA risk loci explain 5.2% (FI), 4.0% (CE) and 3.5% (combined JP1 and JP2) of the familial risk of IA. Under this model, the odds of developing IA vary 4.99- to 7.63-fold between the top and bottom 1% of the genetic risk profile at these loci in these populations and 3.61- to 4.64-fold between the 5% extremes (Supplementary Fig. 2). When combined with traditional risk factors such as gender, blood pressure and smoking, these findings form the basis of future work aimed at pre-clinical identification of individuals who are at high risk of IA formation and rupture.
Whole-genome genotyping for the discovery cohort was performed on the Illumina platform according to the manufacturer's protocol (Illumina, San Diego, CA, USA). Beadchips used for individual cohorts are presented in Supplementary Table 2. Replication genotyping in the JP1 cohort was performed using either Taqman (Applied Biosystems) or MassARRAY (Sequenom) assays. For the JP2 cohort, genotyping for cases was performed using the multiplex PCR-based Invader assay (Third Wave Technologies Inc.); genotyping for controls was performed on the Illumina platform as described previously [20].
The study protocol was approved by the Yale Human Investigation Committee (HIC protocol #7680). Institutional review board approval for genetic studies, along with written consent from all study participants, was obtained at all participating institutions.
Prior to the analysis of genotyping data, we excluded SNPs that were located either on mtDNA or sex chromosomes; with A/T or C/G alleles; for which all subjects were assigned as ‘no call’; and assayed on Hap300v1 or 550v1 but dropped from newer versions.
We excluded subjects in the discovery cohort that did not conform to our study design on the basis of genotyping and information quality, cryptic relatedness and population outliers. We summarized the sample exclusion steps in Supplementary Table 2. This filtering process resulted in 835 cases and 6,529 controls in the Finnish (FI) cohort and 2,000 cases and 8,722 controls in the rest of the combined European (CE) cohort.
We performed imputation analysis with the HapMap phase II CEU reference panel (release 24) using the IMPUTE v1 software [6]. The analysis was performed separately for the FI and CE cohorts. We converted posterior probabilities of the three possible genotypes to fractional allele dosage scores (between 0 and 2) and used these scores for association tests in order to take the imputation uncertainty into account [23]. For the quality assessment of imputed SNPs, we also converted the posterior probabilities to the most likely genotypes with the threshold at 0.9.
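The conversion from genotype posterior probabilities to dosage scores and to most likely genotypes can be sketched as follows. This is a minimal illustration, not the authors' code: the function names are hypothetical, the probabilities are assumed to be ordered as (0, 1, 2) copies of the coded allele, and treating the 0.9 threshold as inclusive is an assumption.

```python
from typing import Optional, Sequence


def allele_dosage(probs: Sequence[float]) -> float:
    """Expected number of coded alleles (0 to 2) from P(0), P(1), P(2) copies."""
    _, p_het, p_hom = probs
    return p_het + 2.0 * p_hom


def best_guess_genotype(probs: Sequence[float], threshold: float = 0.9) -> Optional[int]:
    """Most likely genotype (0, 1 or 2 copies), or None if its probability is below threshold.

    Whether the cut-off is inclusive is an assumption of this sketch.
    """
    best = max(range(3), key=lambda g: probs[g])
    return best if probs[best] >= threshold else None


if __name__ == "__main__":
    confident = (0.02, 0.95, 0.03)   # hypothetical posterior probabilities
    uncertain = (0.10, 0.60, 0.30)
    for probs in (confident, uncertain):
        print(allele_dosage(probs), best_guess_genotype(probs))
```

The dosage keeps the imputation uncertainty (the second example still contributes a fractional score), whereas the thresholded call discards it and is therefore more suitable for quality checks than for association testing.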
Population stratification and independent genotyping of cases and controls are major causes of confounding in genome-wide association studies [24]. Because our study consisted of multiple independently ascertained cohorts that were genotyped separately, we performed a stringent analysis to control for these biases by inferring genetic ancestries of subjects [25,26]. We used Laplacian eigenmaps [27] to infer population structure. Following the determination of the number of dimensions (K + 1) using the threshold given in Lee et al. [28], we used the K-dimensional non-trivial generalized eigenvectors [29] to calculate the Euclidean distance between two subjects.
In the course of this analysis, we excluded “isolated” subjects who were identified by using the nearest-neighbor distance distributions in any of the 2-dimensional sections. After excluding these subjects, we observed 13 and 5 dimensions in FI and CE, respectively. The larger number of dimensions observed in FI could be attributable to the presence of many isolated populations in Finland [5].
Before matching, we stratified data into males and females because female gender is a known risk factor of IA [1,3]. We also set the maximum distance between cases and controls to match to be less than 0.028 and 0.009 in FI and CE cohorts, respectively. These values were determined by examining the distribution of the nearest-neighbor distances in K-dimensions (data not shown). We matched cases and controls using the fullmatch function in the R-package optmatch [30,31].
For both genotyped and imputed SNPs in the discovery cohort, we applied QC filters to individual cohorts and to cases and controls separately, on the basis of the missing rate, minor allele frequency (MAF) and the P-value of the exact test of Hardy-Weinberg equilibrium (HWE) [32]. For imputed SNPs, we also assessed imputation quality using the average posterior probability, MAF and allelic R^2 metric [33]. Finally, we assessed differential missingness between cases and controls (Supplementary Table 2).
Any genotyped SNP that passed the QC filters both in the CE and FI cohorts is referred to as a “genotyped SNP” while one for which we used the QC-passed imputation data either in one or both of the cohorts is classified as an “imputed SNP”.
For genotyping data of the replication cohorts, we excluded SNPs if any of the following 3 conditions were met in either cases or controls: (i) missing rate > 0.05; (ii) P-value of the exact test of HWE < 0.001; or (iii) MAF < 0.01.
We tested for association between each QC-passed SNP and IA using conditional and unconditional logistic regression for the discovery and replication cohorts, respectively [34]. For the discovery cohort, we used the matched strata to correct for potential confounding due to population stratification and gender, while for the replication cohorts we adjusted for gender. We assumed a log-additive effect of allele dosage on disease risk. We obtained P-values from the score test (two-sided) and estimated the logarithm of per-allele odds ratios (ORs) with standard errors (SEs) by maximizing the (conditional or unconditional) likelihood. Both the test statistic and the SE of log-OR were corrected using genomic control [7]. We performed the association analysis for FI and CE, as well as sub-cohorts of CE that consisted of NL cases, DE cases or @neurIST cases and their matched controls (Table 1 and Supplementary Table 3). We used the following R-functions to perform the association analysis: clogit, glm and snp.rhs.tests [22].
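The genomic control step can be illustrated with a short sketch. It assumes 1-d.f. chi-square test statistics, estimates the inflation factor as the ratio of the observed median statistic to the null median, and floors it at 1 (a common convention that the text does not state). The function names are hypothetical and this is not the authors' R code.

```python
import math
from statistics import median

# Median of a chi-square distribution with 1 degree of freedom.
CHI2_1DF_MEDIAN = 0.4549364


def inflation_factor(chi2_stats):
    """Genomic-control lambda: observed median over the null median, floored at 1 (assumption)."""
    return max(1.0, median(chi2_stats) / CHI2_1DF_MEDIAN)


def gc_correct(chi2, se, lam):
    """Divide the test statistic by lambda and scale the SE of the log-OR by sqrt(lambda)."""
    return chi2 / lam, se * math.sqrt(lam)


if __name__ == "__main__":
    stats = [0.1, 0.5, 1.0, 0.3, 2.2]        # hypothetical genome-wide statistics
    lam = inflation_factor(stats)
    print(lam, gc_correct(chi2=12.0, se=0.05, lam=lam))
```

Scaling the standard error by the square root of lambda keeps the corrected Wald statistic consistent with the corrected chi-square statistic, so downstream meta-analysis weights reflect the same deflation.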
We combined the cohort-wise per-allele ORs in FI and CE using a fixed-effects model of meta-analysis for 831,534 QC-passed SNPs to obtain the discovery results. For SNPs analyzed both in the discovery and replication cohorts, we combined JP1 and JP2 to obtain replication results and all 4 cohorts to obtain combined results. Our primary analysis was based on the fixed-effects model [23]. In order to assess the heterogeneity of the effect size between cohorts, we first divided CE into 3 cohorts as described above, aiming to analyze data without averaging effect sizes over the combined European cohorts, and then combined 6 cohorts using the random-effects model. We employed the restricted maximum likelihood procedure to estimate the between-cohort heterogeneity variance (τ^2) using the R-function MiMa [35] (http://www.wvbauer.com/). From this estimate, we calculated Cochran's Q statistic and the I^2 statistic [36].
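As an illustration of the fixed-effects combination and the heterogeneity statistics, the sketch below pools per-cohort log-ORs by inverse-variance weighting and computes Cochran's Q and I^2 from the fixed-effects weights. It does not reproduce the restricted maximum likelihood estimate of τ^2 obtained with MiMa; the function names and input values are hypothetical.

```python
import math


def fixed_effects(betas, ses):
    """Inverse-variance weighted pooled log-OR and its standard error."""
    weights = [1.0 / se ** 2 for se in ses]
    total = sum(weights)
    pooled = sum(w * b for w, b in zip(weights, betas)) / total
    return pooled, math.sqrt(1.0 / total)


def heterogeneity(betas, ses):
    """Cochran's Q and I^2 (as a fraction, floored at 0) around the fixed-effects estimate."""
    pooled, _ = fixed_effects(betas, ses)
    q = sum(((b - pooled) / se) ** 2 for b, se in zip(betas, ses))
    df = len(betas) - 1
    i2 = max(0.0, (q - df) / q) if q > 0 else 0.0
    return q, i2


if __name__ == "__main__":
    # Hypothetical per-cohort log-ORs and standard errors (not values from the study).
    betas = [0.22, 0.18, 0.25, 0.20]
    ses = [0.06, 0.04, 0.07, 0.05]
    beta, se = fixed_effects(betas, ses)
    z = beta / se
    p_two_sided = math.erfc(abs(z) / math.sqrt(2.0))
    print(math.exp(beta), p_two_sided, heterogeneity(betas, ses))
```

Cohorts with smaller standard errors receive larger weights, so the pooled OR is dominated by the largest cohorts; Q and I^2 then indicate whether the per-cohort estimates scatter more than sampling error alone would predict.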
To evaluate the strength of association, we employed a Bayesian approach [9,37]. A limitation of the use of P-values alone is that variability in factors such as effect size, MAF and sample size can result in identical statistics that might correspond to markedly different levels of evidence regarding the strength of association [10]. The Bayes factor (BF) provides an alternative that compares the probabilities of the data under the alternative hypothesis versus the null hypothesis. For computational simplicity, we approximated the BF as described by Wakefield [8]. For all SNPs, we assumed a single prior for the log-OR: a normal distribution with mean 0 and standard deviation log(1.5)/Φ^-1(0.975), where Φ is the standard normal distribution function [9].
The posterior probability of association [10] (PPA) provides a simple probabilistic measure of evidence by introducing the prior probability of association, π1. We assumed a uniform prior, π1 = 1/10,000, for all the SNPs [11]. For BF > 10^6, changing π1 to a more conservative value of 1/100,000 would result in little change in the posterior probability of association.
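A minimal sketch of these two quantities is given below. It uses the standard form of Wakefield's approximate Bayes factor, expressed here as evidence for the alternative over the null, with the prior standard deviation log(1.5)/Φ^-1(0.975) and π1 = 1/10,000 from the text. The function names are hypothetical, and the exact implementation used in the study may differ.

```python
import math
from statistics import NormalDist

# Prior SD of the log-OR: 95% prior probability that the OR lies between 1/1.5 and 1.5.
PRIOR_SD = math.log(1.5) / NormalDist().inv_cdf(0.975)


def approx_bayes_factor(log_or, se, prior_sd=PRIOR_SD):
    """Approximate Bayes factor (alternative vs. null) from a log-OR estimate and its SE."""
    v, w = se ** 2, prior_sd ** 2
    shrink = w / (v + w)
    z2 = (log_or / se) ** 2
    return math.sqrt(1.0 - shrink) * math.exp(0.5 * z2 * shrink)


def posterior_prob(bf, prior=1e-4):
    """Posterior probability of association from a Bayes factor and prior probability pi_1."""
    odds = bf * prior / (1.0 - prior)
    return odds / (1.0 + odds)


if __name__ == "__main__":
    # Hypothetical estimate: OR = 1.22 with SE(log-OR) = 0.028.
    bf = approx_bayes_factor(math.log(1.22), 0.028)
    print(bf, posterior_prob(bf), posterior_prob(bf, prior=1e-5))
```

Because the Bayes factor depends on the standard error as well as on the z-score, two SNPs with the same P-value can yield different posterior probabilities, which is the motivation given above for reporting both measures.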
To combine the results from multiple cohorts, we extended the formula [38] to be applicable to multiple (> 2) cohorts.
For each region that contained a SNP with PPA > 0.5, we examined the number of independent association signals by testing for association of every genotyped SNP with IA by adjusting for the effect of a specified SNP (Supplementary Fig. 1).
We tested for deviation from a linear model, which assumes that two SNPs combine to increase the log-odds of disease in an additive fashion, using conditional (FI and CE) or unconditional (JP: JP1 plus JP2, stratified by cohorts and gender) logistic regression. There was no significant deviation from the linear model (data not shown).
We evaluated potential clinical implications of the genetic profiles of the 5 IA risk loci following the approach described by Clayton [39]. We fitted a 5-locus conditional (FI and CE) or unconditional (JP) logistic regression model including the additive and dominance-deviation terms for each locus. Using the estimated effect sizes and each individual's genotypes, we calculated the risk scores for every individual. The receiver-operating characteristic (ROC) curve for each ethnic cohort (FI, CE and JP) was depicted using the risk score.
We also calculated the ratio of the exponential of the mean of the risk scores for control subjects within the top versus bottom 5 or 1% to obtain approximated odds ratios of disease between these classes.
The sibling recurrence risk was estimated by assuming the polygenic model that fits well to our data [39]. The fraction of the sibling recurrence risk attributable to all of the 5 loci was calculated by taking the ratio of the logarithm of this value to that of the epidemiologically estimated value of 4 [18,19].
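The ratio-of-logarithms calculation can be sketched as follows. This illustration substitutes a textbook single-locus formula for the sibling recurrence risk under a multiplicative model and treats each per-allele OR as a relative risk; both are assumptions of the sketch and not necessarily the polygenic model fitted in the study. The function names, allele frequencies and effect sizes are hypothetical.

```python
import math


def locus_lambda_sib(risk_allele_freq, per_allele_rr):
    """Sibling recurrence risk ratio for one biallelic locus under a multiplicative model."""
    p, r = risk_allele_freq, per_allele_rr
    q = 1.0 - p
    return (1.0 + p * q * (r - 1.0) ** 2 / (2.0 * (p * r + q) ** 2)) ** 2


def fraction_explained(loci, overall_lambda=4.0):
    """Share of log(overall sibling risk) attributable to independent, multiplicative loci."""
    log_lambda = sum(math.log(locus_lambda_sib(p, r)) for p, r in loci)
    return log_lambda / math.log(overall_lambda)


if __name__ == "__main__":
    # Hypothetical (risk allele frequency, per-allele relative risk) pairs.
    loci = [(0.45, 1.22), (0.15, 1.29), (0.20, 1.20), (0.80, 1.28), (0.55, 1.31)]
    print(fraction_explained(loci))
```

Working on the logarithmic scale makes the contributions of independent loci additive, which is why the attributable fraction is a ratio of logarithms and not a ratio of the recurrence risks themselves.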
Competing Financial Interest: The authors declare competing financial interests. The authors have a provisional patent application under consideration based on the findings of this work.
Author Contributions: Study Cohorts: ascertainment, characterization and DNA preparation: M.N., M.v.u.z.F., E.G., J.E.J., J.H. and A.P. (FI case-control); Y.M.R. and G.J.E.R. (NL cases); P.B., T.D., J.B., G.Z., P.S., R.R., S.T., C.M.F., P.S., A.F.F., V.E., M.C.J.M.S., P.L., J.B., J.M. and D.R. (@neurIST case series); B.K., G.A., M.S., D.K., F.W., A.O., B.S., C.S., J.B., F.R., C.R., D.B., C.G., E.I.S., B.M., A.R. and H.S. (DE case series); A.T., A.H., H.K. and I.I. (JP1); S.K.L., H.Z. and Y.N. (JP2). Control Cohorts: A.A., A.P. and L.P. (Health2000); A.A., A.P. and L.P. (NFBC1966); C.M.v.D. and M.M.B.B. (Rotterdam Study); L.H.v.d.B. and C.W. (Utrecht); T.I. and H.E.W. (KORA-gen); S.S. (PopGen). Genotyping: K.B., Z.A., N.N., A.K.O., E.G., S.M., R.P.L. and M.G. (Yale); P.C., P.C. and F.C. (Aneurist); S.K.L., H.Z. and Y.N. (JP2). Data management and informatics: K.Y., K.B., Z.A., N.N. and M.G. (Yale); S.K.L., H.Z. and Y.N. (JP2 cohort). Statistical analysis: K.Y. and M.G. Writing team: K.Y., K.B., M.W.S., R.P.L. and M.G. Study design and analysis plan: K.Y., R.P.L. and M.G.
We are grateful to the participants who made this study possible. We thank Andrea Chamberlain, Birgitt Meseck-Selchow and members of the Keck Foundation Biotechnology Resource Laboratory for their technical help. This study was supported by the Yale Center for Human Genetics and Genomics and the Yale Program on Neurogenetics, the US National Institutes of Health grants R01NS057756 (M.G.) and U24 NS051869 (S.M.) and the Howard Hughes Medical Institute (R.P.L.). The @neurIST project was funded by the European Commission, VI Framework Programme, Priority 2, Information Society Technologies, a European Public Funded Organization (Research Grant No. IST-FP6-027703). The Frankfurt case cohort collection was supported by BMBF (01GI9907), and the Utrecht control cohort by the Prinses Beatrix Fonds and the Adessium foundation (L.H.vdB.). S.M. was supported in part by the Clinical and Translational Science Award UL1 RR024139, National Center for Research Resources, NIH. We would also like to acknowledge the use of the Yale University Biomedical High Performance Computing Center (NIH grant: RR19895).
|1.||Rinkel GJ, Djibuti M, Algra A, van Gijn J. Prevalence and risk of rupture of intracranial aneurysms: a systematic review. Stroke 29:251–6, 1998. [pmid: 9445359]|
|2.||Bilguvar K, et al. Susceptibility loci for intracranial aneurysm in European and Japanese populations. Nat Genet 40:1472–7, 2008. [pmid: 18997786]|
|3.||Iwamoto H, et al. Prevalence of intracranial saccular aneurysms in a Japanese community based on a consecutive autopsy series during a 30-year observation period. The Hisayama study. Stroke 30:1390–5, 1999. [pmid: 10390312]|
|4.||Salmela E, et al. Genome-wide analysis of single nucleotide polymorphisms uncovers population structure in Northern Europe. PLoS One 3:e3519, 2008. [pmid: 18949038]|
|5.||Jakkula E, et al. The genome-wide patterns of variation expose significant substructure in a founder population. Am J Hum Genet 83:787–94, 2008. [pmid: 19061986]|
|6.||Marchini J, Howie B, Myers S, McVean G, Donnelly P. A new multipoint method for genome-wide association studies by imputation of genotypes. Nat Genet 39:906–13, 2007. [pmid: 17572673]|
|7.||Devlin B, Roeder K. Genomic control for association studies. Biometrics 55:997–1004, 1999. [pmid: 11315092]|
|8.||Wakefield J. A Bayesian measure of the probability of false discovery in genetic epidemiology studies. Am J Hum Genet 81:208–27, 2007. [pmid: 17668372]|
|9.||Wellcome Trust Case Control Consortium. Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature 447:661–78, 2007. [pmid: 17554300]|
|10.||Stephens M, Balding DJ. Bayesian statistical methods for genetic association studies. Nat Rev Genet 10:681–90, 2009. [pmid: 19763151]|
|11.||Wacholder S, Chanock S, Garcia-Closas M, El Ghormli L, Rothman N. Assessing the probability that a positive report is false: an approach for molecular epidemiology studies. J Natl Cancer Inst 96:434–42, 2004. [pmid: 15026468]|
|12.||Helgadottir A, et al. The same sequence variant on 9p21 associates with myocardial infarction, abdominal aortic aneurysm and intracranial aneurysm. Nat Genet 40:217–24, 2008. [pmid: 18176561]|
|13.||Kuro-o M, et al. Mutation of the mouse klotho gene leads to a syndrome resembling ageing. Nature 390:45–51, 1997. [pmid: 9363890]|
|14.||Visel A, et al. Targeted deletion of the 9p21 non-coding coronary artery disease risk interval in mice. Nature, 2010.|
|15.||Yun MH, Hiom K. CtIP-BRCA1 modulates the choice of DNA double-strand-break repair pathway throughout the cell cycle. Nature 459:460–3, 2009. [pmid: 19357644]|
|16.||Leung TH, et al. Deleted in liver cancer 2 (DLC2) suppresses cell transformation by means of inhibition of RhoA activity. Proc Natl Acad Sci U S A 102:15207–12, 2005. [pmid: 16217026]|
|17.||Urakawa I, et al. Klotho converts canonical FGF receptor into a specific receptor for FGF23. Nature 444:770–4, 2006. [pmid: 17086194]|
|18.||Schievink WI. Genetics of intracranial aneurysms. Neurosurgery 40:651–62; discussion 662–3, 1997. [pmid: 9092838]|
|19.||Cannon Albright LA, et al. A genealogical assessment of heritable predisposition to aneurysms. J Neurosurg 99:637–43, 2003. [pmid: 14567597]|
|20.||Kamatani Y, et al. A genome-wide association study identifies variants in the HLA-DP locus associated with chronic hepatitis B in Asians. Nat Genet 41:591–5, 2009. [pmid: 19349983]|
|21.||Purcell S, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 81:559–75, 2007. [pmid: 17701901]|
|22.||Clayton D, Leung HT. An R package for analysis of whole-genome association studies. Hum Hered 64:45–51, 2007. [pmid: 17483596]|
|23.||de Bakker P, et al. Practical aspects of imputation-driven meta-analysis of genome-wide association studies. Hum Mol Genet 17:R122–8, 2008. [pmid: 18852200]|
|24.||Clayton DG, et al. Population structure, differential bias and genomic control in a large-scale, case-control association study. Nat Genet 37:1243–6, 2005. [pmid: 16228001]|
|25.||Patterson N, Price AL, Reich D. Population structure and eigenanalysis. PLoS Genet 2:e190, 2006. [pmid: 17194218]|
|26.||Price AL, et al. Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet 38:904–9, 2006. [pmid: 16862161]|
|27.||Belkin M, Niyogi P. Laplacian eigenmaps for dimensionality reduction and data representation. Neural Comput 15:1373–1396, 2003.|
|28.||Lee A, Luca D, Klei L, Devlin B, Roeder K. Discovering genetic ancestry using spectral graph theory. Genet Epidemiol, 2009.|
|29.||von Luxburg U. A tutorial on spectral clustering. Stat Comput 17:395–416, 2007.|
|30.||Rosenbaum P. A characterization of optimal designs for observational studies. J R Statist Soc B 53:597–610, 1991.|
|31.||Hansen B, Klopfer S. Optimal full matching and related designs via network flows. J Comput Graph Stat 15:609–627, 2006.|
|32.||Wigginton J, Cutler D, Abecasis G. A note on exact tests of Hardy-Weinberg equilibrium. Am J Hum Genet 76:887–93, 2005. [pmid: 15789306]|
|33.||Browning B, Browning S. A unified approach to genotype imputation and haplotype-phase inference for large data sets of trios and unrelated individuals. Am J Hum Genet 84:210–23, 2009. [pmid: 19200528]|
|34.||Breslow N, Day N. Statistical methods in cancer research Volume I - The analysis of case-control studies. IARC Sci Publ :5–338, 1980.|
|35.||Viechtbauer W. Bias and efficiency of meta-analytic variance estimators in the random-effects model. J Educ Behav Stat 30:261–293, 2005.|
|36.||Higgins J, Thompson S, Deeks J, Altman D. Measuring inconsistency in meta-analyses. BMJ 327:557–60, 2003. [pmid: 12958120]|
|37.||Goodman S. Toward evidence-based medical statistics. 2: The Bayes factor. Ann Intern Med 130:1005–1013, 1999. [pmid: 10383350]|
|38.||Wakefield J. Reporting and interpretation in genome-wide association studies. Int J Epidemiol 37:641–53, 2008. [pmid: 18270206]|
|39.||Clayton D. Prediction and interaction in complex disease genetics: experience in type 1 diabetes. PLoS Genet 5:e1000540, 2009. [pmid: 19584936]|
[Figure ID: F1]
Genome-wide association analysis results in the discovery cohort
(a) The posterior probabilities of association (PPAs) for 831,532 QC-passed SNPs, analyzed with a prior probability of association of 1/10,000, are plotted against the genomic locations of the SNPs. A gray horizontal line at PPA = 0.5 indicates the cutoff value for follow-up genotyping. (b) Quantile-quantile (QQ) plots of P-values (−log10 scale) are shown for: all the SNPs analyzed (black; n = 831,532); SNPs after excluding those within previously identified regions (red; n = 830,907); SNPs after excluding all within the final associated intervals (blue; n = 830,158). (c) A scatter plot of −log10 P-values vs. log10 Bayes factors (BFs) is shown, with the color of each point indicating the range of PPA. The P-value for association, the BF and the PPA are very closely related. Note that, given a uniform prior probability of association, the PPA increases as the BF increases. A vertical line indicates the minimum PPA threshold of 0.5 (BF = 1.0×10^4) for follow-up.
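The relationship between the Bayes factor, the prior and the PPA in this caption follows from Bayes' rule on the odds scale: posterior odds equal the Bayes factor times the prior odds. A minimal Python sketch, with a hypothetical function name and the prior of 1/10,000 taken from the caption:

```python
def posterior_probability_of_association(bayes_factor, prior=1e-4):
    """Convert a Bayes factor in favor of association into a PPA.

    prior is the prior probability of association for each SNP.
    """
    prior_odds = prior / (1.0 - prior)
    posterior_odds = bayes_factor * prior_odds
    return posterior_odds / (1.0 + posterior_odds)

# A Bayes factor of about 1.0e4 gives a PPA of about 0.5,
# which is the follow-up threshold described in the caption.
threshold_ppa = posterior_probability_of_association(1.0e4)
```

With a fixed prior the PPA is a monotonically increasing function of the BF, so a PPA cutoff and a BF cutoff select the same SNPs.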
[Figure ID: F2]
Regional plots for associated regions
For each chromosomal interval, −log10 P-values for association are plotted against the genomic coordinates (NCBI build 36) in the upper panel; the recombination rates obtained from the HapMap database and the RefSeq genes (hg18) within the regions are shown in the lower panel. In the upper panel, rs identifiers of SNPs listed in Table 2 are shown and their positions are indicated by gray vertical lines. Gray dashed lines indicate locations of other SNPs genotyped in the replication cohorts. Dark blue and light blue dots represent results of genotyped and imputed SNPs for the discovery cohort, respectively; orange and light orange squares represent association results for the replication cohort using JP1 plus JP2 and JP2-only, respectively; combined results for SNPs genotyped both in the discovery and the replication cohort using JP1 plus JP2 and JP2-only are shown by red and light red diamonds, respectively.
[Figure ID: F3]
Consistency of association across cohorts
Forest plots are shown for meta-analysis of SNPs listed in Table 2. Squares and horizontal segments represent estimated per-allele odds ratios (ORs) and 95% confidence intervals (CIs) for individual cohorts. Diamonds represent the summary OR estimates and 95% CIs for the meta-analyses of 6 cohorts (fixed- and random-effects models). log10(BF) > 0 supports association with IA, while log10(BF) < 0 supports no association with IA. Analyzing the results here as 6 distinct cohorts rather than 4 (as in the primary analysis) results in only minor differences, due to different weights given to sub-cohorts of the combined European cohort (CE) associated with genomic control correction.
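For readers unfamiliar with how the summary diamonds in a forest plot are obtained, the following Python sketch shows a standard inverse-variance fixed-effects combination of per-cohort odds ratios. It is a generic illustration and not the authors' pipeline: the function names are hypothetical, and standard errors are assumed to be recovered from reported 95% CIs on the log scale.

```python
import math

Z95 = 1.959964  # two-sided 95% normal quantile

def se_from_ci(lower, upper, z=Z95):
    """Standard error of log(OR) recovered from a 95% confidence interval."""
    return (math.log(upper) - math.log(lower)) / (2.0 * z)

def fixed_effect_meta(odds_ratios, std_errors, z=Z95):
    """Inverse-variance weighted summary OR with its 95% CI.

    Each cohort is weighted by 1 / SE^2 of its log odds ratio.
    Returns (summary_or, ci_lower, ci_upper).
    """
    weights = [1.0 / se ** 2 for se in std_errors]
    total = sum(weights)
    pooled = sum(w * math.log(o) for w, o in zip(weights, odds_ratios)) / total
    pooled_se = math.sqrt(1.0 / total)
    return (math.exp(pooled),
            math.exp(pooled - z * pooled_se),
            math.exp(pooled + z * pooled_se))

# Hypothetical example: two cohorts with the same estimate and precision.
summary_or, ci_low, ci_high = fixed_effect_meta([1.2, 1.2], [0.1, 0.1])
```

Because the weights depend on each cohort's standard error, splitting or merging sub-cohorts changes the weights slightly, which is consistent with the minor differences the caption describes between the 6-cohort and 4-cohort analyses.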
Overview of the study cohorts
|Stage||Cohort||Case (n)||Control (n)||Number of QC-passed SNPs||Genomic inflation factor|
|Discovery||Combined European (CE)||1,972||8,122||905,906||1.09|
|Replication||Japan 1 (JP1)||829||761||12||-|
|Replication||Japan 2 (JP2)||2,282||905||13||-|
The Combined European (CE) cohort consisted of all European subjects who were not ascertained in Finland. Sub-cohorts of CE were defined on the basis of case series: NL = Cases from the Netherlands with matched controls; DE = German cases with matched controls; AN = @neurIST cases with matched controls. NL, DE and AN were exclusive subsets of CE (see also Supplementary Table 3). AN cases consisted of subjects from Germany, Great Britain, Hungary, the Netherlands, Switzerland and Spain. JP1 and JP2 were 2 independent Japanese case-control cohorts. Genomic inflation factors of FI and CE (as well as NL, DE and AN) were calculated for 1,303,876 and 905,906 SNPs, respectively. The genomic inflation factor of the discovery cohort (“Total discovery” row) was based on the meta-analysis result for 831,532 SNPs after correcting each cohort for genomic control. The discovery data (combined FI and CE) was not corrected for genomic control.
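The genomic inflation factor reported in this table is conventionally computed as the median of the observed 1-df chi-square association statistics divided by the median expected under the null, and genomic control divides each statistic by that factor. A minimal Python sketch under those standard definitions (function names are hypothetical, and not correcting when the factor is below 1 is a common convention assumed here):

```python
import statistics

# Median of the chi-square distribution with 1 degree of freedom.
CHI2_1DF_MEDIAN = 0.4549364

def genomic_inflation_factor(chi2_stats):
    """Median observed chi-square statistic over the null median."""
    return statistics.median(chi2_stats) / CHI2_1DF_MEDIAN

def genomic_control_correct(chi2_stats):
    """Divide every statistic by the inflation factor.

    Statistics are left unchanged when the factor is below 1
    (an assumed convention, see the lead-in).
    """
    lam = max(1.0, genomic_inflation_factor(chi2_stats))
    return [x / lam for x in chi2_stats]
```

A factor of 1.09, as listed for the CE cohort, would mean that the median test statistic is 9% larger than expected under the null before correction.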
Representative SNPs analyzed both in the discovery and replication cohorts
|Locus||SNP||Position||Genes||Risk Allele||Cohort||P-value||log10(BF)||PPA||Per-allele OR (95% CI)||Control RAF||Case RAF|
|8q11.23||rs10958409||55,489,644||SOX17||A||Discovery||4.2×10^-7||4.64||0.8128||1.24 (1.14-1.35)||0.15, 0.19||0.18, 0.22|
|8q12.1||rs9298506||55,600,077||SOX17||A||Discovery||1.2×10^-10||7.94||0.9999||1.33 (1.22-1.45)||0.81, 0.76||0.85, 0.81|
|8q12.1||rs9298506||55,600,077||SOX17||A||Combined||1.3×10^-12||9.85||1.0 − 1.4×10^-6||1.28 (1.20-1.38)||-||-|
|9p21.3||rs1333040||22,073,404||CDKN2A, CDKN2B||T||Discovery||2.5×10^-16||13.41||1.0 − 3.9×10^-10||1.32 (1.24-1.41)||0.56, 0.45||0.63, 0.53|
|9p21.3||rs1333040||22,073,404||CDKN2A, CDKN2B||T||Combined||1.5×10^-22||19.48||1.0 − 3.3×10^-16||1.32 (1.25-1.39)||-||-|
|10q24.32||rs12413409||104,709,086||CNNM2||G||Discovery||7.9×10^-7||4.29||0.6621||1.38 (1.22-1.57)||0.91, 0.91||0.94, 0.93|
|13q13.1||rs9315204||32,591,837||KL, STARD13||T||Discovery||3.3×10^-7||4.73||0.8443||1.21 (1.13-1.31)||0.21, 0.33||0.24, 0.39|
|18q11.2||rs11661542||18,477,693||RBBP8||C||Discovery||5.6×10^-9||6.39||0.9959||1.21 (1.14-1.30)||0.49, 0.44||0.54, 0.47|
|18q11.2||rs11661542||18,477,693||RBBP8||C||Combined||1.1×10^-12||9.92||1.0 − 1.2×10^-6||1.22 (1.15-1.28)||-||-|
Genomic locations for SNPs are based on NCBI build 36 and risk alleles are aligned to the forward strand of the reference sequence. Control and case risk allele frequencies (RAFs) for the discovery cohort are shown in the form: (RAF of CE), (RAF of FI). log10(BF) indicates the logarithm of the Bayes factor in favor of association. PPA stands for the posterior probability of association. Genes closest to the listed SNPs within the same linkage disequilibrium regions are shown.