- Research article
- Open Access
- Open Peer Review
Genome-wide association study for renal traits in the Framingham Heart and Atherosclerosis Risk in Communities Studies
BMC Medical Genetics volume 9, Article number: 49 (2008)
The Framingham Heart Study (FHS) recently obtained initial results from the first genome-wide association scan for renal traits. The study of 70,987 single nucleotide polymorphisms (SNPs) in 1,010 FHS participants provides a list of SNPs showing the strongest associations with renal traits which need to be verified in independent study samples.
Sixteen SNPs were selected for replication based on the most promising associations with chronic kidney disease (CKD), estimated glomerular filtration rate (eGFR), and serum cystatin C in FHS. These SNPs were genotyped in 15,747 participants of the Atherosclerosis in Communities (ARIC) Study and evaluated for association using multivariable adjusted regression analyses. Primary outcomes in ARIC were CKD and eGFR. Secondary prospective analyses were conducted for association with kidney disease progression using multivariable adjusted Cox proportional hazards regression. The definition of the outcomes, all covariates, and the use of an additive genetic model was consistent with the original analyses in FHS.
The intronic SNP rs6495446 in the gene MTHFS was significantly associated with CKD among white ARIC participants at visit 4: the odds ratio per each C allele was 1.24 (95% CI 1.09–1.41, p = 0.001). Borderline significant associations of rs6495446 were observed with CKD at study visit 1 (p = 0.024), eGFR at study visits 1 (p = 0.073) and 4 (lower mean eGFR per C allele by 0.6 ml/min/1.73 m2, p = 0.043) and kidney disease progression (hazard ratio 1.13 per each C allele, 95% CI 1.00–1.26, p = 0.041). Another SNP, rs3779748 in EYA1, was significantly associated with CKD at ARIC visit 1 (odds ratio per each T allele 1.22, p = 0.01), but only with eGFR and cystatin C in FHS.
This genome-wide association study provides unbiased information implicating MTHFS as a candidate gene for kidney disease. Our findings highlight the importance of replication to identify common SNPs associated with renal traits.
Kidney disease aggregates within families and measures of kidney function, such as estimated glomerular filtration rate (eGFR), are heritable [1–4]. Whereas many monogenetic causes of kidney disease have been discovered, the identification of common genetic variants hypothesized to confer susceptibility to complex diseases, such as chronic kidney disease (CKD), has been difficult and suffers from a lack of replication of initial positive findings [1, 5].
Recently, genome-wide association studies (GWAS) to discover associations of common genetic variants, single nucleotide polymorphisms (SNPs), and a phenotype of interest have become feasible. Surveying the whole genomes of many individuals, preferably as part of a large prospective study that provides extensive and rigorously collected information on phenotypes, can provide unbiased findings and has the power to potentially discover common genetic variants that are associated with complex diseases. So far, the method has successfully and repeatedly identified common SNPs associated with a wide variety of complex diseases such as diabetes mellitus and coronary heart disease [6–11]. Because of the large number of tests conducted and the small a priori probability of a true association between any given SNP and the phenotype, replication of initial findings from a GWAS is essential .
Recently, genome-wide tests of 70,987 autosomal SNPs with renal traits were conducted as part of the Framingham Heart Study (FHS) 100 K SNP GWAS resource . Initial results have been published , but have not been replicated in independent study samples to date. None of the initial associations between SNPs and CKD or eGFR reached genome-wide significance. However, the initial study had limited power, and true associations of moderate size are likely to be associated at p-values on the order of 10e-3 to 10e-5, as will be false positive associations due to the large number of tests conducted. To distinguish true from false positive findings, promising SNPs need to be tested in independent cohorts. Therefore, the objective of the present study was to validate initial findings by replicating the strongest and most promising associations after correction for multiple testing. Here we report the first replication of initially observed associations from a GWAS of kidney disease traits in 11,447 white participants of the community-based, prospective Atherosclerosis Risk in Communities (ARIC) Study.
Description of the initial (stage I) sample, Framingham Heart Study
In 1948, 5,209 participants of the Framingham Heart Study, a prospective community-based cohort, were recruited into the Original Cohort. In 1971, 5,124 of their children or spouses were enrolled into the Offspring Cohort and examined every 4 to 8 years . Members of the largest 330 pedigrees among the Original and Offspring Cohorts were selected for genotyping (n = 1,345 after data cleaning) as detailed elsewhere . Of these, phenotype data from the Offspring examination 7 in 1998–2001 were available for 1,010 individuals with eGFR and CKD and 981 individuals with cystatin C measurements, the final sample sizes used for the stage I analyses of the GWAS of renal traits . The study was approved by the Institutional Review Board of the Boston University Medical Center. All subjects provided written informed consent.
Genotyping was performed using the Affymetrix GeneChip Human Mapping 100 K SNP set. Details of the genotyping process are reported elsewhere . All genotype data were returned to the NHLBI; aggregate results data are publicly available . SNPs with call rates <80%, deviations from Hardy-Weinberg expectations (p < 0.001 in unrelated individuals) or minor allele frequency <10% were excluded. After data cleaning, 70,987 autosomal SNPs remained for analyses .
Estimated GFR (ml/min/1.73 m2) was calculated using the four-variable Modification of Diet in Renal Disease Study equation from calibrated serum creatinine measured at Offspring examination 7 by the modified Jaffe method. CKD was defined based on the National Kidney Foundation Kidney Disease Outcome Quality Initiative working group , and modified slightly, using sex-specific cutoffs for CKD of eGFR<59 ml/min/1.73 m2 in women and <64 ml/min/1.73 m2 in men, as described previously [14, 19]. Serum cystatin C (mg/l) was measured at Offspring examination 7 using particle enhanced immunonephelometry (Dade Behring BN 100 nephelometer) .
For data analysis, multivariable-adjusted residuals were generated for each phenotype. Covariates used for multivariable-adjustment were age, sex, systolic blood pressure, hypertension treatment, HDL-cholesterol, smoking, diabetes, and body mass index . To account for relatedness among the study individuals, generalized estimating equations (GEE) or family-based association tests (FBAT) were used to test associations between phenotype residuals and each SNP; a detailed description is provided elsewhere . Additional analyses re-analyzed SNPs that replicated in ARIC using the raw traits in multivariable adjusted GEE regression models to allow for a direct comparison of effect size estimates between the two studies.
Selection of SNPs for replication
Strongly associated SNPs in the FHS 100 K analyses were prioritized for follow-up genotyping in an independent sample, the ARIC Study (described below) as shown in Figure 1. Ten SNPs showed joint association with all three kidney traits (eGFR, CKD, cystatin C) at p < 0.01 for each trait. Eight of these 10 SNPs were selected for follow-up genotyping; the remaining 2 were dropped due to high linkage disequilibrium (LD) with other SNPs in this set. In addition, statistical and biological evidence were combined by selecting a subset of SNPs showing strong association and additional location in candidate gene regions. Eight SNPs met these criteria, and were selected from the following sets: 1) SNPs with the 100 lowest p-values from GEE models for association with one of the 3 kidney traits; 2) SNPs that showed association with both eGFR and cystatin C at p-values <0.01 from either GEE or FBAT models; 3) SNPs that showed association with both eGFR and CKD at p-values <0.01 from either GEE or FBAT models. Only SNPs with a minimum call rate of 90% were considered for replication. Candidate genes were identified as auch if, based on a thorough literature search, there was evidence for the gene's involvement in renal disease (for example, a renal phenotype in knock-out mice or mutations causing a Mendelian syndrome in humans with renal involvement), or if their gene products are known to be involved in a physiological mechanism important for kidney function such as filtration or electrolyte transport. The SNPs selected for replication for their location in or near a candidate gene were located in FRAS1,NR3C2,SGK1,CFTR,EYA1,IQGAP1,[26, 27] and near GLIS3.
Description of the replication (stage II) sample, Atherosclerosis in Communities Study
The ARIC Study is an ongoing, population-based, prospective cohort of 15,792 adults. Study participants were aged 45–64 years at their recruitment from four US communities in 1987–89, when they underwent the standardized ARIC baseline examination (visit 1). Participants underwent 3 further standardized examinations approximately every 3 years. Since the end of visit 4 in 1998, participants continue to be followed up by annual phone calls as well as by obtaining information from hospitalization discharge records of local hospitals and death certificates. Further details of the study design have been reported previously . Institutional Review Boards of the participating institutions approved the study protocols, and written informed consent was obtained from all participants at each examination. From the total study sample of 15,792 participants at the baseline examination, 45 individuals were excluded because they did not consent to genetic research and 47 because they did not self-identify as "black" or "white". As the genotyped FHS participants were exclusively white, the primary replication sample consisted of white ARIC participants. Black ARIC participants were also genotyped, enabling further characterization of SNPs showing evidence of association in white ARIC participants. Measures of genotyping quality control and allele distributions were assessed in 11,447 white and 4,253 black ARIC participants. For association analyses, participants were additionally excluded for genotyping failure of all SNPs (173 whites, 168 blacks), and for missing serum creatinine (4 whites, 97 blacks) and missing covariates (53 whites, 94 blacks) at visit 1 [missing creatinine (76 whites, 77 blacks) and missing covariates (15 whites, 135 blacks) at visit 4] for a final, primary replication study sample of 11,217 white participants at visit 1 [8,717 at visit 4] (Table 1), and additionally 3,894 black participants at visit 1 [2,358 at visit 4] [see Additional file 1].
Assessment of other study characteristics
Detailed information on obtaining demographic, socioeconomic, health behavior, risk factor control, and medical history have been described previously [29, 30]. Racial affiliation was self-reported using the terms "black" or "white".
Genotyping of all SNPs was performed by the ARIC Central DNA Laboratory. The polymorphisms rs10509132, rs2827732, rs6495446, rs1743955, and rs4835136 were genotyped individually using the TaqMan assay (Applied Biosystems, Foster City, CA). All other SNPs were genotyped in two multiplexes using the iPLEX gold assay (Sequenom, San Diego, CA). The minimum satisfactory call rate was set at 90%, and the cutoff indicating statistically significant deviation from Hardy-Weinberg expectations was set at p < 10-4.
Continuous eGFR and CKD, defined as in FHS, were defined a priori as the primary study outcomes. These outcomes were investigated primarily at ARIC visit 4 which provided more individuals with CKD and a distribution of major kidney disease risk factors, such as hypertension, similar to that in FHS. Outcomes were also investigated at ARIC visit 1 to maximize the sample size for continuous outcomes. Serum cystatin C measurements were not available in the ARIC Study. In secondary analyses, associations between the SNPs and kidney disease progression were conducted, as were analyses defining CKD as eGFR <60 ml/min/1.73 m2 . Serum creatinine was measured using the modified kinetic Jaffe reaction and creatinine values were standardized and calibrated as described previously [31, 32]. Kidney disease progression was defined as either an increase in serum creatinine levels ≥ 0.4 mg/dl above baseline or a hospitalization discharge or death coded for chronic renal disease using international classification of disease codes and analyzed as described elsewhere .
Consistent with FHS, an additive genetic model was used for each SNP in all analyses. Multivariable regression models were adjusted for the same covariates as in FHS. For the secondary analyses using prospective data, follow-up time was counted from the baseline ARIC visit until the date kidney disease progression occurred in cases, or the earlier of the date of last contact (or date of non-CKD death) or December 31, 2004 for non-cases. Multivariable adjusted Cox Proportional Hazards models were used, and the proportionality assumption of all Cox models was assessed by inspection of the complementary log(-log [survival function]) curves.
The two-sided significance level was determined a priori at α = 0.05 for each of the 8 SNPs in plausible candidate genes and at α = 0.00625 (0.05/8) after applying a Bonferroni correction to the other 8 SNPs. In our analyses among white participants, the power to detect an odds ratio of CKD of 1.3 or higher per each increase in risk allele and assuming α = 0.05 is >80% for a risk allele frequency of 0.2, and >90% for a risk allele frequency of 0.3.
Table 1 shows the distribution of study sample characteristics among FHS and white ARIC participants, and Additional file 1 shows this distribution among the additional study sample of black ARIC participants. The mean eGFR in FHS participants was 86.6 ml/min/1.73 m2 (standard deviation (SD) 19.8), compared to 89.6 ml/min/1.73 m2 (SD 17.9) in white ARIC participants at visit 1 and 80.5 ml/min/1.73 m2 (SD 16.9) at visit 4. CKD was present in 80 (7.9%) of FHS participants and in 449 (4.0%) of white ARIC participants at visit 1 and 756 (8.7%) at visit 4.
Minor allele frequencies for the 16 SNPs among FHS and white ARIC participants are provided in Table 2, and minor allele frequencies among black ARIC participants are provided separately [see Additional file 2]. The minor allele frequencies in FHS and white ARIC participants were similar, and both were similar to the minor allele frequencies from the HapMap CEU sample. The distribution of genotypes for all SNPs in ARIC whites conformed to Hardy-Weinberg expectations, while in ARIC blacks, rs2228210 and rs6831700 did not and were excluded from further analyses. The average call rate across SNPs was 94.1% in white and 91.6% in black ARIC participants. The concordance rates for genotyping of 314 replicate samples per SNP in ARIC was >97% for each SNP (kappa coefficient ≥ 0.95).
Associations of SNPs with eGFR and CKD in the ARIC Study
Table 3 shows the replication data for the 16 selected SNPs at visit 4, where participants were on average 9 years older than at visit 1, thus providing a greater number of participants with CKD. Among the 8 SNPs associated with all 3 renal traits in FHS, rs6495446 in MTHFS was significantly associated with CKD among white ARIC participants (odds ratio (OR) 1.24 per each C allele, 95% confidence interval 1.09–1.41, p = 0.001). The association with eGFR at visit 4 (-0.57 ml/min/1.73 m2 per increase in C allele, p = 0.043) was not significant at the pre-specified α of 0.00625. The proportion of eGFR variance explained by rs6495446 was 0.04% (0.4% in FHS). The associations of rs6495446 with CKD and eGFR were of similar magnitude but not statistically significant at ARIC visit 1 (OR of CKD 1.20 per increase in C allele, p = 0.024, -0.48 ml/min/1.73 m2 per increase in C allele, p = 0.073) [see Additional file 3]. The direction of the association results for rs6495446 was consistent with the one observed in FHS, therefore replicating the association of this SNP with CKD. Further investigation of this SNP in black ARIC participants revealed no significant association of rs6495446 and either trait at visit 4 (p = 0.68 for CKD and p = 0.26 for eGFR) or visit 1 (p = 0.44 for CKD and p = 0.03 for eGFR), although the association of rs6495446 and eGFR at visit 1 in ARIC blacks was of borderline significance (-1.2 ml/min/1.73 m2 per increase in T allele, p = 0.03). None of the other 7 SNPs associated with all 3 renal traits in FHS were significantly associated with either kidney trait in white ARIC participants at visit 4 (Table 3).
Among the SNPs selected for their location in a candidate gene, rs3779748 (EYA1) was significantly associated with CKD not at ARIC visit 4 but at visit 1 (OR 1.22 per each increase in T allele, p = 0.01) [see Additional file 3]. The same risk allele had been observed in FHS. However, since rs3779748 had been significantly associated with only the traits eGFR and cystatin C but not CKD in FHS, this does not constitute a true replication. Another SNP, rs10520688 in IQGAP1, was significantly associated with CKD at ARIC visit 4 (p = 0.02), but the risk allele was opposite from the one observed in FHS. None of the other SNPs selected for their location in a candidate gene was significantly associated with either kidney trait in ARIC.
In secondary analyses, the association of all 16 SNPs was investigated prospectively with kidney disease progression in ARIC. Over a mean follow-up of 14.7 years, there were 836 white ARIC participants with kidney disease progression. None of the SNPs was significantly associated with kidney disease progression among white ARIC participants at the pre-specified levels of significance. However, the SNP rs6495446 (MTHFS), which replicated among white ARIC participants for the association with CKD, showed a hazard ratio of 1.13 per each increase in C allele for kidney disease progression (95% CI 1.01–1.26, p = 0.041). The proportional hazards assumption was met.
We further investigated the association of rs6495446 and CKD at ARIC visit 4 stratified by sex as well as by age (<60 years, ≥ 60 years). In these hypothesis-generating analyses, we observed stronger effects in men (OR = 1.32, 95% CI 1.11–1.57) compared to women (OR = 1.15, 95% CI 0.95–1.40; p-interaction = 0.33) and in participants aged <60 years (OR = 1.54, 95% CI 1.12–2.11) compared to those aged ≥ 60 years (OR = 1.16, 95% CI 1.01–1.34; p-interaction = 0.09).
For SNP rs6495446, which had replicated among the ARIC white sample, the FHS data were re-analyzed in secondary analyses using multivariable regression of the raw trait in order to parallel the analyses conducted in ARIC, thus allowing for a direct comparison of effect size estimates. Per each increase in C allele, the OR of CKD was 1.91 (95% CI 1.22–2.99, p = 0.005), and mean eGFR was lower by 3.1 ml/min/1.73 m2 (95% CI 1.2–5.0, p = 0.001). Figure 2 shows these risk estimates in relation to the risk estimates obtained for eGFR (panel A) and CKD (panel B) in ARIC, as well as in relation to the risk estimates obtained from prospective analyses in ARIC.
We also conducted sensitivity analyses in the ARIC Study using an overall cutoff of eGFR <60 ml/min/1,73 m2 to define CKD in order to evaluate the most commonly used clinical definition of CKD . While the direction of the association of rs6495446 and CKD in ARIC whites at study visit 4 was consistent and the magnitude largely unchanged, the odds ratio was no longer statistically significant (OR 1.12, p = 0.075) as it had been using the sex-specific cutoffs.
Here we present results from the first complete GWAS of renal traits, comprised of results from FHS, the discovery sample (stage I) , and the large population-based ARIC Study, the replication sample (stage II). The association of SNP rs6495446 in MTHFS with CKD replicated in white ARIC participants at study visit 4. Another SNP, rs3779748 in EYA1, was significantly associated with CKD at study visit 1, a kidney trait different from but related to the ones significantly associated in FHS, eGFR and cystatin C. Our findings may provide guidance for investigators wishing to further replicate the associations from this GWAS of renal traits, as well as present considerations and mechanisms of SNP selection for replication studies.
We cannot compare our findings to existing literature, since this is the first study presenting replicated results from a GWAS of renal traits. Apart from the results in FHS, another study has published initial but not replicated association results from genome-wide tests for the traits serum creatinine, eGFR, and creatinine clearance in 2,000 white hypertensive individuals . Findings on the association of specific SNPs with serum creatinine or eGFR were not presented as none of these associations met the significance threshold for further investigation of p < 10-5 chosen by the authors .
SNP rs6495446, an intronic SNP in the gene MTHFS on chromosome 15q25.1, was significantly associated with CKD in white ARIC participants at study visit 4 (OR 1.24 per each C allele, 95% confidence interval 1.09–1.41, p = 0.001), replicating the trait and risk allele observed in the initial screen. Comparison of the effect sizes as well as the proportion of eGFR variance explained by rs6495446 between the FHS and white ARIC samples showed greater magnitude of the effects in FHS, consistent with the phenomenon of the "winner's curse": associations in the replication sample are often of lesser magnitude than in the initial sample, especially in samples with relatively low power to detect moderate effect sizes, because only the strongest associations in this initial sample were selected to be followed up. Further replication of the association of MTHFS rs6495446 and CKD in additional studies is warranted, particularly in light of the small effect size observed in the replication and the fact that the association reached statistical significance at the pre-specified level for one but not both traits and visits studied in ARIC.
The gene MTHFS codes for the enzyme methenyltetrahydrofolate synthetase, which is expressed in human and rat kidney and has been reported to play a role in folate turnover and accumulation . To our knowledge, there have not been any prior reports linking this gene to renal traits or kidney disease, although folate supplementation is known to reduce homocysteine levels, which are commonly elevated in individuals with advanced kidney disease [35, 36]. The SNP rs6495446 is located in intron 2 of MTHFS. The only coding SNP in MTHFS provided in the public database dbSNP is rs8923 in exon 3 (T202A); this SNP was not included on the genotyping chip. In the HapMap CEU sample, rs8923 and rs6495446 are grouped into one LD block (pair-wise D' = 1.0; r2 = 0.38). Interestingly, when we searched a database containing results from a GWAS of global gene expression , the association of the rs6495446 C allele with higher expression levels of a MTHFS gene transcript was genome-wide significant (p = 3.3 × 10-10). The coding variant rs8923 was not genotyped in this study, but its location is flanked on both sides by the two markers showing the strongest association with MTHFS transcript levels out of all 408,273 SNPs investigated. rs6495446 by itself accounted for 13% of the total expression variance of this MTHFS transcript . Although this observation does not allow for the conclusion that altered MTHFS mRNA levels associated with rs6495446 are causally related to kidney disease, it provides some functional evidence for rs6495446 or a variant in LD with it. Upcoming denser GWAS may help to further fine-map the observed association
Another SNP significantly associated with CKD in ARIC whites at visit 1, rs3779748, is located in the gene eyes absent homolog 1 (Drosophila) (EYA1) on chromosome 8q13.3. Mutations in EYA1 cause the Mendelian syndrome branchiootorenal dysplasia syndrome (MIM# 113650) featuring a renal phenotype ranging from mild renal hypoplasia to complete absence of the kidneys . Although the same risk allele for rs3779748 was identified in both FHS and ARIC whites, this SNP was associated only with the traits eGFR and cystatin C in FHS. Although CKD and eGFR are highly correlated traits in ARIC, the observed association should therefore not be considered a true replication. Moreover, the association with CKD at ARIC visit 1 was no longer observed at ARIC visit 4, and we can therefore not exclude a chance finding.
Strengths of the work presented here include high power for replication of the association between common genetic variants and renal traits, as well as the availability of a replication study similar in design to the FHS Study, namely another community-based, prospective study with rigorously collected phenotype information. We were therefore able to adjust our analyses for the same covariates as FHS, leading to better comparability of the results across studies. Finally, we were able to prospectively evaluate the association with incident kidney disease progression in secondary analyses.
When interpreting the findings presented here, several limitations should be kept in mind. First, there are limitations to the original 100 K genome-wide screen for renal traits in FHS, which are discussed in detail elsewhere [13, 14]. Among them is the limited power to discover moderate genetic effects of the size we observed (OR~1.2), especially for dichotomous but also for continuous traits. It may therefore be expected that future, better powered screens will identify additional variants of comparable effect size that could not be detected in the FHS 100 K screen. Further, the FBAT test is underpowered as it only relies on a subset of informative families. Therefore, FBAT results should not be interpreted with the same degree of confidence as GEE results. Despite these limitations, however, this genome-wide screen successfully identified a SNP in CST3, the gene encoding the cystatin C protein, as associated with serum cystatin C levels at genome-wide significance . This increases confidence that the FHS 100 K genome-wide screen was able to identify associations which may represent true findings. Another potential limitation in the initial genome-wide screen was the use of a liberal call rate of 80%. We tried to address this in our replication study by using only those SNPs having call rates >90% in FHS. Moreover, SNPs with minor allele frequencies <10% were excluded from the initial association analyses in FHS, as an excess number of significant results was observed for SNPs with low minor allele frequencies . We could therefore not investigate a putative contribution to the phenotypic variation by rare genetic variants, as has been reported for other complex genetic traits . However, genome-wide association in general is not a good technique for detecting rare variants .
We further cannot exclude the possibility that undetected population stratification might have influenced our results, but prior investigations within FHS have found no evidence for the presence of population stratification . In addition, the magnitude of the association between rs6495446 and CKD in ARIC did not differ significantly by ARIC Study center (p-interaction = 0.3).
Secondly, the selection of only 16 SNPs for further genotyping might not be sufficient for a comprehensive evaluation of all strongly associated SNPs from the FHS 100 K genome-wide association scan of renal traits. In particular, the selection and examination of a single SNP for a given genetic region may not be sufficient to capture the full effect of genetic variation in this region. Despite good power to detect significant associations between common SNPs and renal traits in ARIC, we observed true replication for only one of the selected SNPs, rs6495446. This may have been influenced by the selection strategy we used to prioritize SNPs for follow-up genotyping. A formal comparison between SNPs selected based on joint association with related phenotypes and those selected based on low p-values and biologic reasons is limited by the small number of selected SNPs, but our data does not lend strong support to either strategy. The results presented here, specifically for rs6495446 in MTHFS, should be further replicated and additional variants in MTHFS should be genotyped in future studies for fine-mapping of the association before a definite conclusion about the presence or absence of a moderate-sized association between variation in MTHFS and renal traits can be drawn.
Limitations to the phenotype definition include the lack of serum cystatin C measurements in ARIC as well as estimation of GFR. Although serum creatinine measurements were calibrated in both FHS and ARIC using the same method, GFR estimation based on serum creatinine has inherent limitations including lower accuracy in the higher GFR ranges . We tried to address this issue by selecting SNPs for replication that were significantly associated with more than one renal trait in FHS, and in fact 50% of the SNPs selected for further genotyping in ARIC were associated with all 3 kidney traits investigated in FHS.
In summary, the association of SNP rs6495446 in MTHFS with CKD was replicated in an independent study sample of white ARIC participants, constituting the first GWAS of kidney disease traits including both discovery and replication. Further work is needed to fully characterize the association of genetic variants in MTHFS with kidney disease. These findings highlight the importance of replication of initial GWAS findings to identify common SNPs associated with renal function traits.
Freedman BI, Satko SG: Genes and renal disease. Curr Opin Nephrol Hypertens. 2000, 9: 273-277. 10.1097/00041552-200005000-00011.
Fox CS, Yang Q, Cupples LA, Guo CY, Larson MG, Leip EP, Wilson PW, Levy D: Genomewide linkage analysis to serum creatinine, GFR, and creatinine clearance in a community-based population: The Framingham Heart Study. J Am Soc Nephrol. 2004, 15: 2457-2461. 10.1097/01.ASN.0000135972.13396.6F.
Bochud M, Elston RC, Maillard M, Bovet P, Schild L, Shamlaye C, Burnier M: Heritability of renal function in hypertensive families of African descent in the Seychelles (Indian ocean). Kidney Int. 2005, 67: 61-69. 10.1111/j.1523-1755.2005.00055.x.
Placha G, Poznik GD, Dunn J, Smiles A, Krolewski B, Glew T, Puppala S, Schneider J, Rogus JJ, Rich SS, Duggirala R, Warram JH, Krolewski AS: A genome-wide linkage scan for genes controlling variation in renal function estimated by serum cystatin C levels in extended families with type 2 diabetes. Diabetes. 2006, 55: 3358-3365. 10.2337/db06-0781.
Hirschhorn JN, Lohmueller K, Byrne E, Hirschhorn K: A comprehensive review of genetic association studies. Genet Med. 2002, 4: 45-61.
Sladek R, Rocheleau G, Rung J, Dina C, Shen L, Serre D, Boutin P, Vincent D, Belisle A, Hadjadj S, Balkau B, Heude B, Charpentier G, Hudson TJ, Montpetit A, Pshezhetsky AV, Prentki M, Posner BI, Balding DJ, Meyre D, Polychronakos C, Froguel P: A genome-wide association study identifies novel risk loci for type 2 diabetes. Nature. 2007, 445: 881-5. 10.1038/nature05616.
Scott LJ, Mohlke KL, Bonnycastle LL, Willer CJ, Li Y, Duren WL, Erdos MR, Stringham HM, Chines PS, Jackson AU, Prokunina-Olsson L, Ding CJ, Swift AJ, Narisu N, Hu T, Pruim R, Xiao R, Li XY, Conneely KN, Riebow NL, Sprau AG, Tong M, White PP, Hetrick KN, Barnhart MW, Bark CW, Goldstein JL, Watkins L, Xiang F, Saramies J, Buchanan TA, Watanabe RM, Valle TT, Kinnunen L, Abecasis GR, Pugh EW, Doheny KF, Bergman RN, Tuomilehto J, Collins FS, Boehnke M: A genome-wide association study of type 2 diabetes in Finns detects multiple susceptibility variants. Science. 2007, 316: 1341-5. 10.1126/science.1142382.
Zeggini E, Weedon MN, Lindgren CM, Frayling TM, Elliott KS, Lango H, Timpson NJ, Perry JR, Rayner NW, Freathy RM, Barrett JC, Shields B, Morris AP, Ellard S, Groves CJ, Harries LW, Marchini JL, Owen KR, Knight B, Cardon LR, Walker M, Hitman GA, Morris AD, Doney AS, McCarthy MI, Hattersley AT: Replication of genome-wide association signals in U.K. samples reveals risk loci for type 2 diabetes. Science. 2007, 316: 1336-41. 10.1126/science.1142364.
Saxena R, Voight BF, Lyssenko V, Burtt NP, de Bakker PI, Chen H, Roix JJ, Kathiresan S, Hirschhorn JN, Daly MJ, Hughes TE, Groop L, Altshuler D, Almgren P, Florez JC, Meyer J, Ardlie K, Bengtsson K, Isomaa B, Lettre G, Lindblad U, Lyon HN, Melander O, Newton-Cheh C, Nilsson P, Orho-Melander M, Rastam L, Speliotes EK, Taskinen MR, Tuomi T, Guiducci C, Berglund A, Carlson J, Gianniny L, Hackett R, Hall L, Holmkvist J, Laurila E, Sjogren M, Sterner M, Surti A, Svensson M, Svensson M, Tewhey R, Blumenstiel B, Parkin M, Defelice M, Barry R, Brodeur W, Camarata J, Chia N, Fava M, Gibbons J, Handsaker B, Healy C, Nguyen K, Gates C, Sougnez C, Gage D, Nizzari M, Gabriel SB, Chirn GW, Ma Q, Parikh H, Richardson D, Ricke D, Purcell S: Genome-wide association analysis identifies loci for type 2 diabetes and triglyceride levels. Science. 2007, 316: 1331-6. 10.1126/science.1142358.
Helgadottir A, Thorleifsson G, Manolescu A, Gretarsdottir S, Blondal T, Jonasdottir A, Jonasdottir A, Sigurdsson A, Baker A, Palsson A, Masson G, Gudbjartsson D, Magnusson KP, Andersen K, Levey AI, Backman VM, Matthiasdottir S, Jonsdottir T, Palsson S, Einarsdottir H, Gunnarsdottir S, Gylfason A, Vaccarino V, Hooper WC, Reilly MP, Granger CB, Austin H, Rader DJ, Shah SH, Quyyumi AA, Gulcher JR, Thorgeirsson G, Thorsteinsdottir U, Kong A, Stefansson K: A common variant on chromosome 9p21 affects the risk of myocardial infarction. Science. 2007, 316: 1491-3. 10.1126/science.1142842.
McPherson R, Pertsemlidis A, Kavaslar N, Stewart A, Roberts R, Cox DR, Hinds DA, Pennacchio LA, Tybjaerg-Hansen A, Folsom AR, Boerwinkle E, Hobbs HH, Cohen JC: A common allele on chromosome 9 associated with coronary heart disease. Science. 2007, 316: 1488-91. 10.1126/science.1142447.
NCI-NHGRI Working Group on Replication in Association Studies, Chanock SJ, Manolio T, Boehnke M, Boerwinkle E, Hunter DJ, Thomas G, Hirschhorn JN, Abecasis G, Altshuler D, Bailey-Wilson JE, Brooks LD, Cardon LR, Daly M, Donnelly P, Fraumeni JF, Freimer NB, Gerhard DS, Gunter C, Guttmacher AE, Guyer MS, Harris EL, Hoh J, Hoover R, Kong CA, Merikangas KR, Morton CC, Palmer LJ, Phimister EG, Rice JP, Roberts J, Rotimi C, Tucker MA, Vogan KJ, Wacholder S, Wijsman EM, Winn DM, Collins FS: Replicating genotype-phenotype associations. Nature. 2007, 447: 655-660. 10.1038/447655a.
Cupples LA, Arruda HT, Benjamin EJ, D'Agostino RBS, Demissie S, DeStefano AL, Dupuis J, Falls KM, Fox CS, Gottlieb DJ, Govindaraju DR, Guo CY, Heard-Costa NL, Hwang SJ, Kathiresan S, Kiel DP, Laramie JM, Larson MG, Levy D, Liu CY, Lunetta KL, Mailman MD, Manning AK, Meigs JB, Murabito JM, Newton-Cheh C, O'Connor GT, O'Donnell CJ, Pandey M, Seshadri S, Vasan RS, Wang ZY, Wilk JB, Wolf PA, Yang Q, Atwood LD: The Framingham Heart Study 100 K SNP genome-wide association study resource: Overview of 17 phenotype working group reports. BMC Med Genet. 2007, 8 (Suppl 1): S1-10.1186/1471-2350-8-S1-S1.
Hwang SJ, Yang Q, Meigs JB, Pearce EN, Fox CS: A genome-wide association for kidney function and endocrine-related traits in the NHLBI's Framingham Heart Study. BMC Med Genet. 2007, 8 (Suppl 1): S10-10.1186/1471-2350-8-S1-S10.
Kannel WB, Feinleib M, McNamara PM, Garrison RJ, Castelli WP: An investigation of coronary heart disease in families. The Framingham Offspring Study. Am J Epidemiol. 1979, 110: 281-290.
Framingham SNP Health Association Resource (SHARe). [http://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000007.v2.p1]
Levey AS, Bosch JP, Lewis JB, Greene T, Rogers N, Roth D: A more accurate method to estimate glomerular filtration rate from serum creatinine: A new prediction equation. Modification of Diet in Renal Disease Study group. Ann Intern Med. 1999, 130: 461-470.
National Kidney Foundation: K/DOQI clinical practice guidelines for chronic kidney disease: Evaluation, classification, and stratification. Am J Kidney Dis. 2002, 39: S1-266.
Fox CS, Larson MG, Leip EP, Culleton B, Wilson PW, Levy D: Predictors of new-onset kidney disease in a community-based population. JAMA. 2004, 291: 844-850. 10.1001/jama.291.7.844.
Parikh NI, Hwang SJ, Larson MG, Meigs JB, Levy D, Fox CS: Cardiovascular disease risk factors in chronic kidney disease: Overall burden and rates of treatment and control. Arch Intern Med. 2006, 166: 1884-1891. 10.1001/archinte.166.17.1884.
McGregor L, Makela V, Darling SM, Vrontou S, Chalepakis G, Roberts C, Smart N, Rutland P, Prescott N, Hopkins J, Bentley E, Shaw A, Roberts E, Mueller R, Jadeja S, Philip N, Nelson J, Francannet C, Perez-Aytes A, Megarbane A, Kerr B, Wainwright B, Woolf AS, Winter RM, Scambler PJ: Fraser syndrome and mouse blebbed phenotype caused by mutations in FRAS1/Fras1 encoding a putative extracellular matrix protein. Nat Genet. 2003, 34: 203-208. 10.1038/ng1142.
Le Menuet D, Isnard R, Bichara M, Viengchareun S, Muffat-Joly M, Walker F, Zennaro MC, Lombes M: Alteration of cardiac and renal functions in transgenic mice overexpressing human mineralocorticoid receptor. J Biol Chem. 2001, 276: 38911-38920. 10.1074/jbc.M103984200.
Vallon V, Lang F: New insights into the role of serum- and glucocorticoid-inducible kinase SGK1 in the regulation of renal function and blood pressure. Curr Opin Nephrol Hypertens. 2005, 14: 59-66. 10.1097/00041552-200501000-00010.
Devuyst O, Guggino WB: Chloride channels in the kidney: Lessons learned from knockout animals. Am J Physiol Renal Physiol. 2002, 283: F1176-91.
Abdelhak S, Kalatzis V, Heilig R, Compain S, Samson D, Vincent C, Weil D, Cruaud C, Sahly I, Leibovici M, Bitner-Glindzicz M, Francis M, Lacombe D, Vigneron J, Charachon R, Boven K, Bedbeder P, Van Regemorter N, Weissenbach J, Petit C: A human homologue of the drosophila eyes absent gene underlies branchio-oto-renal (BOR) syndrome and identifies a novel gene family. Nat Genet. 1997, 15: 157-164. 10.1038/ng0297-157.
Hinkes B, Wiggins RC, Gbadegesin R, Vlangos CN, Seelow D, Nurnberg G, Garg P, Verma R, Chaib H, Hoskins BE, Ashraf S, Becker C, Hennies HC, Goyal M, Wharram BL, Schachter AD, Mudumana S, Drummond I, Kerjaschki D, Waldherr R, Dietrich A, Ozaltin F, Bakkaloglu A, Cleper R, Basel-Vanagaite L, Pohl M, Griebel M, Tsygin AN, Soylu A, Muller D, Sorli CS, Bunney TD, Katan M, Liu J, Attanasio M, O'toole JF, Hasselbacher K, Mucha B, Otto EA, Airik R, Kispert A, Kelley GG, Smrcka AV, Gudermann T, Holzman LB, Nurnberg P, Hildebrandt F: Positional cloning uncovers mutations in PLCE1 responsible for a nephrotic syndrome variant that may be reversible. Nat Genet. 2006, 38: 1397-1405. 10.1038/ng1918.
Lehtonen S, Ryan JJ, Kudlicka K, Iino N, Zhou H, Farquhar MG: Cell junction-associated proteins IQGAP1, MAGI-2, CASK, spectrins, and alpha-actinin are components of the nephrin multiprotein complex. Proc Natl Acad Sci USA. 2005, 102: 9814-9819. 10.1073/pnas.0504166102.
Senee V, Chelala C, Duchatelet S, Feng D, Blanc H, Cossec JC, Charon C, Nicolino M, Boileau P, Cavener DR, Bougneres P, Taha D, Julier C: Mutations in GLIS3 are responsible for a rare syndrome with neonatal diabetes mellitus and congenital hypothyroidism. Nat Genet. 2006, 38: 682-687. 10.1038/ng1802.
The Atherosclerosis Risk in Communities (ARIC) Study: Design and objectives. The ARIC investigators. Am J Epidemiol. 1989, 129: 687-702.
Hsu CC, Kao WH, Coresh J, Pankow JS, Marsh-Manzi J, Boerwinkle E, Bray MS: Apolipoprotein E and progression of chronic kidney disease. JAMA. 2005, 293: 2892-2899. 10.1001/jama.293.23.2892.
Coresh J, Astor BC, McQuillan G, Kusek J, Greene T, Van Lente F, Levey AS: Calibration and random variation of the serum creatinine assay as critical elements of using equations to estimate glomerular filtration rate. Am J Kidney Dis. 2002, 39: 920-929. 10.1053/ajkd.2002.32765.
Manjunath G, Tighiouart H, Ibrahim H, MacLeod B, Salem DN, Griffith JL, Coresh J, Levey AS, Sarnak MJ: Level of kidney function as a risk factor for atherosclerotic cardiovascular outcomes in the community. J Am Coll Cardiol. 2003, 41: 47-55. 10.1016/S0735-1097(02)02663-3.
Wallace C, Newhouse SJ, Braund P, Zhang F, Tobin M, Falchi M, Ahmadi K, Dobson RJ, Marcano AC, Hajat C, Burton P, Deloukas P, Brown M, Connell JM, Dominiczak A, Lathrop GM, Webster J, Farrall M, Spector T, Samani NJ, Caulfield MJ, Munroe PB: Genome-wide association study identifies genes for biomarkers of cardiovascular disease: Serum urate and dyslipidemia. Am J Hum Genet. 2008, 82: 139-149. 10.1016/j.ajhg.2007.11.001.
Dayan A, Bertrand R, Beauchemin M, Chahla D, Mamo A, Filion M, Skup D, Massie B, Jolivet J: Cloning and characterization of the human 5,10-methenyltetrahydrofolate synthetase-encoding cDNA. Gene. 1995, 165: 307-311. 10.1016/0378-1119(95)00321-V.
Wilcken DE, Wilcken B: The natural history of vascular disease in homocystinuria and the effects of treatment. J Inherit Metab Dis. 1997, 20: 295-300. 10.1023/A:1005373209964.
Robinson K, Gupta A, Dennis V, Arheart K, Chaudhary D, Green R, Vigo P, Mayer EL, Selhub J, Kutner M, Jacobsen DW: Hyperhomocysteinemia confers an independent increased risk of atherosclerosis in end-stage renal disease and is closely linked to plasma folate and pyridoxine concentrations. Circulation. 1996, 94: 2743-2748.
Dixon AL, Liang L, Moffatt MF, Chen W, Heath S, Wong KC, Taylor J, Burnett E, Gut I, Farrall M, Lathrop GM, Abecasis GR, Cookson WO: A genome-wide association study of global gene expression. Nat Genet. 2007, 39: 1202-1207. 10.1038/ng2109.
mRNA by SNP Browser v 1.0.1. [http://www.sph.umich.edu/csg/liang/asthma/]
Ji W, Foo JN, O'Roak BJ, Zhao H, Larson MG, Simon DB, Newton-Cheh C, State MW, Levy D, Lifton RP: Rare independent mutations in renal salt handling genes contribute to blood pressure variation. Nat Genet. 2008, Apr 6
McCarthy MI, Abecasis GR, Cardon LR, Goldstein DB, Little J, Ioannidis JP, Hirschhorn JN: Genome-wide association studies for complex traits: Consensus, uncertainty and challenges. Nat Rev Genet. 2008, 9: 356-369. 10.1038/nrg2344.
Stevens LA, Coresh J, Greene T, Levey AS: Assessing kidney function–measured and estimated glomerular filtration rate. N Engl J Med. 2006, 354: 2473-2483. 10.1056/NEJMra054415.
The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1471-2350/9/49/prepub
The Framingham Heart Study is supported by the National Heart, Lung and Blood Institute (N01-HC-25195). The Atherosclerosis Risk in Communities Study was supported by contracts N01-HC-55015, N01-HC-55016, N01-HC-55018, N01-HC-55019, N01-HC-55020, N01-HC-55021, and N01-HC-55022 with the National Heart, Lung, and Blood Institute. A.K. is supported by a German Research Foundation Fellowship. W.H.K. is supported by K01DK067207. We are indebted to the staff and participants in the Framingham Heart Study and the Atherosclerosis Risk in Communities Study for their important contributions.
The authors declare that they have no competing interests.
AK obtained funding, participated in conception and design of the study, selected SNPs for replication, performed the statistical analyses in the ARIC Study, and drafted the manuscript. WHLK was involved in ARIC funding and data collection, participated in the conception and design of the study and helped draft sections of the manuscript. S–JH assisted in the design of the study and in performing the statistical analyses for FHS. EB carried out genotyping for the ARIC Study and interpretation of study results. QY was involved in FHS study design and data analyses. DL contributed to the collection of data for FHS, interpretation of results, and critical edits to the manuscript. EJB participated in discussions of study design and provided critical revisions to the manuscript. MGL participated in the interpretation of results and revision of the manuscript. BCA was involved in ARIC funding, participated in the conception and design of the study and helped draft a section of the manuscript. JC was involved in ARIC funding and data collection, participated in the conception and design of the study and helped draft sections of the manuscript. CSF conceived of the study, helped acquire the data, interpret the findings, and helped to draft and critically revise the manuscript. All authors read and approved the final manuscript.
Josef Coresh and Caroline S Fox contributed equally to this work.
Electronic supplementary material
Additional file 3: Word document, Supplementary Table 3: Replication data of 16 SNPs significantly associated with renal traits on the FHS 100 K chip in 11,217 white participants at ARIC visit 1. Lists association results between SNPs and renal traits in white ARIC participants at ARIC visit 1, analogous to Table 3. (DOC 149 KB)
About this article
Cite this article
Kottgen, A., Kao, W.H.L., Hwang, S. et al. Genome-wide association study for renal traits in the Framingham Heart and Atherosclerosis Risk in Communities Studies. BMC Med Genet 9, 49 (2008). https://doi.org/10.1186/1471-2350-9-49
- Chronic Kidney Disease
- Generalize Estimate Equation
- Framingham Heart Study
- Serum Cystatin
- Kidney Disease Progression