- Research article
- Open Access
Genome-wide association study identifies PERLD1 as asthma candidate gene
BMC Medical Genetics volume 12, Article number: 170 (2011)
Recent genome-wide association studies (GWAS) for asthma have been successful in identifying novel associations which have been well replicated. The aim of this study is to identify the genetic variants that influence predisposition towards asthma in an ethnic Chinese population in Singapore using a GWAS approach.
A two-stage GWAS was performed in case samples with allergic asthma, and in control samples without asthma and atopy. In the discovery stage, 490 case and 490 control samples were analysed by pooled genotyping. Significant associations from the first stage were evaluated in a replication cohort of 521 case and 524 control samples in the second stage. The same 980 samples used in the discovery phase were also individually genotyped for purposes of a combined analysis. An additional 1445 non-asthmatic atopic control samples were also genotyped.
19 promising SNPs which passed our genome-wide P value threshold of 5.52 × 10-8 were individually genotyped. In the combined analysis of 1011 case and 1014 control samples, SNP rs2941504 in PERLD1 on chromosome 17q12 was found to be significantly associated with asthma at the genotypic level (P = 1.48 × 10-6, ORAG = 0.526 (0.369-0.700), ORAA = 0.480 (0.361-0.639)) and at the allelic level (P = 9.56 × 10-6, OR = 0.745 (0.654-0.848)). These findings were found to be replicated in 3 other asthma GWAS studies, thus validating our own results. Analysis against the atopy control samples suggested that the SNP was associated with allergic asthma and not to either the asthma or allergy components. Genotyping of additional SNPs in 100 kb flanking rs2941504 further confirmed that the association was indeed to PERLD1. PERLD1 is involved in the modification of the glycosylphosphatidylinositol anchors for cell surface markers such as CD48 and CD59 which are known to play multiple roles in T-cell activation and proliferation.
These findings reveal the association of a PERLD1 as a novel asthma candidate gene and reinforce the involvement of genes on the 17q12-21 chromosomal region in the etiology of asthma.
Asthma is a highly complex disease of airway inflammation. The intricacies of asthma are exemplified in its diverse clinical characteristics in terms of its triggers, symptoms and presentation, as well as how the disease is understood at the molecular level. More than 180 genes have been found to be associated with asthma [1–3]. Recent genome-wide association studies (GWAS) for asthma have identified novel genes such as ORMDL3 , CHI3L1  and DENND1B , and chromosomal regions such as 9q21.31 . These as well as other known associations to genes PDE4D , TGFB1, IL1RL1, IL18R1, DPP10 , and regions of the highly replicated 5q31-33 and HLA gene clusters [10, 11] have also been confirmed by other GWAS and meta-analyses of GWAS for asthma [12, 13]. The identification of many of these novel asthma candidates has widened the scope of possible mechanisms involved in its etiology. The premise that genome-wide association studies when used as tools to elucidate the genetic variants associated with complex genetic diseases, will lead to greater understanding of the underlying biology is being exemplified in studies on asthma.
Pooled GWAS have been used as an alternative to large-scale GWAS, especially when performed as part of a two-stage study design to retain sufficient power to detect association in a particular sample size . Since 2007, at least 15 separate pooled GWAS have shown significantly reasonable successes in identifying candidate genes for complex diseases such as autism , schizophrenia  and atopy . We have also previously shown that pooled genotyping on the Affymetrix SNP6.0 platform is sufficiently accurate and reliable in a case-control study design .
In this study, we have carried out a two-stage GWAS using pooled genotyping in the discovery stage which we followed up with individual genotyping in a replication sample set to replicate the associated signals (Figure 1). Additional genotyping was performed using the case and control samples used in the pools as well as in atopy control samples.
The case and control DNA samples used in this study were part of a larger cross-sectional epidemiological collection of unrelated adult ethnic Chinese students in the National University of Singapore who were recruited following standard protocols for written informed consent, over the span of four years. We have previously shown that there is negligible population stratification in samples from our study population . Approval to conduct the study was obtained from the National University of Singapore Institutional Review Board (NUS-IRB Reference Code: 07-023 and NUS10-343). Recruitment was performed in compliance with the Helsinki Declaration.
Genomic DNA was extracted from buccal cells obtained from a mouthwash in 0.9% saline solution. In short, the buccal cells were pelleted and lysed; DNA was extracted using the phenol-chloroform phase-separation technique , purified by two washes in ethanol, with the DNA pellet resuspended in reduced Tris-EDTA buffer. The quality of the genomic DNA samples was assessed by gel electrophoresis and through the measurement of light absorption at 260 and 230 nm wavelengths. Samples were also quantified in quintuplicate using the PicoGreen (Molecular Probes Inc. Eugene, Oregon) dsDNA quantification reagent. We have also previously shown that these buccal cell-derived genomic DNA samples were amenable to Sanger sequencing  and individual genome-wide analysis on the Illumina and Affymetrix platforms .
Samples were subsequently stratified into case and control groups according to their disease status based on an International Study of Asthma and Allergies in Childhood (ISAAC)-derived survey questionnaire  and doctor-diagnosis. A skin-prick test for common local allergens in Singapore (Dermatophagoides pteronyssinus, Blomia tropicalis, Elaeis guineensis and Curvularia lunata) was also performed as a measure of their atopic status. We have previously shown that skin-prick reactivity to dust mites Blomia tropicalis and Dermatophagoides pteronyssinus were highly sensitive and specific indicators for allergic sensitization in Singapore . Cases were defined as having asthma by a positive response to the question "Have you ever had asthma?", a doctor's diagnosis and a positive skin prick reaction (wheal diameter ≥ 3 mm) to either one of the dust mite allergens. In our sampling frame, 17% of the samples were considered to be cases. Conversely, a control answered "No" to the above question, did not have any symptoms of asthma and also showed no positive skin prick reactions to any of the allergens evaluated. About 17% of the sampled individuals were controls. A subset of the individuals completed lung function and peak flow reversibility test, as well as a complete physicians' assessment for asthma (data not shown). This is done to cross validate our diagnosis as well as classification of cases and controls. A third category of individuals without any symptoms of asthma or diagnosis but with positive skin-prick responses were selected as atopic controls. Table 1 summarizes the demographics of the population used in this study.
As our cross-sectional sampling was performed in various stages over a four year period, at the time we carried out the GWAS, a total of 980 case and control samples were available for genotyping. These samples were selected for pooling because they fell within a 5% error margin in the replicate measurements by PicoGreen. 100 ng from each DNA sample was pooled together. Pooled DNA samples were re-quantified by PicoGreen to ensure accuracy of pooling. Four pools of varying size (from 90-160 samples per pool) were created leading up to 490 samples for each study group.
Genotyping of the pooled DNA samples on the Affymetrix Genome-Wide Human SNP Array 6.0 (Affymetrix. Santa Carla, California) platform was outsourced to a local service provider (Origen Laboratories Pte Ltd) in Singapore. Each of the eight pooled samples was genotyped in triplicate, resulting in the use of 24 microarray chips to genotype 490 case and 490 control samples.
Allelotyping for pooled genotyping
Allele frequencies for the 906,600 SNPs on the Affymetrix SNP6.0 chip were estimated from probe intensities using the polynomial-based probe specific correction (PPC)  as previously described .
Individual genotyping for replication
Genomic DNA samples were diluted to 50 ng/μl prior to being outsourced to the DNA Sequencing and Genomics Facility at the University of Utah for individual genotyping on the Illumina BeadXpress platform using the GoldenGate assay (Illumina, Inc. San Diego, California) and to the SNP Research Facility at the Washington University in St Louis for genotyping on the Sequenom platform using the MassARRAY iPlex assay (Sequenom, Inc. San Diego, California). An additional 521 case and 524 control samples (together with the initial 490 case and 490 control samples making up a total of 1011 cases and 1014 controls) were individually genotyped to replicate the observed associations from the discovery stage. An additional 1445 atopic samples without asthma were also genotyped (Table 1).
Association analysis using the estimated allele frequencies from the pooled genotyping was performed in R  via a Z test and by generating a single-tail P value for significance.
The genotypes obtained in the replication stage were analysed using PLINK v1.07  using the full model association tests (--model) and via Fisher's exact test (--fisher). This permitted us to test for association under the general, multiplicative, additive dominant and recessive genetic models . P values from the tests for association under the different models are presented as P general , P multiplicative , P additive , P dominant and P recessive . P general and P multiplicative refer to the P values obtained from a Fisher's test performed on the genotype and allele counts respectively in cases and controls. P additive refers to the P value obtained from a Cochran Armitage Trend test for allele counts in cases and controls. P dominant refers to the P values obtained from a Fisher's test on the pooled genotype counts of the homozygous minor and heterozygous genotypes against the homozygous major genotype. P recessive refers to the P value from a Fisher's test on the genotype counts of the homozygous minor against the pooled counts of the heterozygous and homozygous major genotypes. Odds ratios are presented with the 95% confidence interval in parentheses. As we did not have any assumptions on the nature of the genetic association, association was tested for all the genetic models. However, only the general and multiplicative models were considered for determining whether an association was positive (or not). SNP call rates (--missing) and Hardy-Weinberg Equilibrium (--hardy) were also determined as quality control measures.
Functional prediction of variants
In this study, a pooled GWAS was performed on 490 cases and 490 controls on the Affymetrix SNP6.0 platform (Table 1). Cases were defined as atopic individuals who have ever had asthma, while controls were neither atopic nor have ever had symptoms of asthma. As only ethnic Chinese individuals were selected to participate in this study, there was no bias or stratification based on ethnicity. Secondly, as the study samples were sourced from a cross-sectional sampling frame, selection bias was also avoided. We have also previously highlighted that our study population was notably similar to the Han Chinese population in Hapmap (CHB) and that there was negligible population stratification within samples from our cross-sectional epidemiological collection .
Four pools (A-D in Figure 1) were created within each study group, and each pool was genotyped in triplicate to account for potential errors due to pooled genotyping. SNP call rates were better than 96% while the concordance of probe intensities across replicates was higher than 97.5%. We have previously shown that estimated allele frequencies from pooled genotyping on the Affymetrix SNP 6.0 platform were highly comparable to actual allele frequencies obtained from individual genotyping on the same platform . Estimated allele frequencies for SNPs were calculated from the measured probe intensities. The allele frequency estimates were averaged across the three replicates in each of the 4 pairs of case and control groups. These averaged estimated allele frequencies were compared between the study groups via a Z-test to generate 4 sets of P values.
Of the 906,600 SNPs on the Affymetrix SNP 6.0 array, 185,875 non-variable and 37,376 non-autosomal SNPs SNPs were discarded. The remaining 683,349 SNPs were sorted according to their P values to generate 4 SNP lists. As these P values were solely used to rank the SNPs for replication and weren't considered to be indicative of the strength of association, SNPs were selected for further study using a standard Bonferroni P value cut-off of 5.52 × 10-8 (α = 0.05 divided by 906,600 SNPs). Furthermore, as only estimated allele frequencies were available, the obtained P values could merely be considered indicative of association in a multiplicative allelic model. Common SNPs (> 5% based on Hapmap CHB) which were ranked below this cut-off were short listed from the 4 case-control comparisons. 19 SNPs were found to consistently appear in at least 2 of these SNP lists; 18 of which were consistently ranked below the cut-off in two case-control comparisons while 1 in three case-control comparisons (Table 2). These 19 SNPs were shortlisted for replication.
These 19 SNPs were individually genotyped in 1045 samples consisting of 521 case and 524 control samples (Table 1) in the replication stage. Genotyping success rate was more than 97% for all SNPs. None of the genotyped SNPs showed any major deviation (P value < 1 × 10-3) from Hardy Weinberg Equilibrium. Association with asthma was tested under the general and multiplicative genetic models  to account for the genotypic and allelic effect of these 19 SNPs (Table 3). Since the gender ratio was skewed in our sample set, we tested for any gender-related effects by performing the association tests in male and female samples separately. No gender-specific associations were found (data not shown).
After accounting for multiple testing, one SNP was found to be significant with a P value less than the Bonferroni-corrected value of 2.63 × 10-3 (0.05/19 SNPs). This SNP (rs2941504) falls within the gene PERLD1 on chromosome 17q12 and is coding synonymous. It showed positive association to asthma in the replication stage (P general = 2.60 × 10-3). While the minor G allele was present at a lower frequency in cases (MAFcase = 0.305) than in controls (MAFcontrol = 0.359) with an odds ratio of 0.784 (0.653-0.941), it was not significantly associated at the allelic level (P multiplicative = 9.26 × 10-3) after accounting for multiple testing.
The SNP rs2941504 was subsequently genotyped in the 490 case and 490 control samples used in the discovery phase, to improve on the sample numbers and in an attempt to obtain genotype data to perform a combined analysis. This permitted association testing in a total of 1011 case and 1014 control samples. Given this sample size, the frequency of the minor allele in our genotyped population (MAF = 0.350), the prevalence of asthma in our cross-sectional sampling frame (17%), and a genome-wide significance level (α = 5 × 10-8), we had 80% power, based on joint analysis in a two-staged study design , to detect associations with odds ratios greater than 1.4 . While either one of the stages of our study would not have been individually sufficiently powered, the combined analysis from both stages did have sufficient power to say that the discovered associations were indeed true.
The coding SNP, rs2941504, in the PERLD1 gene showed strong association (P general = 1.48 × 10-6) to asthma (Table 4) in the combined analysis. The genotypic odds ratio of the minor GG genotype as the reference revealed that it had a moderate effect of between 0.480 and 0.526 over the AA and AG genotypes respectively. The presence of the minor G allele at a significantly (P multiplicative = 9.56 × 10-6) higher frequency in controls than in cases (MAFcontrols = 0.381, MAFcases = 0.315) suggested that it conferred some protection against asthma (OR: 0.745 (0.654-0.848)). The effect of the minor allele is seen more pronounced in the recessive genetic model with a P recessive value of 4.34 × 10-7 with odds ratio of 0.503 (0.384-0.658). As there was no major deviation from Hardy Weinberg Equilibrium, the allelic association as tested by a Cochran-Armitage trend test generated a P value (P additive = 1.25 × 10-5) which was similar to that obtained from a Fisher's test on the allele counts.
Localizing the association signal
To confirm that this association on 17q12 was indeed at PERLD1 and not at other adjacent genes, tagging was performed in the 100 kb flanking SNP rs2941504 to identify additional SNPs for genotyping. Tagging performed using the Hapmap CHB population as a reference with r2 and MAF cut-offs of 0.8 and 0.05 respectively gave 7 SNPs (including rs2941504) to represent the region. The 6 additional SNPs were genotyped across 1011 case and 1014 controls (Table 4). After accounting for multiple testing, SNPs rs1877031 on STARD3 and rs1058808 on ERBB2 were found to have P values less than 8.33 × 10-3 (0.05/6 SNPs). To determine whether these additional associations were influenced by the strong linkage disequilibrium in the region to PERLD1 (Figure 2), logistic regression using SNP rs2941504 as a covariate was performed in PLINK (--logistic --condition). None of the SNPs remained significant in the regression analysis (Table 4). This confirmed that genome-wide association signal at 17q12 was indeed due to PERLD1, and not any other gene in the region.
Genotyping of Atopic Controls
The SNP rs2941504 was also genotyped in an additional 1445 atopic individuals without asthma (Table 1). To study the effect this SNP had on asthma without considering the effects of atopy, association was tested between the 1011 case and 1445 atopy control samples. Association was detected at the genotypic (P general = 2.25 × 10-2) and allelic (P multiplicative = 6.97 × 10-3) level with an allelic odds ratio of 0.845 (0.747-0.954). Similarly, the effect this SNP had at the level of atopy was also measured by testing for association between 1445 atopy controls and 1014 non-asthmatic non-atopic controls. Positive association was detected at the genotypic (P general = 1.96 × 10-3) as well as the allelic levels (P multiplicative = 3.87 × 10-2) with an allelic odds ratio of 0.882 (0.783-0.993) (Table 5).
At the advent of genome-wide association studies (GWAS), investigators saw it as a "magic bullet" to deciphering the intricacies of complex diseases. While the initial acceptance was slow due to the massive investment normally available only to large-scale consortia, early successes , advances in technology and highly reduced prices have enabled more than 800 GWAS to be conducted on more than 400 diseases to date . For asthma, recent reviews [34–36] have summarized the contribution of GWAS to the study of genetics of this disease. While different GWAS have identified numerous susceptibility variants, the results from these studies have not been able to fully explain the heritability of asthma and allergy. While epistasis, gene-environment interactions, rare and un-detected/un-known variants have been suggested to account for the "missing" heritability, none of these discount the ability of the GWAS to highlight important genes that are likely to be involved in susceptibility to the disease.
This paper presents a genome-wide case-control association study performed using pooled genotyping to study asthma in an ethnic Chinese population. Giampaolo Ricci et al. very recently also published a pooling-based GWAS on asthma. They tested 269 children of European descent comparing asthmatics and non-asthmatics with rhino-conjunctivitis . The strongest association detected was for a SNP on the C5 gene at a P value of 1.1 × 10-3. Our study improves on this with a larger sample size in the discovery as well as replication stages, which resulted in associations with higher levels of significance. Observations from our GWAS were that multiple SNPs throughout the genome (Table 2) passed the standard genome-wide threshold of 5.52 × 10-8 in multiple case versus control comparisons. Replication in a larger sample set via individual genotyping revealed that the strongest association signal was at PERLD1 on chromosome 17.
The 17q chromosomal region has been linked with asthma and allergy, via both linkage and candidate gene based studies in the past. Early genome-wide linkage studies identified associations of the 17q12-23 region to asthma and atopy in European populations [38, 39]. Two recent meta-analyses of genome-wide linkage screens for asthma and its associated phenotypes have also indicated the significance of the 17q12-q24 region with allergic asthma and associated sub-phenotypes [40, 41]. Candidate gene-based studies have identified genes such as CCL2 and CCL5 (17q11-q12), CCL11 and ITGB3 (17q21), and ACE (17q23), which have been found to be associated and well replicated in multiple populations [1, 2]. The first GWAS for childhood asthma identified ORMDL3 (and GSDMB) also present in the 17q21 region . This association has also been well replicated in multiple populations including Scottish, Northern Europeans, North Americans of European ancestry, African, Australian, French-Canadian, Mexicans, Puerto Ricans, and Koreans [12, 13, 42–46].
The PERLD1 SNP (rs2941504), whose association we identified and replicated, was also found to be associated with asthma in a number of other studies. In one of the studies which replicated the association on 17q21 to asthma , a GWAS was performed on 4917 case and 34,589 control samples from six European and one Korean populations. In this study, the presence of the A allele of SNP rs2941504 was reported to be significantly associated with asthma with P value of 1.2 × 10-8 and odds ratio of 1.29; this association remained significant (P = 3.2 × 10-2) even after adjusting for the effect of the originally associated ORMDL3 variant (P = 1.56 × 10-7). In another GWAS on asthma , 473 cases, 1892 population controls and 363 phenotyped controls of European descent were genotyped. While the strongest associations reported were to RAD50 (P = 3.04 × 10-7) and HLA-DQB1 (P = 9.55 × 10-6), two SNPs within PERLD1 (rs1565922 and rs2941503) also had P values which were less than 0.05 (P = 4.63 × 10-4 and P = 6.46 × 10-4). Most recently, a large-scale, consortium-based GWAS of asthma reiterated the significance of the 17q region in childhood asthma through the genotyping of more than 26,000 individuals  of European descent. In this latest study, genes such as ORMDL3 and GSDMB expectedly showed the strongest significance to childhood asthma (random-effect P values of 5.24 × 10-21 and 6.45 × 10-23 respectively). Interestingly, 4 SNPs within PERLD1 (including rs2941504 which we had found significant) were also presented with highly significant P values ranging from 1.26 × 10-7 to 1.74 × 10-10 with odds ratios around 0.84 (0.79-0.90). While none of the above mentioned studies explicitly mentioned PERLD1 being associated with asthma, the indicated P values were either listed in tables within the main text or in supplementary material. While many studies replicating the association of the 17q region to asthma have been published, association with SNPs in PERLD1 have only appeared in the above mentioned three. This suggests that despite the proximity to ORMDL3 and GSDMB, the association at PERLD1 might possibly be independent from them. In this study, the genome-wide signal whose association we confirmed was indeed at PERLD1 (Table 4) rather than anywhere else in the region despite the strong linkage disequilibrium to adjacent genes on 17q12 (Figure 2). In all, the results from these different studies rather effectively replicate the association that we independently discovered through our own GWAS.
There has been considerable evidence to suggest that allergic sensitization does not correlate well with allergic disease presentation [47, 48]. As such, many genetic studies choose to use case-control definitions based purely on doctor diagnosis or self-reported symptomology without taking into account the atopic condition. This is done to exclude any interference from genes involved in the general mechanisms of allergy. However, we feel that not accounting for allergic sensitization detracts the involvement of immunological mechanisms from the etiology of allergic disease. We might also be discrediting the possibility of atopic individuals developing asthma later on. While this was the impetus for our use of non-asthmatic, non-atopic controls in our study, we also genotyped an additional 1445 non-asthmatic atopic control samples. Doing so allowed us to see the effect our significant SNP had on atopy itself as well asthma while disregarding the effects of atopy. Positive associations at P values less than 0.05 were detected in both the asthma versus atopy and atopy versus control comparisons. This suggested that there were indeed significant differences between the diseased, healthy and "intermediate" conditions. This was not unexpected, as an association to allergic asthma would likely involve both an 'asthmatic' and an 'allergy' component. However, the difference in the P values obtained from a case versus control comparison (P general = 1.48 × 10-6), and those obtained from a case versus atopy (P general = 2.25 × 10-2) or atopy versus control (P general = 1.96 × 10-3) comparisons are suggestive that the association is likely to allergic asthma and not to either of the conditions individually (Table 5).
The PERLD1 gene is observed to be in complete linkage disequilibrium based on the Hapmap CHB reference population (Figure 3). In other words, rs2941504 was able to tag all the variation in the whole gene. This suggests that genotyping other variants within the gene would produce similar associations. Furthermore, any functional variants within the gene could possibly lead to variation in gene function. Sequence motif-based prediction of functionality of the SNPs within PERLD1 (based on Hapmap CHB) revealed that most of the SNPs were likely to be involved in splicing regulation, either as exon-splicing enhancer or exon-splicing silencer recognition sites (data not shown). Alternative splicing of the mRNA due to the effects of any combination of these SNPs can possibly result in structural and/or functional diversification of PERLD1.
PERLD1, originally discovered as PER1 in yeast, was found to be involved in the lipid remodeling of glycosylphosphatidylinositol (GPI) anchors . GPI-anchors are glycolipid moieties which are added on to the C-terminus of proteins during post-translational modification. Once attached, the GPI-anchor allows cell-surface proteins such as CD48 and CD59 to be attached onto the cell membrane . The importance of GPI-anchors in cell-signaling has been observed in mammalian T-cells where cell-surface molecules which were displayed via GPI-anchors lost their ability to stimulate T-cell activation and proliferation. This occurred when either the proteins themselves were inhibited by specific antibodies or their ability to be displayed on the cell surface was affected through the inhibition of GPI-anchor attachment to their c-terminus . CD48 was found to be a critical effector molecule in cell adhesion, pathogen recognition, and activation and regulation of various immunoregulatory pathways [52–55]. CD59, the principal ligand for CD2, has been shown to enhance T-cell activation and proliferation following cross-linking with CD2 . Both these GPI-anchored markers were shown to be involved in proliferation of T-cells in patients suffering from bone marrow failure  and paroxysmal nocturnal haemoglobinuria . The ability to regulate the activity and proliferative ability of T-cells through the control of GPI-anchor attachment suggest PERLD1's involvement in the inflammatory aspect of asthma.
We understand that the main limitation of this study is the pooling approach we took in the discovery phase. The use of pooled genotyping introduces errors due to array variation  as a result of differential hybridization of samples within the pool to the various probes of the array. Secondly, even though we had previously shown that the estimated allele frequencies obtained from pooled genotyping highly resemble the actual allele frequencies of the samples within the pool , we still suspected that they might represent the relative frequency of the allele within the pooled samples rather than the actual allele frequency of the samples. This could possibly result in an inflation of the differences in allele frequencies between case and control pools, and hence inflated P values and potential false positives. Lastly, the use of pooled samples means that actual genotypes will not be available thus preventing more detailed analysis of the nature or model of association.
We solved these issues by firstly genotyping each pool in triplicate and averaging across the replicates to account for the error due to the array itself. Secondly, we used the obtained P values only to rank the SNPs in terms of relative differences in allele frequencies within each pool pair; these P values were not considered to be indicative of strength of association. The commonly employed Bonferroni-corrected genome-wide cut-off was used to narrow down a list of potential hits. To filter out false positives, we compared the 4 separate lists of SNPs obtained from separate analysis of the 4 sets of pools (A-D), and shortlisted SNPs which appeared in 2 or more lists. We hypothesized that SNPs which passed the cut-off in more case-control comparisons were more likely to be truly associated and not just false positive hits. True enough, the association we are able to replicate (SNP rs2941504 on PERLD1) was the only one to appear in 3 out of the 4 sets of shortlisted SNPs; the P value in the 4th pool set (Pool A, Table 2) was also relatively lower than any of the other P values of the remainder 18 shortlisted SNPs. Furthermore, it was the only SNP out of the 19 shortlisted ones which remained significant in the replication stage, suggesting that all the other SNPs were likely to have been false positives. We also individually genotyped the samples used in the original pools to permit a combined analysis with the replication sample set which allowed the elucidation of the nature of and identification of the best fit model to explain the observed association. This allowed us to prove that the association at rs2941504 was indeed real, and not just an artefact of pooled genotyping in the discovery phase.
In this study, we have overcome the limitations of using pooled samples in the discovery phase, to identify a novel asthma gene whose association we replicated in an independent sample set; this association can also be corroborated by data from other studies. While it is the ORMDL3 and GSDMB genes in the 17q21 region that have been repeatedly linked with asthma in numerous other populations, it is not unreasonable to expect the immediately adjacent PERLD1 in the 17q12 region to also be directly associated with asthma in a Singapore Chinese population which has yet to be studied for the disease. The association of this gene offers new insights into the mechanisms of genetic susceptibility to asthma. While further studies are needed to investigate its functional relevance to the disease, our identification of PERLD1 as an asthma gene reinforces the importance of chromosome 17q12 in the etiology of the disease.
- ACE :
angiotensin I converting enzyme
- C5 :
complement component 5
- CCL11 :
chemokine (C-C motif) ligand 11
- CCL2 :
chemokine (C-C motif) ligand 2
- CCL5 :
chemokine (C-C motif) ligand 5
- CD2 :
Cluster of differentiation 2
- CD48 :
Cluster of differentiation 48
- CD59 :
Cluster of differentiation 59
- CHI3L1 :
chitinase 3-like 1
- DENND1B :
DENN/MADD domain containing 1B
- DPP10 :
- ERBB2 :
v-erb-b2 erythroblastic leukemia viral oncogene homolog 2
- GPI :
- GPI-AP :
- GSDMB :
- GWAS :
genome-wide association study
- HLA :
human leukocyte antigen
- HLA-DQB1 :
major histocompatibility complex, class II, DQ beta 1
- IL18R1 :
interleukin 18 receptor 1
- IL1RL1 :
interleukin 1 receptor-like 1
- ITGB3 :
integrin beta 3
- MHC :
major histocompatibility complex
- ORMDL3 :
- PDE4D :
- PERLD1 :
Per1-like domain containing 1
- RAD50 :
- SNP :
single nucleotide polymorphism
- STARD3 :
StAR-related lipid transfer (START) domain containing 3
- TGFB1 :
transforming growth factor beta 1.
Weiss ST, Raby BA, Rogers A: Asthma genetics and genomics 2009. Current Opinion in Genetics & Development. 2009, 19 (3): 279-282. 10.1016/j.gde.2009.05.001.
Vercelli D: Discovering susceptibility genes for asthma and allergy. Nature Reviews Immunology. 2008, 8 (3): 169-182. 10.1038/nri2257.
Gu ML, Dong XQ, Zhao J: New Insight Into the Genes Susceptible to Asthma. Journal of Asthma. 2010, 47 (2): 113-116. 10.3109/02770900903498442.
Moffatt MF, Kabesch M, Liang LM, Dixon AL, Strachan D, Heath S, Depner M, von Berg A, Bufe A, Rietschel E, et al: Genetic variants regulating ORMDL3 expression contribute to the risk of childhood asthma. Nature. 2007, 448 (7152): 470-U475. 10.1038/nature06014.
Ober C, Tan Z, Sun Y, Possick JD, Pan L, Nicolae R, Radford S, Parry RR, Heinzmann A, Deichmann KA, et al: Effect of variation in CHI3L1 on serum YKL-40 level, risk of asthma, and lung function. New England Journal of Medicine. 2008, 358 (16): 1682-1691. 10.1056/NEJMoa0708801.
Sleiman PMA, Flory J, Imielinski M, Bradfield JP, Annaiah K, Willis-Owen SAG, Wang K, Rafaels NM, Michel S, Bonnelykke K, et al: Variants of DENND1B Associated with Asthma in Children. New England Journal of Medicine. 2010, 362 (1): 36-44. 10.1056/NEJMoa0901867.
Hancock DB, Romieu I, Shi M, Sienra-Monge JJ, Wu H, Chiu GY, Li HL, del Rio-Navarro BE, Willis-Owens SAG, Weiss ST, et al: Genome-Wide Association Study Implicates Chromosome 9q21.31 as a Susceptibility Locus for Asthma in Mexican Children. PLoS Genetics. 2009, 5 (8):
Himes BE, Hunninghake GM, Baurley JW, Rafaels NM, Sleiman P, Strachan DP, Wilk JB, Willis-Owen SAG, Klanderman B, Lasky-Su J, et al: Genome-wide Association Analysis Identifies PDE4D as an Asthma-Susceptibility Gene. American Journal of Human Genetics. 2009, 84 (5): 581-593. 10.1016/j.ajhg.2009.04.006.
Wu H, Romieu I, Shi M, Hancock DB, Li HL, Sienra-Monge JJ, Chiu GY, Xu H, del Rio-Navarro BE, London SJ: Evaluation of candidate genes in a genome-wide association study of childhood asthma in Mexicans. Journal of Allergy and Clinical Immunology. 2010, 125 (2): 321-327. 10.1016/j.jaci.2009.09.007.
Li XN, Howard TD, Zheng SL, Haselkorn T, Peters SP, Meyers DA, Bleecker ER: Genome-wide association study of asthma identifies RAD50-IL13 and HLA-DR/DQ regions. Journal of Allergy and Clinical Immunology. 2010, 125 (2): 328-335. 10.1016/j.jaci.2009.11.018.
Noguchi E, Sakamoto H, Hirota T, Ochiai K, Imoto Y, Sakashita M, Kurosaka F, Akasawa A, Yoshihara S, Kanno N, et al: Genome-Wide Association Study Identifies HLA-DP as a Susceptibility Gene for Pediatric Asthma in Asian Populations. PLos Genetics. 2011, 7 (7):
Moffatt MF, Gut IG, Demenais F, Strachan DP, Bouzigon E, Heath S, von Mutius E, Farrall M, Lathrop M, Cookson W, et al: A Large-Scale, Consortium-Based Genomewide Association Study of Asthma. New England Journal of Medicine. 2010, 363 (13): 1211-1221. 10.1056/NEJMoa0906312.
Torgerson DG, Ampleford EJ, Chiu GY, Gauderman WJ, Gignoux CR, Graves PE, Himes BE, Levin AM, Mathias RA, Hancock DB, et al: Meta-analysis of genome-wide association studies of asthma in ethnically diverse North American populations. Nature Genetics. 2011, 43 (9): 887-892. 10.1038/ng.888.
Zhao YH, Wang S: Optimal DNA Pooling-Based Two-Stage Designs in Case-Control Association Studies. Human Heredity. 2009, 67 (1): 46-56. 10.1159/000164398.
Ronald A, Butcher LM, Docherty S, Davis OSP, Schalkwyk LC, Craig IW, Plomin R: A Genome-Wide Association Study of Social and Non-Social Autistic-Like Traits in the General Population Using Pooled DNA, 500 K SNP Microarrays and Both Community and Diagnosed Autism Replication Samples. Behavior Genetics. 2010, 40 (1): 31-45. 10.1007/s10519-009-9308-6.
Shifman S, Johannesson M, Bronstein M, Chen SX, Collier DA, Craddock NJ, Kendler KS, Li T, O'Donovan M, O'Neill FA, et al: Genome-wide association identifies a common variant in the reelin gene that increases the risk of schizophrenia only in women. PLoS Genetics. 2008, 4 (2):
Castro-Giner F, Bustamante M, Gonzalez JR, Kogevinas M, Jarvis D, Heinrich J, Anto JM, Wjst M, Estivill X, de Cid R: A pooling-based genome-wide analysis identifies new potential candidate genes for atopy in the European Community Respiratory Health Survey (ECRHS). BMC Medical Genetics. 2009, 10:
Anantharaman R, Chew FT: Validation of pooled genotyping on the Affymetrix 500 k and SNP6.0 genotyping platforms using the polynomial-based probe-specific correction. BMC Genetics. 2009, 10:
Andiappan AK, Anantharaman R, Nilkanth PP, Wang DY, Chew FT: Evaluating the transferability of Hapmap SNPs to a Singapore Chinese population. BMC Genetics. 2010, 11:
Moore D, Dowhan D: Unit 2.1A: Purification and Concentration of DNA from Aqueous Solutions. Current Protocols in Molecular Biology. 2002
Parate PN, Wang D, Chew FT: Linkage Disequilibrium Pattern in Asthma Candidate Genes from 5q31-q33 in the Singapore Chinese Population. Annals of Human Genetics. 2010, 74: 137-145. 10.1111/j.1469-1809.2009.00558.x.
Asher MI, Keil U, Anderson HR, Beasley R, Crane J, Martinez F, Mitchell EA, Pearce N, Sibbald B, Stewart AW, et al: International study of asthma and allergies in childhood (ISAAC): rationale and methods. European Respiratory Journal. 1995, 8: 483-491. 10.1183/09031936.95.08030483.
Chew FT, Lim SH, Goh DYT, Lee BW: Sensitization to local dust-mite fauna in Singapore. Allergy. 1999, 54 (11): 1150-1159. 10.1034/j.1398-9995.1999.00050.x.
Brohede J, Dunne R, McKay JD, Hannan GN: PPC: an algorithm for accurate estimation of SNP allele frequencies in small equimolar pools of DNA using data from high density microarrays. Nucleic Acids Research. 2005, 33 (17):
Team RDC: R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing. 2011
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, Maller J, Sklar P, de Bakker PIW, Daly MJ, et al: PLINK: A tool set for whole-genome association and population-based linkage analyses. American Journal of Human Genetics. 2007, 81 (3): 559-575. 10.1086/519795.
Lewis CM: Genetic association studies: Design, analysis and interpretation. Brief Bioinform. 2002, 3 (2): 146-153. 10.1093/bib/3.2.146.
F-SNP: A collection of functional SNPs, specifically prioritized for disease association studies. [http://compbio.cs.queensu.ca/F-SNP/]
Human Splicing Finder: an online bioinformatics tool to predict splicing signals. [http://www.umd.be/HSF/]
Zuo Y, Zou G, Zhao H: Two-Stage Designs in Case-Control Association Analysis. Genetics. 2006, 173: 1747-1760. 10.1534/genetics.105.042648.
Skol AD, Scott LJ, Abecasis GR, Boehnke M: Joint analysis is more efficient than replication-based analysis for two-stage genome-wide association studies. Nature Genetics. 2006, 38 (2): 209-213. 10.1038/ng1706.
Burton PR, Clayton DG, Cardon LR, Craddock N, Deloukas P, Duncanson A, Kwiatkowski DP, McCarthy MI, Ouwehand WH, Samani NJ, et al: Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature. 2007, 447 (7145): 661-678. 10.1038/nature05911.
Hindorff LA, Junkins HA, Hall PN, Mehta JP, Manolio TA: A Catalog of Published Genome-Wide Association Studies. Accessed Sept. 2010, [http://www.genome.gov/gwastudies/]
Sleiman PMA, Hakonarson H: Recent advances in the genetics and genomics of asthma and related traits. Current Opinion in Pediatrics. 2010, 22 (3): 307-312. 10.1097/MOP.0b013e328339553d.
Holloway JW, Yang IA, Holgate ST: Genetics of allergic disease. Journal of Allergy and Clinical Immunology. 2010, 125 (2): S81-S94. 10.1016/j.jaci.2009.10.071.
Postma DS, Koppelman GH: Genetics of Asthma: Where Are We and Where Do We Go?. Proceedings of the American Thoracic Society. 2009, 6 (3): 283-287. 10.1513/pats.200806-047RM.
Ricci G, Astolfi A, Remondini D, Cipriani F, Formica S, Dondi A, Pession A: Pooled Genome-Wide Analysis to Identify Novel Risk Loci for Pediatric Allergic Asthma. PLoS One. 2011, 6 (2): e16912-10.1371/journal.pone.0016912.
Dizier MH, Besse-Schmittler C, Guilloud-Bataille M, Annesi-Maesano I, Boussaha M, Bousquet J, Charpin D, Degioanni A, Gormand F, Grimfeld A, et al: Genome screen for asthma and related phenotypes in the French EGEA study. American Journal of Respiratory and Critical Care Medicine. 2000, 162 (5): 1812-1818.
Koppelman GH, Stine OC, Xu JF, Howard TD, Zheng SQL, Kauffman HF, Bleecker ER, Meyers DA, Postma DS: Genome-wide search for atopy susceptibility genes in Dutch families with asthma. Journal of Allergy and Clinical Immunology. 2002, 109 (3): 498-506. 10.1067/mai.2002.122235.
Bouzigon E, Forabosco P, Koppelman GH, Cookson W, Dizier MH, Duffy DL, Evans DM, Ferreira MAR, Kere J, Laitinen T, et al: Meta-analysis of 20 genome-wide linkage studies evidenced new regions linked to asthma and atopy. European Journal of Human Genetics. 2010, 18 (6): 700-706. 10.1038/ejhg.2009.224.
Denham S, Koppelman GH, Blakey J, Wjst M, Ferreira MA, Hall IP, Sayers I: Meta-analysis of genome-wide linkage studies of asthma and related traits. Respiratory Research. 2008, 9:
Galanter J, Choudhry S, Eng C, Nazario S, Rodriguez-Santana JR, Casal J, Torres-Palacios A, Salas J, Chapela R, Watson HG, et al: ORMDL3 gene is associated with asthma in three ethnically diverse populations. American Journal of Respiratory and Critical Care Medicine. 2008, 177 (11): 1194-1200. 10.1164/rccm.200711-1644OC.
Madore AM, Tremblay K, Hudson TJ, Laprise C: Replication of an association between 17q21 SNPs and asthma in a French-Canadian familial collection. Human Genetics. 2008, 123 (1): 93-95. 10.1007/s00439-007-0444-x.
Halapi E, Gudbjartsson DF, Jonsdottir GM, Bjornsdottir US, Thorleifsson G, Helgadottir H, Williams C, Koppelman GH, Heinzmann A, Boezen HM, et al: A sequence variant on 17q21 is associated with age at onset and severity of asthma. European Journal of Human Genetics. 2010, 18 (8): 902-908. 10.1038/ejhg.2010.38.
Bisgaard H, Bonnelykke K, Sleiman PMA, Brasholt M, Chawes B, Kreiner-Moller E, Stage M, Kim C, Tavendale R, Baty F, et al: Chromosome 17q21 Gene Variants Are Associated with Asthma and Exacerbations but Not Atopy in Early Childhood. American Journal of Respiratory and Critical Care Medicine. 2009, 179 (3): 179-185.
Sleiman PMA, Annaiah K, Imielinski M, Bradfield JP, Kim CE, Frackelton EC, Glessner JT, Eckert AW, Otieno FG, Santa E, et al: ORMDL3 variants associated with asthma susceptibility in North Americans of European ancestry. Journal of Allergy and Clinical Immunology. 2008, 122 (6): 1225-1227. 10.1016/j.jaci.2008.06.041.
Weinmayr G, Weiland SK, Bjorksten B, Brunekreef B, Buchele G, Cookson WOC, Garcia-Marcos L, Gotua M, Gratziou C, van Hage M, et al: Atopic sensitization and the international variation of asthma symptom prevalence in children. American Journal of Respiratory and Critical Care Medicine. 2007, 176 (6): 565-574. 10.1164/rccm.200607-994OC.
Simpson A, Tan VYF, Winn J, Svensen M, Bishop CM, Heckerman DE, Buchan I, Custovic A: Beyond Atopy Multiple Patterns of Sensitization in Relation to Asthma in a Birth Cohort Study. American Journal of Respiratory and Critical Care Medicine. 2010, 181 (11): 1200-1206. 10.1164/rccm.200907-1101OC.
Fujita M, Umemura M, Yoko-o T, Jigami Y: PER1 is required for GPI-phospholipase A(2) activity and involved in lipid remodeling of GPI-anchored proteins. Molecular Biology of the Cell. 2006, 17 (12): 5253-5264. 10.1091/mbc.E06-08-0715.
Okazaki I, Moss J: The Function of GPI-Anchored Proteins. PNH and the GPI-Linked Proteins. 2000, Moss NSYaJ: Academic Press
Loertscher R, Lavery P: The role of glycosyl phosphatidyl inositol (GPI)-anchored cell surface proteins in T-cell activation. Transplant Immunology. 2002, 9 (2-4): 93-96. 10.1016/S0966-3274(02)00013-8.
Munitz A, Bachelet I, Finkelman FD, Rothenberg ME, Levi-Schaffer F: CD48 is critically involved in allergic eosinophilic airway inflammation. American Journal of Respiratory and Critical Care Medicine. 2007, 175 (9): 911-918. 10.1164/rccm.200605-695OC.
Moran M, Miceli MC: Engagement of GPI-linked CD48 contributes to TCR signals and cytoskeletal reorganization: A role for lipid rafts in T cell activation. Immunity. 1998, 9 (6): 787-796. 10.1016/S1074-7613(00)80644-5.
Muhammad A, Schiller HB, Forster F, Eckerstorfer P, Geyeregger R, Leksa V, Zlabinger GJ, Sibilia M, Sonnleitner A, Paster W, et al: Sequential Cooperation of CD2 and CD48 in the Buildup of the Early TCR Signalosome. Journal of Immunology. 2009, 182 (12): 7672-7680. 10.4049/jimmunol.0800691.
Munitz A, Bachelet I, Levi-Schaffer F: CD48 as a Novel Target in Asthma Therapy. Recent Patents on Inflammation & Allergy Drug Discovery. 2007, 1: 9-12. 10.2174/187221307779815057.
Deckert M, Ticchioni M, Mari B, Mary D, Bernard A: The Glycosylphosphatidylinositol-Anchored CD59 Protein Stimulates Both T-Cell Receptor Zeta/Zap-70-Dependent and Zeta/Zap-70-Independent Signaling Pathways in T-Cell. European Journal of Immunology. 1995, 25 (7): 1815-1822. 10.1002/eji.1830250704.
Katagiri T, Qi ZR, Ohtake S, Nakao S: GPI-anchored protein-deficient T cells in patients with aplastic anemia and low-risk myelodysplastic syndrome: implications for the immunopathophysiology of bone marrow failure. European Journal of Haematology. 2011, 86 (3): 226-236. 10.1111/j.1600-0609.2010.01563.x.
Schubert J, Stroehmann A, Scholz C, Schmidt RE: Glycosylphosphatidylinositol (GPI)-Anchored Surface-Antigens in the Allogeneic Activation of T-Cells. Clinical and Experimental Immunology. 1995, 102 (1): 199-203.
Macgregor S: Most Pooling Variation in Array-Based DNA Pooling is Attributable to Array Error Rather Than Pool Construction Error. European Journal of Human Genetics. 2007, 15: 501-504. 10.1038/sj.ejhg.5201768.
The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1471-2350/12/170/prepub
This work was funded by grants from the Biomedical Research Council Singapore and Singapore Immunology Network (SIgN-06-006 and SIgN-08-020), and the National Medical Research Council Singapore (NMRC/1150/2008). We would like to thank all the subjects who participated in this study. We would also like to thank the National University of Singapore for the Graduate Research Scholarship funding the PhD students (RA, AKA, PNP and BKS) involved in this study.
The authors declare that they have no competing interests.
RA, AKA, PNP and BKS were involved in the design of the study, recruitment of participants for the study, and extraction of DNA samples. RA also processed and pooled the DNA samples, performed the statistical analysis and wrote the manuscript. FTC and DYW conceived, designed and planned the study and edited the manuscript. All authors have read and approved the final manuscript.
About this article
Cite this article
Anantharaman, R., Andiappan, A.K., Nilkanth, P.P. et al. Genome-wide association study identifies PERLD1 as asthma candidate gene. BMC Med Genet 12, 170 (2011). https://doi.org/10.1186/1471-2350-12-170
- Allergic Asthma
- Estimate Allele Frequency
- Replication Stage
- Pool Genotyping