Whole-exome sequencing of a pedigree segregating asthma

DeWan, Andrew T; Egan, Kathryn Brigham; Hellenbrand, Karen; Sorrentino, Keli; Pizzoferrato, Nicole; Walsh, Kyle M; Bracken, Michael B

doi:10.1186/1471-2350-13-95

Research article
Open access
Published: 09 October 2012

Whole-exome sequencing of a pedigree segregating asthma

Andrew T DeWan¹,
Kathryn Brigham Egan²,
Karen Hellenbrand²,
Keli Sorrentino²,
Nicole Pizzoferrato¹,
Kyle M Walsh^1,3 &
…
Michael B Bracken²

BMC Medical Genetics volume 13, Article number: 95 (2012) Cite this article

6986 Accesses
34 Citations
5 Altmetric
Metrics details

Abstract

Background

Despite the success of genome-wide association studies for asthma, few, if any, definitively causal variants have been identified and there is still a substantial portion of the heritability of the disease yet to be discovered. Some of this “missing heritability” may be accounted for by family-specific coding variants found to be segregating with asthma.

Methods

To identify family-specific variants segregating with asthma, we recruited one family from a previous study of asthma as reporting multiple asthmatic and non-asthmatic children. We performed whole-exome sequencing on all four children and both parents and identified coding variants segregating with asthma that were not found in other variant databases.

Results

Ten novel variants were identified that were found in the two affected offspring and affected mother, but absent in the unaffected father and two unaffected offspring. Of these ten, variants in three genes (PDE4DIP, CBLB, and KALRN) were deemed of particular interest based on their functional prediction scores and previously reported function or asthma association. We did not identify any common risk variants segregating with asthma, however, we did observe an increase in the number of novel, nonsynonymous variants in asthma candidate genes in the asthmatic children compared to the non-asthmatic children.

Conclusions

This is the first report applying exome sequencing to identify asthma susceptibility variants. Despite having sequenced only one family segregating asthma, we have identified several potentially functional variants in interesting asthma candidate genes. This will provide the basis for future work in which more families will be sequenced to identify variants across families that cluster within genes.

Peer Review reports

Background

The genome-wide association approach to uncovering the genetic factors influencing common phenotypes is premised on the “common disease, common variant” hypothesis in which alleles that are common in the population (minor allele frequency [MAF] > 1%) will be associated with these phenotypes. At least twelve genome-wide association studies (GWAS) of asthma have been conducted and have yielded numerous associations, with the most significant (and in most cases replicated) associations occurring in or near the following genes: ORMDL3[1, 2], PDE4D[3], HLA-DRB1[2], HLA-DQ[2, 4], RAD50-IL13[4], DENND1B[5], TLE4[6], SMAD3[2], IL1RL1[7], IL18R1[2], IL33[2], IL2RB[2], RORA[2], and SLC22A5[2]. These findings have greatly expanded our understanding of the disease, having identified several novel genetic loci that had never previously been implicated in the pathogenesis of asthma (e.g. ORMDL3, RAD50, DENND1B, TLE4). Despite these successes, no definitive causal variants have been identified in any of these genes. It is asserted that the associated variants are in LD with the causal variants in these genes, but more effort must be made to identify causal variants so that we can begin to understand the biology of these genes in the etiology of asthma.

It is now recognized that GWAS are not able to identify all or even the majority of the genetic variants contributing to a disease phenotype [8–10]. The proportion of the heritability of a phenotype not explained by known genetic variants has been termed the “missing heritability.” AMD is so far the disease with the highest proportion of heritability explained by known loci, but even then only explains ~50% of the heritability [11]. Large sample sizes have allowed us to identify loci with small effect sizes yielding p-values well below the typical 5 × 10^-8, required for a scan of one million single-nucleotide polymorphisms (SNPs), but we are no longer identifying major contributors to phenotype variation. In the case of height, across two studies, a total of 60,000 subjects were genotyped, but significantly associated, and replicated, variants in two different genes only explained between 0.3 and 0.5% of the variation [12, 13]. For asthma, the population attributable fraction estimates for known asthma loci range from 3.9 to 24% [14], while explaining more risk than typical height loci, still indicate a substantial proportion of missing heritability. This suggests that a paradigm shift in our search for susceptibility loci for common diseases is warranted.

One explanation for the lack of functional variant identification is that the degree of genetic heterogeneity for common diseases is markedly higher than previously thought, possibly due to the presence of rare or even family-specific mutations with a large effect [15]. Our hypothesis is that individual families segregate “family-specific” variants contributing to asthma susceptibility and that at least one family-specific variant is necessary but not sufficient for disease development within individuals in the family. It is when these family-specific mutations occur in the context of common asthma susceptibility variants that the disease will develop.

To identify family-specific variants segregating with asthma, we recruited one family originally identified via one asthmatic child enrolled in the Perinatal Risk of Asthma in Infants of Asthmatic Mothers (PRAM) study [16] as reporting multiple asthmatic and non-asthmatic children. We performed whole-exome sequencing on all four children and both parents.

Methods

Patient recruitment/sample collection

The recruitment, questionnaire, blood collection and exome sequencing were approved by the Yale University Human Investigation Committee. The family was identified from information collected as part of the Perinatal Risk of Asthma in Infants of Asthmatic Mothers (PRAM), a study designed to assess the risk of asthma in children born to asthmatic mothers. We identified one family in this study that originally reported having two asthmatic children and two non-asthmatic children. The family was recontacted by a research associate to explain the study and determine the family’s interest in participating. A trained interviewer and phlebotomist visited the family in their home to conduct the interview and collect blood samples from each family member. The phlebotomist drew 10mL of blood from each family member in purple top EDTA tubes. The phlebotomist stored the blood on ice and delivered it to the lab within 24 hours. Each blood sample was divided into 1mL aliquots. DNA was extracted from one 1mL aliquot using the QIAGEN Blood Maxi kit. Each DNA sample was run on a 1% agarose gel to determine if it was of high quality (strong high molecular weight [> 10kb] band and no smear). DNA was quantified using TaqMan RNase P Detection Reagent Kit (Applied Biosystems).

Phenotype collection

The interviewer conducted an interview with each family member to assess their asthma diagnosis, symptoms and medication. For each subject, the interviewer asked questions about their asthma and/or allergies. The subject was asked about asthma diagnosis by a physician, date of diagnosis and name of physician. Whether or not they had been diagnosed, the subject was asked about any of the following symptoms: wheeze, persistent cough, chest tightness or shortness of breath. A detailed description of the frequency and duration of symptoms for the past 12 months was obtained. If the subject had not experienced symptoms in the past 12 months, then the dates when symptoms last occurred, and a description of their frequency and duration at that time was recorded. The interviewer recorded all asthma medications used in the past 12 months; route of administration (nebulizer, inhaler, oral) and frequency of use. If no medication had been used in the past year, the date medication was last used, name of the medication and frequency of use were ascertained. Questions were asked about the occurrence and frequency of upper respiratory infections in the past 12 months, and life-time lower respiratory infections, particularly: respiratory syncytial virus (RSV), bronchitis, bronchiolitis, pneumonia and croup. Information was gathered about hospitalizations and emergency room visits for asthma, and other respiratory illnesses. Physician visits in the past 12 months and treatment by a pulmonologist, allergist or asthma specialist were recorded. Questions were asked to determine whether their physical activity is limited by asthma, or asthma symptoms. For children, the home interview also obtained information on post-natal risk factors for asthma development (e.g. newborn intensive care nursery stays).

Sequencing

Five micrograms of DNA was submitted to the Yale Center for Genome Analysis. One microgram of genomic DNA was sheared to a mean fragment length of 140 base pairs using focused acoustic energy (Covaris E210, part #5000003). Fragmented DNA samples were then transferred to a 96-well plate and library construction was completed using a liquid handling robot (Caliper Sciclone, part #SG3-11020-0100). Magnetic AMPure XP beads (Beckman Coulter, part #63882) were used to purify the sheared DNA samples and remained with the sample throughout library construction. Following each process step, DNA was selectively precipitated by weight and re-bound to the beads through addition of a 20% polyethylene glycol, 2.5 M NaCl solution. Following fragmentation, T4 DNA polymerase and T4 polynucleotide kinase blunt ended and phosphorylated the fragments. The large Klenow fragment then added a single adenine residue to the 3' end of each fragment and custom adapters (IDT) were ligated using T4 DNA ligase. Adapter-ligated DNA fragments were then amplified via the polymerase chain reaction (PCR) using custom-made primers (IDT). During PCR, a unique 6 base index was inserted at one end of each DNA fragment. Sample concentration and insert size distribution were determined using the Caliper LabChip GX system (Caliper, part #122000/B). Samples yielding at least 1 μg of amplified DNA were used for capture.

Five hundred nanograms of prepared genomic DNA library was lyophilized with Cot-1 DNA and custom adapter blocking oligos (IDT). The dried sample was reconstituted according the manufacturer's protocol (Roche/Nimblegen), heat-denatured, and mixed with biotinylated DNA probes produced by Nimblegen (Nimblegen, SeqCap EZ Exome version 2, part #05860504001). Hybridizations were performed at 47°C for 68 hours. Once the capture was complete the samples were mixed with streptavidin-coated beads and washed with a series of stringent buffers to remove non-specifically bound DNA fragments. The captured fragments were PCR amplified and purified with AMPure XP beads. Capture efficiency was evaluated by quantitative PCR (Roche Light Cycler 480, part #5015243001). Equal amounts of pre- and post-capture libraries were evaluated at 4 sites to confirm successful exome enrichment and at 2 other sites to show non-exome de-enrichment in the captured sample relative to the pre-capture library. All samples met appropriate cut-offs for both and were quantified by qRT-PCR using a commercially available kit (KAPA Biosystems, part #KK4601). Insert size distribution was determined with the LabChip GX.

Sample concentrations were normalized to 2 nM, combined accordingly for the number of samples to be sequenced per lane, and loaded onto Illumina version 3 flow cells at a concentration that yields 170–200 million passing filter clusters. The samples were sequenced using 75bp paired-end sequencing on an Illumina HiSeq 2000 according to Illumina protocols. The 6 base pair index was read during an independent sequencing read that automatically follows the completion of read 1 and uses an additional sequencing primer (Illumina, part #15019606).

Signal intensities were converted to individual base calls on the machine during a run using the system's Real Time Analysis (RTA) software. Sample de-multiplexing was performed using Illumina's CASAVA 1.8 software suite and FASTQ files for each sample were produced.

Alignment, variant identification and filtering

Reads were aligned using the Burrows-Wheeler Algorithm (BWA) [17] to the reference human genome (build 37). Alignments for each sample were converted to BAM format, sorted, indexed, PCR duplicates marked and then merged into one BAM file for all six sample using Samtools [18]. Alignments in the the combined BAM file were then locally realigned around insertions/deletions (indels), recalibrated, and variants called (SNPs and indels using the UnifiedGenotyper) for the all six samples together using the utilities in the Genome Analysis Toolkit (GATK) [19, 20].

Variants were filtered and flagged as low quality using the following metrics: three or more variants detected within 10bp; four or more alignments map to different locations equally well; coverage less than five reads; quality score < 50; low quality for a particular sequence depth (variant confidence/unfiltered depth < 1.5); and strand bias (Phred-scaled p-values using Fisher’s Exact Test > 200). A variant flagged for any ONE of these filters was labeled ‘low quality’ and not considered further in this analysis.

Annotation

Variants were annotated using ANNOVAR [21] for function (exonic or splicing); gene; exon function (synonymous, nonsynonymous, stopgain, nonframeshift or frameshift indel); amino acid change; conservation; allele frequency in 1000 Genomes Project; dbSNP reference number; functional prediction scores (SIFT [prediction of a change being damaging (> 0.95) or tolerated (< 0.95)], Polyphen2 [prediction if a change is damaging (> 0.85), possibly damaging (0.85-0.15) or benign (< 0.15)], LRT [likelihood ratio test for codon constraint ranging from 0–1 with larger scores indicating constrained], MutationTaster [prediction of a disease causing variant, 1-pvalue], PhyloP [prediction of a conserved (> 0.95) or non-conserved (< 0.95) site]) [22]; and chromosome position.

Validation genotyping

To validate the genotypes of the variants in PDE4DIP, CBLB and KALRN we designed a custom TaqMan assay (Applied Biosystems) targeting each of these variants. Each sample was run in triplicate. Fourteen additional samples were run to aid in clustering genotypes and act as negative controls as they were only expected to be homozygous wild-type.

Candidate gene variant counts

We previously identified 251 asthma candidate genes from the literature [23]. We queried the list of 38,103 variants that passed the QC filters to identify those variants that were annotated as being within or near one of these genes. For each subject, we counted the number of non-referent alleles at each variant annotated within one of the 251 genes. We then restricted this to only novel variants (not contained within dbSNP). Finally, we restricted this to novel and non-synonymous variants. In order to compute the statistical significance of the number of novel or novel and nonsynonymous variants among the case children compared to control, we used the Pearson χ2 test and also calculated odds ratios (OR) and 95% confidence intervals (95%CI). We compared the total number of novel rare alleles (n=16 among cases, n=8 among controls) to the total number of rare alleles identified minus the number of novel rare alleles (n=56767 among cases, n=55742 among controls). The same calculation was also performed for the variants classified as novel and nonsynonymous.

de novoVariant quality control

Due to the difficulty identifying de novo variants from false positive genotype calls, we imposed strict quality control criteria. These were based, in part, on the quality control measures used in Neale et al. [24]. For a variant to be considered de novo we required it to meet the following criteria: 1) more than 10 reads for the mother, father and child carrying the de novo mutation; 2) mother or father could have no more than 5% of the total reads being from the non-referent allele; 3) if the de novo variant in the child was heterozygous, the variant could have no more than 70% of the reads being from the referent allele.

Results

Two children reported a doctor’s diagnosis of asthma and allergies and one of the two also reported taking albuterol (up to 14 times per month) and a generic allergy medication (1–4 times per month) in the preceding 12 months. The other asthmatic child was reported (per the mother) as having wheeze and persistent cough during all seasons of the first year of life. The two remaining children reported not having a doctor’s diagnosis of asthma, no allergies and no asthma or allergy medication use. The mother reported having a doctor’s diagnosis of asthma and allergies, and taking albuterol daily or almost daily during the previous 12 months. The father reported not having a doctor’s diagnosis of asthma, but reported having allergies and taking a generic allergy medication 1–4 times in one of the preceding 12 months. See Table 1 for demographic and phenotypic details.

Table 1 Demographic and phenotype information

Full size table

Overall, we attained high coverage of the exome, with four samples achieving >100 million 76 bp paired-end reads and the other two with more than 52 and 89 million reads (Table 2). For called variants, the average coverage was more than 100X for all subjects and among variants that passed all filters more than 110X (Table 2). It should be noted that Child 3 had fewer reads and lower coverage, however this only resulted in ~2% fewer called variants within exons that passed our stringent quality control filters. There were 55,370 variants called that were variable in one or more of the samples. After applying stringent quality control filters to eliminate low quality variants, 38,103 remained. The depth of coverage (i.e. the number of times an individual base was sequenced) of variants was high in all samples, with average per-sample depth of coverage of variants ranging from 92X to 210X among the high quality variants (Table 2). Of these high quality variants, the majority were within exons (66.3%), the 3’ untranslated regions (12.8%), introns (9%), 5’ untranslated regions (6%), or exons of non-coding RNAs (3.7%) (Table 3). Less than 1% of all high quality variants were in regions not considered to be within or around genes (Table 3).

Table 2 Number of reads, variants and variant coverage by sample

Full size table

Table 3 Distribution of variant types

Full size table

Given our interest in functional, family-specific variants, we focused on nonsynonymous variants that were not contained in dbSNP, of which there were 531. We then looked at variants where the two affected children and the mother had at least one minor allele, but the unaffected children and the father had at least one less minor allele compared to the mother and affected children. There were 14 variants that met these criteria, however, four of these variants were identified in the 1000 genomes project and were excluded leaving ten variants for further consideration (Table 4). In all cases, affected individuals were heterozygous and unaffected individuals were homozygous wild-type. Based on the functional prediction scores (two or more algorithms indicate a high probability of functionality (marked with an asterisk in Table 4), four variants in four different genes were deemed functionally interesting (PDE4DIP, CBLB, KALRN, GALNTL6). No mutation was predicted to be fuctional by all programs, and none by more than three of the five programs. The isoleucine to leucine change in PDE4DIP was predicted to be functional by Mutation Taster (high probability of a disease causing site) and LRT (evidence of codon constraint). The CBLB mutation of aspartic acid to alanine was predicted to be functional by PhyloP (highly conserved among 46 vertebrate species), Mutation Taster and LRT. The leucine to phenylalanine mutation in KALRN was supported by PhyloP and LRT, whereas the isoleucine to valine mutation in GALNTL6 was supported by SIFT (high probability of a damaging change) and PhyloP. Three of these genes (PDE4DIP, CBLB, and KALRN) are of increased interest based on either their function or previous work in asthma. All three SNPs were re-genotyped using TaqMan and for each SNP the mother and both affected children were heterozygous, while the father and two unaffected children were homozygous wild-type (Table 5). In addition, we genotyped four SNPs that have been reported to be associated with asthma, rs7216389 in ORMDL3, rs11684634 in PDE11A, rs1544791 in PDE4D and rs2706347 in RAD50 (Table 5). We did not observe the risk allele segregating with the affection status for any of these “common” asthma variants.

Table 4 Family-specific variants identified in mother and two affected children and variant annotations from ANNOVAR

Full size table

Table 5 Genotypes at exome sequence candidate SNPs and four additional asthma associated SNPs

Full size table

To determine if the asthmatic children carried more rare variants in asthma candidate genes than non-asthmatic children, we queried the 251 asthma candidate genes from our previous study [23] to determine the number of variants each child in this family had across this set of genes. This was done for all variants regardless of function or frequency, only variants classified as novel (not observed in dbSNP), and only novel and nonsynonymous variants. While one of the affected children had the most number of rare alleles across candidate genes (n=427), one of the unaffected children had the second highest number of rare alleles (n=422). However, when we limited this to either novel variants or novel and nonynonymous variants, the affected children had more rare alleles than unaffected children in both categories (Table 6), but this difference did not reach statistical significance for either novel (p=0.112) or novel and nonsynonymous (p=0.097) when compared to the total number of variants passing QC among the affected and unaffected children. Despite the lack of statistical significance the direction of the association indicated that the number of rare variants increased risk among both the novel (OR=1.964; 95%CI: 0.84-4.58) and novel and nonsynonymous (OR=2.357; 95%CI: 0.83-6.69).

Table 6 Number of minor alleles across 251 asthma candidate genes

Full size table

One advantage that we had sequencing an entire nuclear family was the ability to identify de novo variants, variants identified in the children not observed in either parent. We identified 279 de novo variants that were present in one or more of the children, but not in the parents that passed our initial QC. Overall these variants had low coverage with an average of 13.5X and drastically higher than the < 1 event per subject observed in recent studies [24, 25]. When we imposed strict quality control criteria (see Methods) to distinguish true positive de novo events from false positive, this was reduced to three de novo variants in three subjects. However, it is reasonable to assume that de novo events are likely not to have been previously ovbserved. Therefore, we restricted this to only variants not in dbSNP, leaving two de novo variants (Table 7). These two variants had high overall coverage with an average of 212X across the six family members. These two variants were both nonsynonymous mutations, the first in the gene DST in a non-asthmatic child and the second in the gene MEF2A in an asthmatic child.

Table 7 De novo variants

Full size table

Discussion

We recruited one family in which the mother and two children reported a doctor’s diagnosis of asthma, but the father and two additional children had no doctor’s diagnosis of asthma. After subjecting each family member’s genome to whole exome sequencing, we identified ten novel, nonsynonymous variants that segregated perfectly with asthma. Three of these variants had high probability to result in deleterious protein coding changes by two or more functional prediction algorithms and were also plausible asthma candidate genes based on their function or previous asthma association studies.

PDE4DIP, also known as myomegalin, is a golgi associated protein that is found to interact with a member of the phosphodiesterase superfamily of proteins, PDE4D, in the sarcomeric structure of skeletal muscle [26]. It has been shown to regulate cardiac contractility through its activity by phosphorylating Cardiac Myosin Binding Protein-C (cMyBPC) and cardiac troponin (cTNI) [27]. While no studies have shown expression of PDE4DIP in lung tissue, interest in this gene potentially playing a role in asthma susceptibility stems from the fact that it interacts with PDE4D. A GWAS identified an association between childhood asthma and variants in PDE4D and was replicated in several populations [3].

CBLB codes for the protein E3 ubiquitin ligase Cbl-b a member of the Cbl family of proteins. Of particular interest is its reported role in the regulation of the T cell response through promotion of the clearance of the T-cell receptor from the cell surface [28]. It has previously been reported that a region of chromosome 7 containing the T-cell receptor gamma (TCRγ) gene is associated with asthma [29, 30].

KALRN codes for the protein kalirin. Reduced levels of kalirin have been associated with increased inducible nitric oxide synthetase (iNOS) activity [31] and promoter polymorphisms in the iNOS gene have been associated with allergic asthma severity [32]. Evidence for an involvement of KALRN is further supported by the finding of a nominal association with a single-nucleotide polymorphism in KALRN in our previously published GWAS of childhood asthma [16].

The focus of GWAS has been on identifying common risk variants associated with disease. However, recent large-scale sequencing studies suggest that the majority of variation present in the genome is rare, and estimates of frequency of novel variants are as high as 82% [33, 34]. Given that the largest proportion of variants in the genome may be novel, we chose to focus our attention on variants not present in variant databases. It can be argued that there may be rare, but not novel, variants segregating with asthma in this family, but at least the large consortium-based GWAS for asthma was likely sufficiently powered to identify associations with even these rare variants [2]. This allowed us to focus on variants that might only be identified through the use of a family-based design.

As expected, common risk variants in four asthma candidate genes did not segregate with asthma in this family. It appears as if one of the unaffected children (child 4) had fewer risk alleles (n=2) than the other children in this family (each had four or more), but it is not possible to conclude if this can account for the phenotypic differences. However, we did observe that the affected children had more rare alleles within a wider array of asthma candidate genes compared to the unaffected children. While this data is inconclusive, there appears to be a trend of more rare variants in asthma candidate genes among case children than control children suggesting that a combination of rare variants across multiple genes may be contributing to asthma susceptibility. This is a hypothesis that will need to be tested in a larger dataset.

One advantage of this family-based design is that we had the ability to identify de novo mutation events that could be potentially contributing to asthma susceptibility in the affected children or protecting against asthma in the unaffected children. We identified two novel variants in two genes, DST and MEF2A, that had high coverage and could potentially be risk, MEF2A, or protective, DST, asthma variants. Both of these variants were nonsynonymous variants, but neither were in genes previously associated with asthma or had an obvious function relationship with asthma. MEF2A codes for a DNA-binding transcription factor and is primarily found in proliferating smooth muscle cells in humans [35]. Mutations in MEF2A have been associated with with coronary artery disease (CAD) [36, 37], however, further evidence suggests that this is not a common cause of CAD among whites [38]. DST codes for the protein, dystonin, a member of the plankin family of proteins, whose members differentially express splice variants involved in the cytoskeletal needs of various specialized cells [39]. A homozygous single base pair deletion in DST was identified as the causal mutation in a family segregating a lethal form of heredity sensory autonomic neuropathy (HSAN) [40].

While many more families will be needed to demonstrate the involvement of any of these genes in asthma, this is the first report of a comprehensive examination of exonic variants within a family segregating asthma and one of the first to describe this for a common, non-Mendelian disease. We chose to focus here on family-specific variants, but asthma development is likely influenced by both family-specific variants in addition to more common asthma susceptibility variants. Future studies will need to focus on identifying family-specific variants across families that are all either within the same gene or within genes in the same genetic pathway. Finally, these family-specific variants will need to be analyzed within the context of common asthma susceptibility variants, the majority of which are non-exonic, to yield a more comprehensive view of genetic contributors to asthma across a range of effect sizes. Whole-genome sequencing will enable such future work.

Conclusions

This is the first report applying exome sequencing to identify asthma susceptibility variants. Despite having sequenced only one family segregating asthma, we identified several potentially functional variants in three interesting asthma candidate genes, PDE4DIP, CBLB and KALRN. This will provide the basis for future work in which more families will be sequenced to identify variants across families that cluster within genes.

Author’s contributions

ATD, KMW and MBB conceived and designed the study. KBE, KH and KS recruited subjects. NP performed all laboratory work. ATD performed all data analysis. ATD and MBB interpreted the results. ATD drafted the manuscript. All authors revised the manuscript for important intellectual content, read and approved the final manuscript.

References

Moffatt MF, Kabesch M, Liang L, Dixon AL, Strachan D, Heath S, Depner M, von Berg A, Bufe A, Rietschel E, et al: Genetic variants regulating ORMDL3 expression contribute to the risk of childhood asthma. Nature. 2007, 448 (7152): 470-473.
Article CAS PubMed Google Scholar
Moffatt MF, Gut IG, Demenais F, Strachan DP, Bouzigon E, Heath S, von Mutius E, Farrall M, Lathrop M, Cookson WO: A large-scale, consortium-based genomewide association study of asthma. N Engl J Med. 2010, 363 (13): 1211-1221.
Article CAS PubMed PubMed Central Google Scholar
Himes BE, Hunninghake GM, Baurley JW, Rafaels NM, Sleiman P, Strachan DP, Wilk JB, Willis-Owen SA, Klanderman B, Lasky-Su J, et al: Genome-wide association analysis identifies PDE4D as an asthma-susceptibility gene. Am J Hum Genet. 2009, 84 (5): 581-593.
Article CAS PubMed PubMed Central Google Scholar
Li X, Howard TD, Zheng SL, Haselkorn T, Peters SP, Meyers DA, Bleecker ER: Genome-wide association study of asthma identifies RAD50-IL13 and HLA-DR/DQ regions. J Allergy Clin Immunol. 2010, 125 (2): 328-335. e311
Article CAS PubMed PubMed Central Google Scholar
Sleiman PM, Flory J, Imielinski M, Bradfield JP, Annaiah K, Willis-Owen SA, Wang K, Rafaels NM, Michel S, Bonnelykke K, et al: Variants of DENND1B associated with asthma in children. N Engl J Med. 2010, 362 (1): 36-44.
Article CAS PubMed Google Scholar
Hancock DB, Romieu I, Shi M, Sienra-Monge JJ, Wu H, Chiu GY, Li H, del Rio-Navarro BE, Willis-Owen SA, Weiss ST, et al: Genome-wide association study implicates chromosome 9q21.31 as a susceptibility locus for asthma in mexican children. PLoS Genet. 2009, 5 (8): e1000623-
Article PubMed PubMed Central Google Scholar
Gudbjartsson DF, Bjornsdottir US, Halapi E, Helgadottir A, Sulem P, Jonsdottir GM, Thorleifsson G, Helgadottir H, Steinthorsdottir V, Stefansson H, et al: Sequence variants affecting eosinophil numbers associate with asthma and myocardial infarction. Nat Genet. 2009, 41 (3): 342-347.
Article CAS PubMed Google Scholar
Maher B: Personal genomes: The case of the missing heritability. Nature. 2008, 456 (7218): 18-21.
Article CAS PubMed Google Scholar
Manolio TA, Collins FS, Cox NJ, Goldstein DB, Hindorff LA, Hunter DJ, McCarthy MI, Ramos EM, Cardon LR, Chakravarti A, et al: Finding the missing heritability of complex diseases. Nature. 2009, 461 (7265): 747-753.
Article CAS PubMed PubMed Central Google Scholar
Eichler EE, Flint J, Gibson G, Kong A, Leal SM, Moore JH, Nadeau JH: Missing heritability and strategies for finding the underlying causes of complex disease. Nat Rev Genet. 2010, 11 (6): 446-450.
Article CAS PubMed PubMed Central Google Scholar
Maller J, George S, Purcell S, Fagerness J, Altshuler D, Daly MJ, Seddon JM: Common variation in three genes, including a noncoding variant in CFH, strongly influences risk of age-related macular degeneration. Nat Genet. 2006, 38 (9): 1055-1059.
Article CAS PubMed Google Scholar
Sanna S, Jackson AU, Nagaraja R, Willer CJ, Chen WM, Bonnycastle LL, Shen H, Timpson N, Lettre G, Usala G, et al: Common variants in the GDF5-UQCC region are associated with variation in human height. Nat Genet. 2008, 40 (2): 198-203.
Article CAS PubMed PubMed Central Google Scholar
Weedon MN, Lettre G, Freathy RM, Lindgren CM, Voight BF, Perry JR, Elliott KS, Hackett R, Guiducci C, Shields B, et al: A common variant of HMGA2 is associated with adult and childhood height in the general population. Nat Genet. 2007, 39 (10): 1245-1250.
Article CAS PubMed PubMed Central Google Scholar
Lee SH, Park JS, Park CS: The search for genetic variants and epigenetics related to asthma. Allergy Asthma Immunol Res. 2011, 3 (4): 236-244.
Article CAS PubMed PubMed Central Google Scholar
McClellan J, King MC: Genetic heterogeneity in human disease. Cell. 2010, 141 (2): 210-217.
Article CAS PubMed Google Scholar
DeWan A, Triche E, Xu X, Hsu L-I, Zhao C, Belanger K, Hellenbrand K, Willis-Owen SA, Moffat M, Cookson W, et al: PDE11A associations with asthma: results of a genome-wide association scan. Journal of Allergy & Clinical Immunology. 2010, 126 (4): 871-873.
Article CAS Google Scholar
Li H, Durbin R: Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics. 2010, 26 (5): 589-595.
Article PubMed PubMed Central Google Scholar
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R: The sequence alignment/map format and SAMtools. Bioinformatics. 2009, 25 (16): 2078-2079.
Article PubMed PubMed Central Google Scholar
McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, Garimella K, Altshuler D, Gabriel S, Daly M, et al: The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010, 20 (9): 1297-1303.
Article CAS PubMed PubMed Central Google Scholar
DePristo MA, Banks E, Poplin R, Garimella KV, Maguire JR, Hartl C, Philippakis AA, del Angel G, Rivas MA, Hanna M, et al: A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet. 2011, 43 (5): 491-498.
Article CAS PubMed PubMed Central Google Scholar
Wang K, Li M, Hakonarson H: ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010, 38 (16): e164-
Article PubMed PubMed Central Google Scholar
Liu X, Jian X, Boerwinkle E: dbNSFP: a lightweight database of human nonsynonymous SNPs and their functional predictions. Hum Mutat. 2011, 32 (8): 894-899.
Article CAS PubMed PubMed Central Google Scholar
Murk W, Walsh K, Hsu LI, Zhao L, Bracken MB, Dewan AT: Attempted replication of 50 reported asthma risk genes identifies a SNP in RAD50 as associated with childhood atopic asthma. Hum Hered. 2011, 71 (2): 97-105.
Article PubMed Google Scholar
Neale BM, Kou Y, Liu L, Ma'ayan A, Samocha KE, Sabo A, Lin CF, Stevens C, Wang LS, Makarov V, et al: Patterns and rates of exonic de novo mutations in autism spectrum disorders. Nature. 2012, 485 (7397): 242-245.
Article CAS PubMed PubMed Central Google Scholar
Sanders SJ, Murtha MT, Gupta AR, Murdoch JD, Raubeson MJ, Willsey AJ, Ercan-Sencicek AG, DiLullo NM, Parikshak NN, Stein JL, et al: De novo mutations revealed by whole-exome sequencing are strongly associated with autism. Nature. 2012, 485 (7397): 237-241.
Article CAS PubMed PubMed Central Google Scholar
Verde I, Pahlke G, Salanova M, Zhang G, Wang S, Coletti D, Onuffer J, Jin SL, Conti M: Myomegalin is a novel protein of the golgi/centrosome that interacts with a cyclic nucleotide phosphodiesterase. J Biol Chem. 2001, 276 (14): 11189-11198.
Article CAS PubMed Google Scholar
Uys GM, Ramburan A, Loos B, Kinnear CJ, Korkie LJ, Mouton J, Riedemann J, Moolman-Smook JC: Myomegalin is a novel A-kinase anchoring protein involved in the phosphorylation of cardiac myosin binding protein C. BMC Cell Biol. 2011, 12: 18-
Article CAS PubMed PubMed Central Google Scholar
Naramura M, Jang IK, Kole H, Huang F, Haines D, Gu H: c-Cbl and Cbl-b regulate T cell responsiveness by promoting ligand-induced TCR down-modulation. Nat Immunol. 2002, 3 (12): 1192-1199.
Article CAS PubMed Google Scholar
Ionita-Laza I, Perry GH, Raby BA, Klanderman B, Lee C, Laird NM, Weiss ST, Lange C: On the analysis of copy-number variations in genome-wide association studies: a translation of the family-based association test. Genet Epidemiol. 2008, 32 (3): 273-284.
Article PubMed Google Scholar
Walsh KM, Bracken MB, Murk WK, Hoh J, Dewan AT: Association between reduced copy-number at T-cell receptor gamma (TCRgamma) and childhood allergic asthma: a possible role for somatic mosaicism. Mutat Res. 2010, 690 (1–2): 89-94.
Article CAS PubMed PubMed Central Google Scholar
Zhang W, Kuncewicz T, Yu ZY, Zou L, Xu X, Kone BC: Protein-protein interactions involving inducible nitric oxide synthase. Acta Physiol Scand. 2003, 179 (2): 137-142.
Article CAS PubMed Google Scholar
Batra J, Pratap Singh T, Mabalirajan U, Sinha A, Prasad R, Ghosh B: Association of inducible nitric oxide synthase with asthma severity, total serum immunoglobulin E and blood eosinophil levels. Thorax. 2007, 62 (1): 16-22.
Article PubMed Google Scholar
Tennessen JA, Bigham AW, O'Connor TD, Fu W, Kenny EE, Gravel S, McGee S, Do R, Liu X, Jun G, et al: Evolution and functional impact of rare coding variation from deep sequencing of human exomes. Science. 2012, 337 (6090): 64-69.
Article CAS PubMed PubMed Central Google Scholar
Nelson MR, Wegmann D, Ehm MG, Kessner D, St Jean P, Verzilli C, Shen J, Tang Z, Bacanu SA, Fraser D, et al: An abundance of rare functional variants in 202 drug target genes sequenced in 14,002 people. Science. 2012, 337 (6090): 100-104.
Article CAS PubMed PubMed Central Google Scholar
Wang L, Fan C, Topol SE, Topol EJ, Wang Q: Mutation of MEF2A in an inherited disorder with features of coronary artery disease. Science. 2003, 302 (5650): 1578-1581.
Article CAS PubMed PubMed Central Google Scholar
de Quervain DJ, Poirier R, Wollmer MA, Grimaldi LM, Tsolaki M, Streffer JR, Hock C, Nitsch RM, Mohajeri MH, Papassotiropoulos A: Glucocorticoid-related genetic susceptibility for Alzheimer's disease. Hum Mol Genet. 2004, 13 (1): 47-52.
Article CAS PubMed Google Scholar
Bhagavatula MR, Fan C, Shen GQ, Cassano J, Plow EF, Topol EJ, Wang Q: Transcription factor MEF2A mutations in patients with coronary artery disease. Hum Mol Genet. 2004, 13 (24): 3181-3188.
Article CAS PubMed PubMed Central Google Scholar
Weng L, Kavaslar N, Ustaszewska A, Doelle H, Schackwitz W, Hebert S, Cohen JC, McPherson R, Pennacchio LA: Lack of MEF2A mutations in coronary artery disease. J Clin Invest. 2005, 115 (4): 1016-1020.
Article CAS PubMed PubMed Central Google Scholar
Yang Y, Dowling J, Yu QC, Kouklis P, Cleveland DW, Fuchs E: An essential cytoskeletal linker protein connecting actin microfilaments to intermediate filaments. Cell. 1996, 86 (4): 655-665.
Article CAS PubMed Google Scholar
Edvardson S, Cinnamon Y, Jalas C, Shaag A, Maayan C, Axelrod FB, Elpeleg O: Hereditary sensory autonomic neuropathy caused by a mutation in dystonin. Ann Neurol. 2012, 71 (4): 569-572.
Article CAS PubMed Google Scholar

Pre-publication history

The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1471-2350/13/95/prepub

Download references

Acknowledgments

This work was supported by grant AI41040 from the National Institutes of Health (MBB).

Author information

Authors and Affiliations

Department of Chronic Disease Epidemiology, Yale School of Public Health, 60 College St, New Haven, CT, 06520, USA
Andrew T DeWan, Nicole Pizzoferrato & Kyle M Walsh
Center for Perinatal, Pediatric and Environmental Epidemiology, Yale University Schools of Public Health and Medicine, 1 Church Street, New Haven, CT, 06510, USA
Kathryn Brigham Egan, Karen Hellenbrand, Keli Sorrentino & Michael B Bracken
Current affiliation: Division of Cancer Epidemiology, University of California, San Francisco, CA, USA
Kyle M Walsh

Authors

Andrew T DeWan
View author publications
You can also search for this author in PubMed Google Scholar
Kathryn Brigham Egan
View author publications
You can also search for this author in PubMed Google Scholar
Karen Hellenbrand
View author publications
You can also search for this author in PubMed Google Scholar
Keli Sorrentino
View author publications
You can also search for this author in PubMed Google Scholar
Nicole Pizzoferrato
View author publications
You can also search for this author in PubMed Google Scholar
Kyle M Walsh
View author publications
You can also search for this author in PubMed Google Scholar
Michael B Bracken
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Andrew T DeWan.

Additional information

Competing interests

The authors do not have any conflicts of interest, financial or otherwise.

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

DeWan, A.T., Egan, K.B., Hellenbrand, K. et al. Whole-exome sequencing of a pedigree segregating asthma. BMC Med Genet 13, 95 (2012). https://doi.org/10.1186/1471-2350-13-95

Download citation

Received: 05 June 2012
Accepted: 03 October 2012
Published: 09 October 2012
DOI: https://doi.org/10.1186/1471-2350-13-95

Whole-exome sequencing of a pedigree segregating asthma

Abstract

Background

Methods

Results

Conclusions

Background

Methods

Patient recruitment/sample collection

Phenotype collection

Sequencing

Alignment, variant identification and filtering

Annotation

Validation genotyping

Candidate gene variant counts

de novoVariant quality control

Results

Discussion

Conclusions

Author’s contributions

References

Pre-publication history

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Rights and permissions

About this article

Cite this article

Keywords

BMC Medical Genetics

Contact us

Whole-exome sequencing of a pedigree segregating asthma

Abstract

Background

Methods

Results

Conclusions

Background

Methods

Patient recruitment/sample collection

Phenotype collection

Sequencing

Alignment, variant identification and filtering

Annotation

Validation genotyping

Candidate gene variant counts

de novoVariant quality control

Results

Discussion

Conclusions

Author’s contributions

References

Pre-publication history

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Medical Genetics

Contact us