Analysis of meiotic recombination in 22q11.2, a region that frequently undergoes deletions and duplications
BMC Medical Genetics volume 8, Article number: 14 (2007)
The 22q11.2 deletion syndrome is the most frequent genomic disorder with an estimated frequency of 1/4000 live births. The majority of patients (90%) have the same deletion of 3 Mb (Typically Deleted Region, TDR) that results from aberrant recombination at meiosis between region specific low-copy repeats (LCRs).
As a first step towards the characterization of recombination rates and breakpoints within the 22q11.2 region we have constructed a high resolution recombination breakpoint map based on pedigree analysis and a population-based historical recombination map based on LD analysis.
Our pedigree map allows the location of recombination breakpoints with a high resolution (potential recombination hotspots), and this approach has led to the identification of 5 breakpoint segments of 50 kb or less (8.6 kb the smallest), that coincide with historical hotspots. It has been suggested that aberrant recombination leading to deletion (and duplication) is caused by low rates of Allelic Homologous Recombination (AHR) within the affected region. However, recombination rate estimates for 22q11.2 region show that neither average recombination rates in the 22q11.2 region or within LCR22-2 (the LCR implicated in most deletions and duplications), are significantly below chromosome 22 averages. Furthermore, LCR22-2, the repeat most frequently implicated in rearrangements, is also the LCR22 with the highest levels of AHR. In addition, we find recombination events in the 22q11.2 region to cluster within families. Within this context, the same chromosome recombines twice in one family; first by AHR and in the next generation by NAHR resulting in an individual affected with the del22q11.2 syndrome.
We show in the context of a first high resolution pedigree map of the 22q11.2 region that NAHR within LCR22 leading to duplications and deletions cannot be explained exclusively under a hypothesis of low AHR rates. In addition, we find that AHR recombination events cluster within families. If normal and aberrant recombination are mechanistically related, the fact that LCR22s undergo frequent AHR and that we find familial differences in recombination rates within the 22q11.2 region would have obvious health-related implications.
Low copy repeats (LCRs), are 10 to 400 kb long DNA blocks with a complex internal organization that show more than 95% identity between copies. Non-allelic homologous recombination (NAHR) between LCRs has been seen to mediate deletions and duplications in many genomic disorders. All of this supports the view that LCRs are essential players in a common mechanism that causes most genomic syndromes . The 22q11.2 region contains at least 8 LCR22s, four of which are localized within or flanking the most frequent deletions. These LCR22s have been labeled from centromere to telomere: LCR22-2 (or LCR22-A); LCR22-3a (or LCR22-B); LCR22-3b (or LCR22-C); and LCR22-4 (or LCR22-D) [2–4].
The 22q11.2 region undergoes a great amount of germline and somatic rearrangements, and can be classified as one of the most unstable regions of the human genome. As shown in Figure 1, many congenital anomalies are caused by rearrangements within the 22q11.2 region and practically all deletions, duplications and translocations show breakpoints within LCR22s. The del22q11 syndrome is the most frequent of all of these with an estimated de novo frequency of 1/4000 live births and it incorporates classical clinical disorders such as DiGeorge syndrome, Conotruncal Anomaly Face syndrome, or Velocardiofacial syndrome (DG, MIM 188400; CTF, MIM 217095; VCF, MIM 192430) .
The great majority of patients suffering of the del22q11.2 syndrome (97–98%) have a proximal breakpoint within LCR22-2, whereas the distal breakpoint can vary and in 90% of patients it falls within LCR22-4 producing a 3 Mb deletion (Typical deleted region; TDR); or in 7% of patients within LCR22-3a producing a 1.5 Mb deletion (Figure 1) [3, 6, 7].
Most deletions in the 22q11.2 region are the result of NAHR between homologous chromosomes (interchomosomal recombination), as opposed to NAHR within the same chromosome (intrachromosomal recombination) [6, 8]. Deletion and duplication in interchromosomal NAHR are reciprocal products of the same crossover event and current models predict that they should be equally frequent. In support of this prediction, recent studies have found patients with duplications that are the result of NHAR between the same LCR22s implicated in deletions (Figure 1) [9, 10].
Meiotic allelic homologous recombination (AHR) insures proper segregation of chromosomes to gametes and results in the exchange of DNA segments between pairs of homologous chromosomes without loss or gain of genetic material. AHR events are not distributed evenly across the genome and concentrate in hotspots that are 1 to 3 kb in length. In fact, 80% of recombination is calculated to take place in 10–20% of the total genome sequence. However, frequency of recombination within these hotspots can vary by several orders of magnitude, ranging from rates below the genome average (0.3 fold) to frequencies that are far above (120 fold) [11, 12].
Not much is known on the relation between AHR and NAHR within the regions implicated in recurrent genome rearrangements. NAHR hotspots that have been studied so far seem to be localized within larger regions that show low rates of AHR during meiosis [13, 14]. Because of this observation, it has been proposed that mispairing between non-allelic copies of LCRs may be facilitated by reduced recombination within these regions, leading to unequal crossing over between non-allelic LCRs .
There are three available methods that can be used to characterize recombination in the human genome: single-sperm typing, population-based linkage disequilibrium (LD) maps and pedigree-based maps. Single-sperm typing is the method of choice to infer contemporary recombination rates at a very fine scale because it allows typing of large sample sizes. However, single sperm typing detects recombination within short intervals and needs previous information on the location of recombination hotspots and it does not detect female recombination. Population-based LD maps provide an overview of the recombinational history reflecting transmissions from many individuals over thousands of generations and thus reflect only historical recombination rates. These maps although cannot distinguish male and female recombination events, are good predictors of contemporary recombination rates at a scale larger than 5 Mb and detect most (but not all) contemporary hotspots [12, 15]. Finally, pedigree-based maps, although inevitably based on relatively few meiosis, have been shown to give rough but reliable estimates of average recombination rates within large regions and to give an overview of the position of recombination hotspots [14, 16, 17]. Furthermore, a unique feature of pedigree maps is that they allow for the characterization of sex-specific recombination because female and male recombination events can be discriminated.
As a first step towards the characterization of recombination rates and breakpoints within the 22q11.2 region we have constructed a high resolution recombination breakpoint map based on pedigree analysis and a population-based historical recombination map based on LD analysis.
Patients, and samples
Samples from patients and their families were obtained after informed consent. Ethical approval was obtained for this study from the Committee for Ethical Clinical Research of the Government of the Balearic Islands (CEIC). Research was performed in compliance of the Helsinki declaration.
Position and length of LCR22s
Map construction requires the precise location of low copy repeats within the studied region. Although this had been done before [2, 3, 18], we determined the precise boundaries of the repeats within the context of the human genome sequence draft (May 2004 release) to be able to locate new markers and recombination breakpoints in reference to each LCR22. For this purpose, 25 kb windows of genomic sequence that had been filtered out of interspersed sequence repeats (between positions 16 and 23 Mb), were compared with the complete human genome sequence draft using the BLAT and BLAST programs. This strategy enabled us to detect identical or near-identical sequences that were located in other genomic positions on chromosome 22 or other chromosomes and hence constituted low copy repeats.
By this approach we identified the boundaries of all the previously described LCR22s, as well as a new previously non-described small low copy repeat. This new LCR22 is approximately 25 kb-long and is positioned between LCR22-2 and LCR22-3A at positions 18.503114-18528713 and is composed by a sequence that is duplicated only within LCR22-6 at positions 22045473-22069076 with a 93% identity. The repeated sequence includes the 3' end of the ZDHHC8 gene (exons 8–11) and a predicted gene with an unknown function. Interestingly, the ZDHHC8 gene which encodes a putative transmembrane palmitoyltransferase has been proposed to contribute to susceptibility to schizophrenia associated with the 22q11.2 region . The other repeats were found to have the following positions and sizes: LCR2 17016923-17384345 (367, 42 KB); LCR3A 18684169-19068000 (383,381 KB) (this an estimation because of a gap in the sequence); LCR3B 19353714-19416000 (62,286 KB); LCR4A 19777000-20121932 (344,932 KB); LCR4B 20137031-20241670 (104,639 KB); LCR5 21276000-21396500 (120,5 KB) and LCR6 22047000-22153000 (106 KB).
New marker design
New polymorphic simple sequence repeats (CATCH markers) were identified using the Tandem Repeats Finder and the RepeatMasker tracks of the May 2004 release of the human reference sequence available at the University of Santa Cruz (UCSC) Genome Browser . Once a repeat was identified in the region of interest, the design of primers for PCR was performed using the Oligos 9.6 program . Interspersed repetitive sequences identified by RepeatMasker were avoided when possible as primer annealing sequences. Finally, sequences flanking the repeat were checked against the human genome using the BLAST and BLAT programs [19, 22] to ensure it was a single copy marker. PCR conditions and annealing temperatures were determined empirically based on program predictions (see Additional File 1).
Pedigree-based linkage map construction
Fourteen families and a total of 152 genomic DNAs of family members were typed to detect recombination events within the 22q11.2 region. Families were Spanish from Mediterranean regions (Balearic Islands and Catalonia). Ten families included 3 generations or more, and the other four included both parents and at least three offspring. Two of these families had a member affected with the del22q11.2 syndrome, while the other families had no known disease associated to the 22q11.2 region. In total, 204 informative meiosis (83 female, 98 male, and 23 of unknown sex) were typed with 62 different microsatellite polymorphic markers (see Additional file 1) over a region spanning 6,5 Mb. Deleted chromosomes were not counted as informative meiosis because they were the product of an illegitimate recombination (NAHR) event. DNAs were initially typed with 6 microsatellite markers (D22S420, D22S427, D22S264, D22S308, D22S303 and D22S1174) to determine the most likely haplotypes, detect recombination events within the 22q11.2 region, and avoid non-detection of double crossovers. When outlying markers D22S420 or D22S1174 were not informative they were replaced by the nearest informative marker available. In consequence, the total number of informative meiosis is 204 between markers D22S427 and D22S425, but this number decreases towards the outflanking regions. Recombination breakpoints were then positioned by typing all available markers within the interval where the breakpoint had been initially localized.
Haplotypes were determined with the help of the grandparental genotypes, when available, and by minimizing the number of double crossovers. In three of the families, the first generation consists of three siblings or more and haplotypes can be assigned unambiguously, although it can not be determined if it was a female or a male recombination event. Genetic distances between markers were calculated by dividing number of recombination events within the interval by the total number of informative meiosis for the same interval.
We pooled our data with that of the CEPH website  which includes data for markers flanking but not within the TDR for the following families: 17, 66, 102, 884, 1331, 1332, 1333, 1334, 1340, 1341,1346, 1347, 1358, 1362, 1408, 1413, 1416, 1420,1423, 1454 (236 meiosis). In total, within these CEPH families, we identified 33 recombination events, that because of the paucity of markers, we were not able to narrow down recombination breakpoints to small sized segments. As an exception, 3 breakpoints were localized to LCR22-2 (2 male and 1 female) and 1 breakpoint to LCR22-4 (male). All this data was pooled together with that of our linkage map to calculate recombination rates within the 22q11.2 region and within each of the LCR22s.
Population-based linkage-disequilibrium SNP map and hotspot estimation
Population recombination parameters were determined from data available at the public HapMap Project (public data release #21 October 2004). We selected a total of 2074 SNPs between positions 16198487 to 23198486. Genotypes were recorded for 60 unrelated individuals corresponding to parents of the CEPH dataset (northern and western European ancestry).
Estimation of recombination parameters was done using the PHASE v2.1.1 software which calculates linkage disequilibrium (LD) patterns among SNP pairs and derives two parameters for each interval: the population recombination rate (ρ) which reflects the demographic history of the population and a measure of how recombination varies within the same interval (λ)[25, 26]. To this end we divided the 22q11.2 region into 15 400-kb non overlapping windows, except for those containing LCR22-2, LCR22-3A and LCR22-4 that were of 800-kb. An additional window of 437 kb located within the IGL locus region was considered.
Two models of variation of recombination were considered, the general model which is the default model for PHASE v2.1.1 (option -MR0) and the simple hotspot model (option -MR1). The first model considers that the recombination rate in a genomic region is a function of parameters ρ and λ. The second model considers that recombination rate in a genomic region equals ρ, except in the hotspot region in which recombination rate equals λρ. We used the default priors for recombination parameters defined in the PHASE v2.1.1 software . For each input file, we ran PHASE five times and selected the data from the run with the best average value for the goodness of fit (option -x5). The final run for the whole dataset was run 10 times longer to obtain better estimates of the recombination parameters (option -X10). Data was analysed from the recombination output file from which we computed the median of the 1000 draws from the posterior distributions of ρ and λ. We computed ρ at each SNP interval by the product of λρ. Under the simple hotspot model, we estimated the posterior probability of both, λ greater than 10 and λ greater than 100 and computed the associated Bayes Factor (BF). BF measures the strength of evidence for a hot spot by computing the probability of obtaining the data if a hot spot is present divided by the probability of obtaining the data if a the hot spot is not present . We considered intervals of the hotspot to extend from the 5th percentile of the posterior probability for the left end to the 95th percentile of the posterior probability for the right end.
The expected number of families having 0, 1, 2 and ≥ 3 recombination events in the TDR region was modeled following a Poisson distribution, assuming a random occurrence of a recombination event in a family. As the opportunity to observe a recombination event in a family depends on the number of meiosis observed, we only consider those families from witch we have observed a minimum of 10 meiosis. From the 18 families that fulfill this criterion, we estimated a frequency of 1.5 recombination events per family (18 families, with a mean number of 20 meiosis per family and a total of 27 recombination events). The observed number of families having 0, 1, 2, and ≥ 3 recombination events was compared with their expected values assuming a Poisson distribution with a chi square test with 2 degrees of freedom. We also tested for differences among observed and expected values by the mean distance test of Poissonity proposed by Szekely and Rizzo . This test is a goodness-of-fit test based on the estimator of the cumulative distribution function. The test statistic is a Cramer-von Mises type of distance and it is implemented in the R Software by parametric bootstrap. The poisson.mtest of R was run with 10000 replicates.
The pedigree map is based on 204 informative meiosis that were typed with 62 different microsatellite polymorphic markers (38 designed for this study; CATCH markers) (see Additional File 1) over a region spanning 6.58 Mb (Figure 2). This approach led to the detection of 27 single recombination events and no double crossovers. Markers are not distributed uniformly over the entire 22q11.2 region. Regions with a high marker density, as for example the Immunoglobulin lambda locus (IGL), have permitted the location of 5 breakpoints to segments of 54 kb or less (8.6 kb the smallest) (Table 1). Overall, the average marker density between D22S427 and D22S1174 is approximately of 1 marker every 85 kb if we exclude the genomic sequence that corresponds to the repeated DNA of the LCR22s and the median size of breakpoint intervals is of 338 kb.
Recombination events do not spread evenly within the studied region and there are segments where crossovers tend to cluster and segments with little or no recombination. Furthermore, we find that we can divide the 22q11.2 region into segments that alternatively undergo male or female recombination with very little overlap (Figure 2). The regions that go from position 16.23 to 17.95 Mb (D22S420-CATCH48), 18.31 to 19.09 Mb (CATCH23-D22S264) and 20.88 to 22.81 Mb (D22S306-D22S1174) are female-specific with no male recombination. Segments between 19.09 and 19.5 (D22S264-D22S311); 19.74 and 20.34 (CATCH38-CATCH35), and 20.58 and 20.81 (D22S539-CATCH29) are sections of exclusively male recombination. There are only two regions that present both male and female recombination events, those between 18.02 and 18.31 (D22S1623-CATCH23) and 20.81 and 20.89 (CATCH29-CATCH16).
AHR rates in the 22q11.2 region
Average recombination rates in the 22q11.2 region were calculated by pooling our data with that available for the CEPH families at the CEPH Genotype database browser V2.1. In this way, we improved the precision of our estimates by increasing the sample size to 440 meioses and 60 recombination events. Based on these data the sex-averaged recombination rate for the analyzed region (6. 58 Mb, positions 16233835-2281304) is of 2.1 cM/Mb (95% confidence intervals (CI): 1.6–2.76). If we concentrate on the 3 Mb TDR, the sex-averaged recombination rate increases to 2.6 cM/Mb (CI: 1.8–4.4). On the other hand, sex-specific female and male rates in the 6.5 Mb 22q11.2 region are 2.9 cM/Mb (CI: 1.9–4.5) and 1.4 cM/Mb (CI: 0.7–2.5), respectively. In the TDR sex-specific rates increase to 4.0 cM/Mb (CI: 2.2–6.9) in females and 2.0 cM/Mb (CI: 0.9–4.2), in males. Neither 22q11.2 recombination rates, nor those of the 3 Mb TDR included in the larger 6, 58 Mb region, differ significantly from chromosome 22 averages described in previous studies [27, 28]. From this we can conclude, that overall AHR within the 22q11.2 region or within the TDR are not below chromosome 22 averages.
Our pedigree map allows the location of recombination breakpoints with a high resolution but sample size does not permit the accurate measure of recombination rates at a fine scale. However, recombination rate inaccuracies can be reduced by decreasing the resolution and for this reason we analyzed recombination rates in 14 intervals of 500 kb each . This analysis provides an estimation of the variation of contemporary recombination rates within the 22q11.2 region (Figure 3). We find 3 regions to display high recombination rates. The first of these regions has an estimated recombination frequency of 3.2 cM/Mb (CI: 1.1–8.4), extends from position 17 to17.5 Mb and includes LCR22-2. The second spans from position 18 Mb to 19.5 Mb and shows a peak at position 18–18.5 with a rate of 4.9 cM/Mb (CI: 2.2–11.4). This region includes LCR22-3A and B. The third region between positions 20.5 and 21 Mb shows the highest calculated recombination rate of 6.1 cM/Mb (CI: 2.8–12.7), and includes part of the immunoglobulin light chain (IGL) locus.
Coincidence between breakpoints and historical recombination hotspots
An LD map was constructed based on 2074 SNPs from the HapMap project (3,5 kb resolution) that were analyzed in 400–800 kb windows using the general model for recombination rate variation of the PHASE 2.1.1 software (Table 2 and Figure 4) [25, 26]. Regions with a high density of markers in the pedigree map were used to determine if contemporary breakpoints coincide with peaks of historical recombination. The analysis of two of such regions (17, 80 to 18, 40 and 20,60 to 21,00 Mb) found that small breakpoint intervals in the pedigree map coincide in all cases with peaks of historical recombination (Figure 5). Furthermore, small breakpoint segments can be used to interrogate if recombination breakpoints that were found in the pedigree map coincide with historical recombination hotspots and thus potentially reflect the activity of contemporary hotspots. When such analysis was performed, we found that 4 out of 5 breakpoint segments smaller than 54 kb coincide with historical hotspots predicted by our LD map and those at the UCSC genome browser based both on HapMap and Perlegen data (Table 1 and Figure 5) .
Concordance between AHR and NAHR
LCR22s are frequent sites of illegitimate recombination that cause deletions and duplications [3–6, 9]. To determine if LCR22s are also sites of frequent AHR meiotic recombination, we made primer sets that amplify polymorphic markers flanking the different LCR22s (Additional file 1 and Figure 2). Pedigree recombination analysis shows that LCR22-2 displays the highest recombination rates of all the LCR22s: 4, 54 cM/Mb (CI: 2, 35-9, 7 cM/Mb); LCR22-3a, shows a recombination rate of 2,76 cM/Mb (CI: 1, 35-5, 2 cM/Mb) and LCR22-4 of 1,91 cM/Mb (CI: 1, 15-6, 0 cM/Mb). No recombination events were localized within LCR22-5 or LCR22-6.
To assess the support in the HapMap LD map we constructed for each LCR22 being "hot" regions for AHR, we used the PHASE 2.1.1 program (option -MR1) to estimate the most probable hotspot within each of the 400–800 kb windows into which the 22q11.2 region was divided. The program predicts LCR22-2, LCR22-3A and LCR22-4 to contain the main recombination hotspot of their 800 kb window (Table 2), but not LCR22-5 and LCR22-6. LCR22-2 is again the one with the highest recombination intensity (λ = 298; 298 times the background recombination of the window that contains LCR22-2) and likelihood (logBF = 3).
Family clustering of NAHR and AHR events
We analyzed recombination events within several three-generation families with a member carrying a 3 Mb deletion caused by interchromosomal NAHR between LCR22-2 and LCR22-4. In one of these families, we found a grand-maternal AHR event within LCR22-2 that had no pathological consequences. In the next generation, the same chromosome involved in this AHR event participated in a NAHR event causing a 3 Mb deletion (pink haplotype in Figure. 6). Moreover, both events were the product of female recombination.
This observation suggests that certain haplotypes may be more prone to undergo AHR and NAHR events within the 22q11.2 region. To test this we analyzed the presence or absence of recombination events within TDR in families used to construct the pedigree map. This analysis shows a non random distribution of recombination events per family. Families tend to cluster in two extreme groups: one with no recombination events and another with more than 3 recombination events in this region. To test for the statistical significance of this observation we modeled the expected number of families having 0, 1, 2 and more that 3 recombination events as a Poisson distribution of mean 1.5. In a sample of 18 families, the observed number of families having 0, 1, 2 and more than 3 recombination events was of 8, 2, 1 and 7, respectively (Figure 7). These numbers were significantly different from those expected under a Poisson distribution (4, 6, 4.5 and 2.3, respectively) (χ2 = 13.06, 2 df; P = 0.0016 and Mean distance test of Poissonity, P = 0.0033). This result shows that certain families have a higher tendency to recombine within the TDR region than others.
To explore if there is a genetic basis within the TDR for the observed differences between families, we analyzed microsatellite markers that are located within the TDR in individuals recombining within this region. However, allelic frequencies were not significantly different in these individuals from those of the overall population (data not shown). This indicates that high and low recombining families do not show different origins for the TDR. Therefore, although we have found indications that certain individuals may undergo AHR in the TDR more frequently than others, we have not found any apparent differences within the TDR itself between such individuals.
The knowledge of the detailed contemporary recombinational traits of a genomic region is based on the localization of the exact position of crossovers and the accurate characterization of recombination frequencies. However, previously published genetic maps of chromosome 22 provided very little information on the 22q11.2 region [27, 28]. These maps had been obtained within the context of the construction of genome-wide genetic maps and the marker density in the 22q11.2 region was low. In fact, none of them included markers located within the 3 Mb region most frequently implicated in the 22q11.2 deletion and duplication syndrome (typically deleted region; TDR) (Figure 1). In addition, several high resolution maps based on LD data reflecting historical recombination rates have been published [12, 25, 29, 30]. However, correspondence between historical and contemporary recombination rates is not perfect which justifies the need for a recombination map based on breakpoint analysis.
The resolution of a genetic map determines how precisely we know the position at which a crossover is located and is limited by the density of polymorphisms and the distance between the closest markers. On the other hand, the accuracy of the recombination rate of a particular genomic segment is a function of the number of meiotic events studied . The pedigree-based map we present here, although of limited accuracy (204 meiosis), has a much higher marker density (1 marker per 85 kb) than other previous genetic maps of the region . Because of its high resolution our map allows the localization of crossovers to small segments and obtains a comprehensive overview of regions of high and low recombination and of potential recombination hotspots. As a result, we have identified three regions that display higher recombination rates than the others within 22q11.2: one comprising LCR22-2, a second LCR22-3A and B, and a third the IGL locus. In addition, we have seen that male and female recombination is very compartmentalized, with regions containing only male or female crossovers with practically no overlap.
However, the particular traits of the 22q11.2 region warrant a point of caution on recombination rates because there may be inaccuracies in the LCR22 sizes calculated from the available human genome sequence drafts (UCSC; May 2004 release): The size of LCR22-3a may not be exact, as it is an estimation because of a sequence gap. In addition, recently it has been shown that LCR22-2 and LCR22-4 can be polymorphic in size in the normal population (copy number variations; CNV) and thus certain individuals may have larger or smaller LCR22s .
Population-based LD maps which model historical recombination rates have been shown to be able to predict most contemporary recombination hotspots [17, 25, 32]. To determine with increased accuracy and resolution the recombinational traits of the 22q11.2 region we have also constructed a population-based LD map from Hapmap SNP-genotype data (2074 SNPs). This LD map was used to provide support to predictions made by the pedigree map. We find coincidence between small breakpoint segments in the pedigree map and historical recombination hotspots (Table 1 and Figure 5).
Interchromosomal non-allelic homologous recombination (NAHR) between LCR22s has been seen to mediate most of the germline deletions and duplications in the 22q11.2 region. However, not much is known on a possible relation between normal meiotic recombination (AHR) and ectopic recombination (NAHR) in regions implicated in recurrent genome rearrangements. We find some support for a relation between high rates of AHR and NAHR in the 22q11.2 region. Our maps show that LCR22-2, LCR22-3a and LCR22-4 undergo frequent AHR and that LCR22-5 and LCR22-6 support little or no recombination. Interestingly, NAHR between the three LCR22s displaying frequent AHR account for nearly all described deletions in the 22q11.2 region. It is also suggestive that LCR22-2, the LCR that mediates most deletions and duplications within the 22q11.2 region, is also the LCR in which we have found the highest AHR rates. Furthermore, AHR breakpoints detected within LCR22-2 are mostly female and we show through a meta-analysis of published data that there is a significant excess of maternal deletions in del22q11.2 syndrome patients (56%; X2: p = 0, 0238) (Table 3). This result indicates that there is also a female bias in NAHR causing de novo 22q11.2 deletions.
It has been proposed that LCR22-4 replicates later than LCR22-2 in chromosomes of maternal origin and that this favors misalignments between LCR22-2 and LCR22-4 leading to NAHR causing 22q11.2 deletions . This model predicts that the remaining centromeric part of LCR22-2 in the deIeted chromosome would be predominantly of grandmaternal origin. In our case we determined that the centromeric region had a grandmaternal origin in 4 of the observed AHR events within LCR22-2, while 2 cases were not informative (data not shown). This observation seems to support an influence of late replication of the maternal chromosome in AHR events, as well as NAHR events. Nevertheless, when we analyzed three families that had suffered a NAHR event and a deletion, and contrary to previous results, only one had the centromeric portion of grandmaternal origin (data not shown). Futhermore, although 90% of the deletion events have their endpoints within LCR22-2 and 4, we find that the second highest AHR rate within an LCR22 is in LCR22-3a and not in LCR22-4. This observation argues against a direct and simple link between high AHR and NAHR levels. Other potential factors that might influence the participation of an LCR22 in a NAHR event may be, for example, the specific organization and content of each LCR (tandem versus inverted repeats), as well as the size and distance between the participating LCR22s.
The genomic diseases, Charcot-Marie-Tooth disease type 1A (CMT1A, MIM 118220) and hereditary neuropathy with liability to pressure palsies (HNPP, MIM 161500) are caused by LCR-mediated duplications and deletions in 17p12. de novo CMT1A duplication occurs 10 times more frequently in male than in female gametogenesis and AHR frequencies in the CMT1A/HNPP region indicate that the frequency is low in males but high in females . Moreover, a similar observation was made for the genomic region that undergoes deletions causing the Smith-Magenis syndrome (17p11.2, SMS; MIM 182290) where a low rate of recombination was found in both sexes . These observations led to the hypothesis that such reduced recombination may increase unequal crossing over by favoring misalignments .
Our study shows that LCR22s implicated in deletions and duplications are sites of frequent meiotic recombination (AHR) and that average recombination in the 22q11.2 region is similar to the chromosome average. Thus, aberrant recombination leading to 22q11.2 deletion syndrome can't be explained exclusively under a hypothesis of low regional AHR rates. This may reflect that the mechanisms causing NAHR in the 22q11.2 region and those in the regions causing SMS and CMT1A-HNPP may be different; or that available maps of the the 17p11-12 region do not have the resolution to determine AHR rates within the region's LCRs. Comprehensive high resolution maps of the 17p11-12 region may help solve this question.
In addition, we also show that AHR events within the 22q11.2 region cluster in families. If a concordance between AHR and NAHR exists, we would expect families with high AHR levels to be more prone to deletions and duplications. Although we would need a large number of multiple generation pedigrees to prove this, we describe here a family with two female crossovers within LCR22-2: in one generation it was an AHR event and in the next generation a NAHR event that resulted in a 22q11.2 deletion. We have not found differences in allelic frequencies of markers within 22q11.2 between individuals which show AHR and those that do not. However, it has been seen that the genetic background outside of a particular region influences recombination rates and breakpoint location [17, 34]. Thus, it would not be surprising that the overall genetic background may have an effect on a slightly larger susceptibility to suffer rearrangements if NAHR and AHR are mechanistically related. If that is the case, the observed familial differences in recombination rates within the 22q11.2 region would have obvious health-related implications and certain families may have a slightly larger risk to suffer a 22q11.2 deletion. However, because the genetic or molecular factors that make 22q11.2 "hotter" for recombination are unknown the quantification of such risk is impossible right now.
In any case, further characterization of recombination in the 22q11.2 region should provide more information on a potential relation between NAHR and AHR active regions. However, high throughput studies that could clarify if AHR and NAHR show the same breakpoint locations are hampered by the recombinational and genomic traits of the 22q11.2 region. On one hand, LCR22s are large (300–400 kb) highly identical structures (>98% identity) making it very difficult to identify reliable polymorphisms that would allow localization of AHR breakpoints. And on the other hand, some of the most active LCR22s in NAHR and AHR may have higher female than male recombination rates, and thus sperm typing may not be the ideal technique to characterize AHR within these repeats.
We present a high resolution recombination map of the 22q11.2 region based on pedigree data. Our map allows the location of recombination breakpoints with a high resolution (potential recombination hotspots), and this approach has led to the identification of 5 breakpoint segments of 50 kb or less (8.6 kb the smallest), that coincide with historical hotspots. It has been suggested that aberrant recombination leading to deletion (and duplication) is caused by low rates of Allelic Homologous Recombination (AHR) within the affected region . Our study shows that LCR22s implicated in deletions and duplications are sites of frequent meiotic recombination (AHR) and that average recombination in the 22q11.2 region is similar to the chromosome average. Thus, aberrant recombination leading to 22q11.2 deletion syndrome can't be explained exclusively under a hypothesis of low AHR rates. In addition, we find recombination events in the 22q11.2 region to cluster within families. Within this context, the same chromosome recombines twice in one family; first by AHR and in the next generation by NAHR resulting in an individual affected with the del22q11.2 syndrome.
Stankiewicz P, Lupski JR: Genome architecture, rearrangements and genomic disorders. Trends Genet. 2002, 18: 74-82. 10.1016/S0168-9525(02)02592-1.
Dunham I, Shimizu N, Roe BA, Chissoe S, Hunt AR, Collins JE, Bruskiewich R, Beare DM, Clamp M, Smink LJ, Ainscough R, Almeida JP, Babbage A, Bagguley C, Bailey J, Barlow K, Bates KN, Beasley O, Bird CP, Blakey S, Bridgeman AM, Buck D, Burgess J, Burrill WD, O'Brien KP, .: The DNA sequence of human chromosome 22. Nature. 1999, 402: 489-495. 10.1038/990031.
Edelmann L, Pandita RK, Spiteri E, Funke B, Goldberg R, Palanisamy N, Chaganti RS, Magenis E, Shprintzen RJ, Morrow BE: A common molecular basis for rearrangement disorders on chromosome 22q11. Hum Mol Genet. 1999, 8: 1157-1167. 10.1093/hmg/8.7.1157.
Shaikh TH, Kurahashi H, Emanuel BS: Evolutionarily conserved low copy repeats (LCRs) in 22q11 mediate deletions, duplications, translocations, and genomic instability: an update and literature review. Genet Med. 2001, 3: 6-13.
McDermid HE, Morrow BE: Genomic disorders on 22q11. Am J Hum Genet. 2002, 70: 1077-1088. 10.1086/340363.
Saitta SC, Harris SE, Gaeth AP, Driscoll DA, McDonald-McGinn DM, Maisenbacher MK, Yersak JM, Chakraborty PK, Hacker AM, Zackai EH, Ashley T, Emanuel BS: Aberrant interchromosomal exchanges are the predominant cause of the 22q11.2 deletion. Hum Mol Genet. 2004, 13: 417-428. 10.1093/hmg/ddh041.
Shaikh TH, Kurahashi H, Saitta SC, O'Hare AM, Hu P, Roe BA, Driscoll DA, McDonald-McGinn DM, Zackai EH, Budarf ML, Emanuel BS: Chromosome 22-specific low copy repeats and the 22q11.2 deletion syndrome: genomic organization and deletion endpoint analysis. Hum Mol Genet. 2000, 9: 489-501. 10.1093/hmg/9.4.489.
Baumer A, Dutly F, Balmer D, Riegel M, Tukel T, Krajewska-Walasek M, Schinzel AA: High level of unequal meiotic crossovers at the origin of the 22q11. 2 and 7q11.23 deletions. Hum Mol Genet. 1998, 7: 887-894. 10.1093/hmg/7.5.887.
Ensenauer RE, Adeyinka A, Flynn HC, Michels VV, Lindor NM, Dawson DB, Thorland EC, Lorentz CP, Goldstein JL, McDonald MT, Smith WE, Simon-Fayard E, Alexander AA, Kulharya AS, Ketterling RP, Clark RD, Jalal SM: Microduplication 22q11.2, an emerging syndrome: clinical, cytogenetic, and molecular analysis of thirteen patients. Am J Hum Genet. 2003, 73: 1027-1040. 10.1086/378818.
Hassed SJ, Hopcus-Niccum D, Zhang L, Li S, Mulvihill JJ: A new genomic duplication syndrome complementary to the velocardiofacial (22q11 deletion) syndrome. Clin Genet. 2004, 65: 400-404. 10.1111/j.0009-9163.2004.0212.x.
de Massy B: Distribution of meiotic recombination sites. Trends Genet. 2003, 19: 514-522. 10.1016/S0168-9525(03)00201-4.
Myers S, Bottolo L, Freeman C, McVean G, Donnelly P: A fine-scale map of recombination rates and hotspots across the human genome. Science. 2005, 310: 321-324. 10.1126/science.1117196.
Bi W, Yan J, Stankiewicz P, Park SS, Walz K, Boerkoel CF, Potocki L, Shaffer LG, Devriendt K, Nowaczyk MJ, Inoue K, Lupski JR: Genes in a refined Smith-Magenis syndrome critical deletion interval on chromosome 17p11.2 and the syntenic region of the mouse. Genome Res. 2002, 12: 713-728. 10.1101/gr.73702.
Inoue K, Dewar K, Katsanis N, Reiter LT, Lander ES, Devon KL, Wyman DW, Lupski JR, Birren B: The 1.4-Mb CMT1A duplication/HNPP deletion genomic region reveals unique genome architectural features and provides insights into the recent evolution of new genes. Genome Res. 2001, 11: 1018-1033. 10.1101/gr.180401.
Jeffreys AJ, Neumann R, Panayi M, Myers S, Donnelly P: Human recombination hot spots hidden in regions of strong marker association. Nat Genet. 2005, 37: 601-606. 10.1038/ng1565.
Arnheim N, Calabrese P, Nordborg M: Hot and cold spots of recombination in the human genome: the reason we should find them and how this can be achieved. Am J Hum Genet. 2003, 73: 5-16. 10.1086/376419.
Cullen M, Perfetto SP, Klitz W, Nelson G, Carrington M: High-resolution patterns of meiotic recombination across the human major histocompatibility complex. Am J Hum Genet. 2002, 71: 759-776. 10.1086/342973.
Babcock M, Pavlicek A, Spiteri E, Kashork CD, Ioshikhes I, Shaffer LG, Jurka J, Morrow BE: Shuffling of genes within low-copy repeats on 22q11 (LCR22) by Alu-mediated recombination events during evolution. Genome Res. 2003, 13: 2519-2532. 10.1101/gr.1549503.
site UCSC: http://genome.cse.ucsc.edu. 7 AD
Mukai J, Liu H, Burt RA, Swor DE, Lai WS, Karayiorgou M, Gogos JA: Evidence that the gene encoding ZDHHC8 contributes to the risk of schizophrenia. Nat Genet. 2004, 36: 725-731. 10.1038/ng1375.
University of Helsinki Institute of Biotechnology: http://www.biocenter.helsinki.fi/bi/Programs/fastpcr.htm.
National center for biotechnology information BLAST: http://www.ncbi.nlm.nih.gov/BLAST/. 47 AD
Humain CEP: http://www.cephb.fr/cephdb/. 7 AD
project IHM: http://www.hapmap.org/index.html.en. 7 AD
Crawford DC, Bhangale T, Li N, Hellenthal G, Rieder MJ, Nickerson DA, Stephens M: Evidence for substantial fine-scale variation in recombination rates across the human genome. Nat Genet. 2004, 36: 700-706. 10.1038/ng1376.
Li N, Stephens M: Modeling linkage disequilibrium and identifying recombination hotspots using single-nucleotide polymorphism data. Genetics. 2003, 165: 2213-2233.
Broman KW, Murray JC, Sheffield VC, White RL, Weber JL: Comprehensive human genetic maps: individual and sex-specific variation in recombination. Am J Hum Genet. 1998, 63: 861-869. 10.1086/302011.
Kong A, Gudbjartsson DF, Sainz J, Jonsdottir GM, Gudjonsson SA, Richardsson B, Sigurdardottir S, Barnard J, Hallbeck B, Masson G, Shlien A, Palsson ST, Frigge ML, Thorgeirsson TE, Gulcher JR, Stefansson K: A high-resolution recombination map of the human genome. Nat Genet. 2002, 31: 241-247.
Hinds DA, Stuve LL, Nilsen GB, Halperin E, Eskin E, Ballinger DG, Frazer KA, Cox DR: Whole-genome patterns of common DNA variation in three human populations. Science. 2005, 307: 1072-1079. 10.1126/science.1105436.
McVean GA, Myers SR, Hunt S, Deloukas P, Bentley DR, Donnelly P: The fine-scale structure of recombination rate variation in the human genome. Science. 2004, 304: 581-584. 10.1126/science.1092500.
Redon R, Ishikawa S, Fitch KR, Feuk L, Perry GH, Andrews TD, Fiegler H, Shapero MH, Carson AR, Chen W, Cho EK, Dallaire S, Freeman JL, Gonzalez JR, Gratacos M, Huang J, Kalaitzopoulos D, Komura D, MacDonald JR, Marshall CR, Mei R, Montgomery L, Nishimura K, Okamura K, Shen F, Somerville MJ, Tchinda J, Valsesia A, Woodwark C, Yang F, Zhang J, Zerjal T, Zhang J, Armengol L, Conrad DF, Estivill X, Tyler-Smith C, Carter NP, Aburatani H, Lee C, Jones KW, Scherer SW, Hurles ME: Global variation in copy number in the human genome. Nature. 2006, 444: 444-454. 10.1038/nature05329.
Jeffreys AJ, Kauppi L, Neumann R: Intensely punctate meiotic recombination in the class II region of the major histocompatibility complex. Nat Genet. 2001, 29: 217-222. 10.1038/ng1001-217.
Baumer A, Riegel M, Schinzel A: Non-random asynchronous replication at 22q11.2 favours unequal meiotic crossovers leading to the human 22q11.2 deletion. J Med Genet. 2004, 41: 413-420. 10.1136/jmg.2003.016352.
Heine D, Khambata S, Wydner KS, Passmore HC: Analysis of recombinational hot spots associated with the p haplotype of the mouse MHC. Genomics. 1994, 23: 168-177. 10.1006/geno.1994.1474.
Driscoll DA, Budarf ML, Emanuel BS: A genetic etiology for DiGeorge syndrome: consistent deletions and microdeletions of 22q11. Am J Hum Genet. 1992, 50: 924-933.
Seaver LH, Pierpont JW, Erickson RP, Donnerstein RL, Cassidy SB: Pulmonary atresia associated with maternal 22q11.2 deletion: possible parent of origin effect in the conotruncal anomaly face syndrome. J Med Genet. 1994, 31: 830-834.
Morrow B, Goldberg R, Carlson C, Das GR, Sirotkin H, Collins J, Dunham I, O'Donnell H, Scambler P, Shprintzen R, .: Molecular definition of the 22q11 deletions in velo-cardio-facial syndrome. Am J Hum Genet. 1995, 56: 1391-1403.
Demczuk S, Levy A, Aubry M, Croquette MF, Philip N, Prieur M, Sauer U, Bouvagnet P, Rouleau GA, Thomas G, .: Excess of deletions of maternal origin in the DiGeorge/velo-cardio-facial syndromes. A study of 22 new patients and review of the literature. Hum Genet. 1995, 96: 9-13. 10.1007/BF00214179.
Ryan AK, Goodship JA, Wilson DI, Philip N, Levy A, Seidel H, Schuffenhauer S, Oechsler H, Belohradsky B, Prieur M, Aurias A, Raymond FL, Clayton-Smith J, Hatchwell E, McKeown C, Beemer FA, Dallapiccola B, Novelli G, Hurst JA, Ignatius J, Green AJ, Winter RM, Brueton L, Brondum-Nielsen K, Scambler PJ, .: Spectrum of clinical features associated with interstitial chromosome 22q11 deletions: a European collaborative study. J Med Genet. 1997, 34: 798-804.
Bonnet D, Cormier-Daire V, Kachaner J, Szezepanski I, Souillard P, Sidi D, Munnich A, Lyonnet S: Microsatellite DNA markers detects 95% of chromosome 22q11 deletions. Am J Med Genet. 1997, 68: 182-184. 10.1002/(SICI)1096-8628(19970120)68:2<182::AID-AJMG12>3.0.CO;2-Q.
Fokstuen S, Arbenz U, Artan S, Dutly F, Bauersfeld U, Brecevic L, Fasnacht M, Rothlisberger B, Schinzel A: 22q11.2 deletions in a series of patients with non-selective congenital heart defects: incidence, type of defects and parental origin. Clin Genet. 1998, 53: 63-69. 10.1034/j.1399-0004.1998.531530113.x.
Matsuoka R, Kimura M, Scambler PJ, Morrow BE, Imamura S, Minoshima S, Shimizu N, Yamagishi H, K J, Watanabe S, Oyama K, Saji T, Ando M, Takao A, Momma K: Molecular and clinical study of 183 patients with conotruncal anomaly face syndrome. Hum Genet. 1998, 103: 70-80. 10.1007/s004390050786.
Rauch A, Hofbeck M, Leipold G, Klinge J, Trautmann U, Kirsch M, Singer H, Pfeiffer RA: Incidence and significance of 22q11.2 hemizygosity in patients with interrupted aortic arch. Am J Med Genet. 1998, 78: 322-331. 10.1002/(SICI)1096-8628(19980724)78:4<322::AID-AJMG4>3.0.CO;2-N.
Lu JH, Chung MY, Hwang B, Chien HP: Prevalence and parental origin in Tetralogy of Fallot associated with chromosome 22q11 microdeletion. Pediatrics. 1999, 104: 87-90. 10.1542/peds.104.1.87.
Trost D, Wiebe W, Uhlhaas S, Schwindt P, Schwanitz G: Investigation of meiotic rearrangements in DGS/VCFS patients with a microdeletion 22q11.2. J Med Genet. 2000, 37: 452-454. 10.1136/jmg.37.6.452.
Eliez S, Antonarakis SE, Morris MA, Dahoun SP, Reiss AL: Parental origin of the deletion 22q11.2 and brain development in velocardiofacial syndrome: a preliminary study. Arch Gen Psychiatry. 2001, 58: 64-68. 10.1001/archpsyc.58.1.64.
Vittorini S, Sacchelli M, Iascone MR, Collavoli A, Storti S, Giusti A, Andreani G, Botto N, Biagini A, Clerico A: Molecular characterization of chromosome 22 deletions by short tandem repeat polymorphism (STRP) in patients with conotruncal heart defects. Clin Chem Lab Med. 2001, 39: 1249-1258. 10.1515/CCLM.2001.201.
Chung MY, Lu JH, Chien HP, Hwang B: Chromosome 22q11 microdeletion in conotruncal heart defects: clinical presentation, parental origin and de novo mutations. Int J Mol Med. 2001, 7: 501-505.
The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1471-2350/8/14/prepub
We thank the Department of Genetics of the Hospital Sant Pau; Barcelona (Spain) for DNAs of some of the families used in the pedigree-based linkage map; X. Busquets for critical reading of the manuscript; JB Soriano and T. Alorda for help with the statistical analysis; the Utah participants and research groups involved in producing the data for making it publicly available through the HapMap consortium and CEPH database. We are indebted to all the patients who participated in this study. This work was supported by grants from FIS-PI021476 and the "Fundació Marató TV3" to JF; and FIS-PI05-1585 and FIS-C03/05 to LTJ, JR and DHS.
The author(s) declare that they have no competing interests.
LTJ carried out the molecular genetic studies, participated in the sequence analysis and in the draft of the manuscript. JR carried out clinical characterization of patients, data analysis and drafting of the manuscript. MST participated in the construction of the HapMap based recombination map and in the statistical tests. JF carried out the construction of the HapMap based recombination map and the statistical tests and participated in the drafting of the manuscript. DHS conceived the study, participated in the molecular genetic studies, analysed the data and wrote the manuscript. All authors read and approved the final manuscript.
Electronic supplementary material
Additional File 1: Markers used for the construction of the pedigree linkage map. The table depicts the markers used for the construction of the pedigree-based recombination map. It depicts the name of the marker, its position on the chromosome, PCR product size, amplification primers, type of repeat and observed heterozigosity in our study. (DOC 126 KB)
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
About this article
Cite this article
Torres-Juan, L., Rosell, J., Sánchez-de-la-Torre, M. et al. Analysis of meiotic recombination in 22q11.2, a region that frequently undergoes deletions and duplications. BMC Med Genet 8, 14 (2007). https://doi.org/10.1186/1471-2350-8-14
- Recombination Rate
- Recombination Event
- Recombination Breakpoint
- Historical Recombination
- Informative Meiosis