The value of some Corsican sub-populations for genetic association studies
- Veronica Latini†1,
- Gabriella Sole†1,
- Laurent Varesi2,
- Giuseppe Vona3 and
- Maria Serafina Ristaldi1Email author
© Latini et al; licensee BioMed Central Ltd. 2008
Received: 09 April 2008
Accepted: 28 July 2008
Published: 28 July 2008
Genetic isolates with a history of a small founder population, long-lasting isolation and population bottlenecks represent exceptional resources in the identification of disease genes. In these populations the disease allele reveals Linkage Disequilibrium (LD) with markers over significant genetic intervals, therefore facilitating disease locus identification. In a previous study we examined the LD extension on the Xq13 region in three Corsican sub-populations from the inner mountainous region of the island. On the basis of those previous results we have proposed a multistep procedure to carry out studies aimed at the identification of genes involved in complex diseases in Corsica. A prerequisite to carry out the proposed multi-step procedure was the presence of different degrees of LD on the island and a common genetic derivation of the different Corsican sub-populations. In order to evaluate the existence of these conditions in the present paper we extended the analysis to the Corsican coastal populations.
Samples were analyzed using seven dinucleotide microsatellite markers on chromosome Xq13-21: DXS983, DXS986, DXS8092, DXS8082, DXS1225, DXS8037 and DXS995 spanning approximately 4.0 cM (13.3 Mb). We have also investigated the distribution of the DXS1225-DXS8082 haplotype which has been recently proposed as a good marker of population genetic history due to its low recombination rate.
the results obtained indicate a decrease of LD on the island from the central mountainous toward the coastal sub-populations. In addition the analysis of the DXS1225-DXS8082 haplotype revealed: 1) the presence of a particular haplotype with high frequency; 2) the derivation from a common genetic pool of the sub-populations examined in the present study.
These results indicate the Corsican sub-populations useful for the fine mapping of genes contributing to complex diseases.
Isolates have been of considerable use in genetic studies aimed at identifying mutations underlying rare diseases . Moreover, isolated populations also afford several advantages in unrevealing the genetics of complex diseases [2, 3]. The identification of genes involved in the pathogenesis of multifactorial diseases would contribute towards a better understanding of the physiopathology of these conditions. Furthermore, the prevention and development of new therapeutic approaches would be significantly enhanced.
Association studies are critically dependent on the extent of LD. Several reports have underlined how Linkage Disequilibrium (LD) is more extended in founder populations . LD extended over large regions increases the power of association studies since the number of markers to be analyzed is at least 30% less than in outbred populations .
Genetic homogeneity found in isolated populations is a great advantage in the identification of large genomic regions containing the disease-associated locus, while fine mapping could require recently expanded population.
Corsica has been invaded several times (Greeks, Chartaginians, Romans, Vandals, Bizantynes, Saracens, Pisans, Genoese, Austrians, English and French). In the great majority of cases, these invasions were limited to the coastal areas and left slight marks on the gene pool of the native populations . Strong evidence also suggests an internal microgeographic diversity with the most conserved population located in the center of the island in the mountainous regions , as reflected also in several dialectal linguistic subdivisions from a basic language similar to the Tuscan dialect plus remains of an archaic substratum shared by Corsica and Sardinia . The island is separated from Sardinia by the Strait of Bonifacio.
The close by isolated founder Sardinian population has been extensively investigated and has provided a marked contribution towards the mapping of rare monogenic and complex diseases [3, 11, 12]. We proposed Corsica and Sardinia as a unique opportunity to study genes involved in complex diseases. This idea was based on the fact that the populations of these two islands both derive from a single ancestral population that migrated to Corsica and Sardinia during the last ice age. The genetic proximity between the two islands has already been emphasized elsewhere [13, 14]. In a previous study we examined the extent of LD in three central Corsican sub-populations: Niolo, Corte and Bozio, using 7 microsatellite markers on the Xq13-21 region. Our results displayed a high degree of LD for the sub-populations of Bozio and Niolo and a lower degree for Corte . We previously proposed a multistep procedure involving LD mapping in populations characterized by a different degree of LD extension but sharing a common founder population [15, 16]. The proposed procedure involved: a) Identification of a large genomic region containing the disease associated locus in small sub-isolated population of the central region of Corsica and replication in small sub-isolated population of central Sardinia. b) Mapping in populations inside Sardinia and Corsica that show a weaker degree of LD extension. c) Fine mapping in open populations in which the extent of LD is low, such as the general population of Sardinia and Corsica. d) Final mapping in open population in which the level of background LD is very low, such as the African population. 
In the present study we report the LD pattern on Xq13-21 microsatellite markers in a Corsican coastal sub-population in order to assess the presence of a decrease of LD extension in island. We have also examined the distribution of the DXS1225-DXS8082 haplotypes [17, 18] in the four Corsican sub-populations investigated.
In a previous paper, we have studied the LD extent in 3 sub-populations located in the central mountainous area (Bozio, Niolo and Corte) . Here we show new data on a sample of the Corsican coastal population coming from Ajaccio (n = 50) and Bonifacio (n = 32) area. Finally we evaluated a pooled sample of the different sub-populations.
The use of microsatellite markers in LD mapping is being substituted by SNPs. Nevertheless microsatellites markers remain a valuable tool for the first screening of background LD in a given population.
DNA samples were collected from unrelated male individuals. These samples were analyzed using seven dinucleotide microsatellite markers on chromosome Xq13-21: DXS983, DXS986, DXS8092, DXS8082, DXS1225, DXS8037 and DXS995  spanning approximately 4.0 cM (13.3 Mb). The analysis of this region has been widely used as a measure of background LD in a given population and to compare the levels of LD between populations [2, 18–21].
Microsatellites were analysed using an ABI prism 377 DNA analyser. Genotypes were processed by Genescan v3.1 and Genotyper v2.5 software. A DNA standard, consisting of the CEPH control individual number 1347.02 (Applied Biosystems) was incorporated in all the runs to verify accuracy of typing.
The non-random allelic association between pairs of microsatellite loci was tested by an extension of Fisher exact test on contingency tables. p values were corrected by the step-down Holm-Sidak procedure (with the formula: pcorrected = 1 - (1 - p)n, where n is the number of p values smaller or equal to that being corrected . Each corrected p value was considered significant when < 0.05.
In order to compare LD strength between samples, we used the multiallelic normalized disequilibrium coefficient D' between each marker loci pairs . As a small sample size may lead to overestimation of D' values, the latter were calculated subsequent to correction by means of a bootstrap procedure . Graphical displays of LD between microsatellite pairs were provided by GOLD software http://www.sph.umich.edu/csg/abecasis/GOLD/.
Genetic differentiation test was carried out by Genepop 3.3 (updated version of software Genepop 1.2). The method, based on allelic distribution of alleles in the various populations, are described by Raymond and Rousset (1995) . The unbiased estimate of the p value was performed using a Markov chain method. Genetic differentiation test was used for all pairs of populations.
Gene diversity is often referred to as expected heterozygosity and is defined as the probability that two randomly chosen alleles from the population are different.
The study complies with the Declaration of Helsinki and received ethical approval by the University of Corte. Each subject, of Corsican ancestry up to grandparents, gave informed consent.
The extent of linkage disequilibrium was assessed in each Corsican sub-populations, using seven markers on Xq13-21 and two different approaches: Fisher's exact test was used to evaluate statistical significance of the allelic association between all pairs of loci and Lewontin's coefficient D' was applied to assess the strength of LD.
Pair-wise LD statistics for microsatellite markers on Xq13-21 in the different Corsican sub-populations and in the pooled sample.
(N = 51)
(N = 49)
(N = 50)
(N = 82)
(N = 232)
The coastal population showed a low level of LD, rather similar to values obtained for Corte. In both sub-populations only one marker pair (DXS8082- DXS1225) displayed a significant p value following correction for multiple testing with a high D' value (p = 0.000, D' = 0.636 and p = 0.000, D' = 0.622 respectively). This marker pair has shown a strong association in all the populations analyzed so far .
The lowest levels of D' was shown by the pooled Corsican sample. Only the DXS8082- DXS1225 pair showed a corrected D' exceeding 0.5. Nevertheless, significant values were revealed in four marker pairs by means of Fisher's exact test.
Fisher's exact test of differentiation between pairs of populations.
Corte – Niolo
p < 10-5
Bozio – Niolo
Bozio – Corte
Coast – Niolo
p < 10-5
Coast – Corte
p < 10-5
Coast – Bozio
p < 10-5
Common haplotypes of microsatellite loci DXS1225-DXS8082 in the Corsican sub-populations and in the pooled sample.
Gene diversity values for the microsatellites markers on Xq13-21 region in the different Corsican samples.
Number of alleles (diversity) for
Mean gene diversity (s.d.)
Mean number of alleles (s.d.)
The findings of the present study illustrate a decline in the LD extent in Coastal Corsican population compared to the inner mountainous regions of Niolo and Bozio. We have also shown the presence of an high frequency of a particular DXS1225-DXS8082 haplotype in most of the Corsican sub-populations examined, implying a common genetic origin (with the possible exception of Corte). The decrease observed in the level of LD in the coastal population (figure 2) imply the suitability of this sub-population for use in association studies based on a multistep procedure .
In order to mimic a random South Central Corsican sample we also analyzed LD level on the Xq13-21 region on a pool of Corsican samples. LD extent drops dramatically in the pooled sample (figure 2), making a general Corsican sampling unsuitable for long range mapping. This result is most likely explained by micro differentiation due to bottlenecks and genetic drift, as suggested by the differentiation test. It should be underlined how in the pooled Corsican sample average D' values are extremely low, although significant p Fisher values were obtained in four cases. This discrepancy is best explained by the fact that the significance of p Fisher is likely linked to the sample size (> 200), whereas D' value are corrected for different sample sizes. In agreement the analysis of LD level of 50 random sample drawn from pooled Corsican sample provided similar D' values to the unselected pooled sample (data not shown). The judgment on the extent of LD level in a population should take into account both D' values levels and the significance of the allelic association .
The presence of an high frequency of the same DXS1225-DXS8082 haploype (216–225) either in the Coastal sub-population sample as in sub-populations from Niolo and Bozio regions, suggest their origin from a common genetic pool. Being recombination events between these two markers extremely rare, new variants arise prevalently from a mutation in one of the two loci. The footprint of a demographic event persists for a longer period in haplotype distribution within a region characterized by a low crossing-over rate than in a single marker or between actively recombining markers . Nevertheless it is worth nothing that the most frequent haplotype in Corte subpopulation is the 218–225 which only one dinucleotide away from the 216–225.
All the sub-populations examined show a strong degree of differentiation, according to the differentiation test, with the exception of Bozio and Niolo in agreement with previous results . These data are also in agreement with the well documented microgeografic differentiation in Corsica . The apparent discrepancy between haplotype distribution and differentiation test is best explained by the evolutionary forces causing microdifferentiation. In fact micro differentiation is often due to bottlenecks and genetic drift from a common genetic pool which will change gene frequency, thus restricting the number of haplotypes in the isolate population compared to the coastal sample.
Average gene diversity estimate for each Corsican sub-population was quite similar to values reported for other populations and indicates that they possess similar homogenous genetic architectures. The pattern of gene diversity obtained for all single markers showed no clear correlation with the extent of LD, thus suggesting that the increase in linkage disequilibrium is probably not due to selection, but rather due to demographic history of the studied populations .
The island of Corsica is geographically very close to the other Mediterranean Island of Sardinia. The isolated founder Sardinian population has been extensively investigated and has provided a marked contribution towards the mapping of rare monogenic and complex diseases [1, 3]. The Corsican and Sardinian populations derive from a common genetic founding pool, being subjected over the centuries to similar evolutionary forces, such as isolation, consanguinity, and bottlenecks caused by famine and epidemics [5, 6, 13]. Moreover, the two populations share a similar dietary regime and climate (Mediterranean). Based on these considerations, it is feasible to suggest that the two islands may have selected the same kind of allele associated to specific common diseases. The distribution of the DXS1225-DXS8082 haplotypes highlights the genetic peculiarity of the populations of Corsica and Sardinia, thereby confirming previous data [13, 14]. Indeed the 210–219 haplotype, the most frequent in Europe  is rarely observed also in Sardinia, where the most frequently represented is 216–221 (Zavattari personal communication). This haplotype has not been reported in the European population, likely stemming from the 216–225 Corsican haplotype.
The most frequent haplotypes of microsatellite loci DXS1225-DXS8082 in the Corsican sub-populations under investigation (Table 3) have not been reported in other populations to date (see Laan et al. ). Nevertheless we can not exclude that these haplotypes are present in the other populations examined as well as that the absence of some haplotypes in our samples is due to the reduced sample size and/or to founder effect.
Taken together, this data suggests the suitability of the Corsican sub-populations as a candidate for use in identifying genes underlying complex diseases by means of LD association studies performed according to a multistep procedure as described previously . As previously proposed , we confirm that the joint study of Corsica and Sardinia would be an unique opportunity to study genes involved in complex diseases, according to a multistep procedure based on the presence of a decrease of LD extension within the sub-populations of the two islands. Among the possible target diseases, cardiovascular diseases that have an high incidence in Corsica http://www.fnors.org/asp/travaux/accueil.htm and therefore could represent a good candidate for association studies.
Lastly, it should be underlined how particular caution should be taken when using "isolated populations", such as those of the Sardinian and Corsican Islands as a whole, for the purpose of performing long range LD mapping, due to the possibility that micro differentiation may limit LD extension as reported here for the Corsican population.
We thanks Prof. Antonio Cao and Dr Francesca Crobu for critically revising the manuscript.
- Crisponi L, Crisponi G, Meloni A, Toliat MR, Nürnberg G, Usala L, Uda M, Masala M, Höhne W, Becker C, Marongiu M, Chiappe F, Kleta R, Rauch A, Wollnik B, Strasser F, Reese T, Jakobs C, Kurlemann G, Cao A, Nürnberg P, Rutsch F: Crisponi syndromes is caused by mutations in the CRLF1 gene and is allelic to Cold -Induced Sweating Syndrome Type 1. Am J Hum Genet. 2007, 80 (5): 971-981. 10.1086/516843.View ArticlePubMedPubMed CentralGoogle Scholar
- Marroni F, Pichler I, De Grandi A, Beu Volpato C, Vogl FD, Pinggera GK, Bailey-Wilson JE, Pramstaller PP: Population isolates in South Tyrol and their value for genetic dissection of complex diseases. Ann Hum Genet. 2006, 70 (6): 812-821. 10.1111/j.1469-1809.2006.00274.x.View ArticlePubMedGoogle Scholar
- Balaci L, Spada MC, Olla N, Sole G, Loddo L, Anedda F, Naitza S, Zuncheddu MA, Maschio A, Altea D, Uda M, Pilia S, Sanna S, Masala M, Crisponi L, Fattori M, Devoto M, Doratiotto S, Rassu S, Mereu S, Giua E, Cadeddu NG, Atzeni R, Pelosi U, Corrias A, Perra R, Torrazza PL, Pirina P, Ginesu F, Marcias S, Schintu MG, Del Giacco GS, Manconi PE, Malerba G, Bisognin A, Trabetti E, Boner A, Pescollderungg L, Pignatti PF, Schlessinger D, Cao A, Pilia G: IRAK-M is involved in the pathogenesis of Early-Onset Persistent Asthma. Am J Hum Genet. 2007, 80 (6): 1103-1114. 10.1086/518259.View ArticlePubMedPubMed CentralGoogle Scholar
- Service S, DeYoung J, Karayiorgou M, Roos JL, Pretorious H, Bedoya G, Ospina J, Ruiz-Linares A, Macedo A, Palha JA, Heutink P, Aulchenko J, Oostra B, van Duijn C, Jarvelin M-R, Varilo T, Peddle L, Rahman P, Piras G, Monne M, Murray S, Galver L, Peltonen L, Sabatti C, Collins A, Freimer N: Magnitude and distribution of linkage disequilibrium in population isolates and implications for genome-wide association studies. Nat Genet. 2006, 38 (5): 556-560. 10.1038/ng1770.View ArticlePubMedGoogle Scholar
- Varesi L, Vona G, Memmì M, Marongiu F, Ristaldi MS: β-Thalassemia Mutations in Corsica. Hemoglobin. 2000, 24 (3): 239-244.View ArticlePubMedGoogle Scholar
- Vona G, Memmì M, Calò CM, Latini V, Vacca L, Succa V, Ghiani ME, Moral P, Varesi L: Genetic structure of the Corsican population (France): a Review. Recent research developments in human genetics. 2002, Research Signpost-Kerala India, 147-164.Google Scholar
- Morelli L, Paoli G, Francalacci P: Surname analysis of the Corsican population reveals an agreement with geographical and linguistic structure. J Biosoc Sci. 2004, 34 (3): 289-301. 10.1017/S0021932002002894.View ArticleGoogle Scholar
- Tofanelli S, Taglioli L, Varesi L, Paoli G: Genetic History of the population of Corsica (Western Mediterranean) as inferred from autosomal STR analysis. Hum Biol. 2004, 76 (2): 229-251. 10.1353/hub.2004.0038.View ArticlePubMedGoogle Scholar
- Tofanelli S, Varesi L, Paoli G: Population genetics of Corsicans by autosomal STR. Antropologia Contemporanea – Monograph. 1999, 99-110.Google Scholar
- Rohlfs G: Studi e Ricerche su lingue e dialetti d'Italia. 1972, Sansoni FirenzeGoogle Scholar
- Angius A, Melis MP, Morelli L, Petretto E, Casu G, Maestrale GB, Fraumene C, Bebbere D, Forabosco P, Pirastu M: Archival, demographic and genetic studies define a Sardinian sub-isolate as a suitable model for mapping complex traits. Hum Genet. 2001, 109: 198-209. 10.1007/s004390100557.View ArticlePubMedGoogle Scholar
- Angius A, Bebbere D, Petretto E, Falchi M, Forabosco P, Maestrale B, Casu G, Persico I, Melis PM, Pirastu M: Not all isolates are equal: Linkage Disequilibrium analysis on Xq13.3 reveals different patterns in Sardinian sub-populations. Hum Genet. 2002, 111: 9-15. 10.1007/s00439-002-0753-z.View ArticlePubMedGoogle Scholar
- Vona G: The peopling of Sardinia (Italy): History and effects. Inter J Anthrop. 1997, 12: 71-87. 10.1007/BF02447890.View ArticleGoogle Scholar
- Latini V, Vona G, Ristaldi MS, Marongiu MF, Memmì M, Varesi L, Vacca L: β-globin gene cluster haplotypes in the Corsican and Sardinian populations. Hum Biol. 2003, 27 (6): 855-871. 10.1353/hub.2004.0008.View ArticleGoogle Scholar
- Latini V, Sole G, Doratiotto S, Poddie D, Memmì M, Varesi L, Vona G, Cao A, Ristaldi MS: Genetic isolates in Corsica (France): Linkage Disequilibrium extension analysis on the Xq 13 region. Eur J Hum Genet. 2004, 12 (8): 613-619. 10.1038/sj.ejhg.5201205.View ArticlePubMedGoogle Scholar
- Kaessmann H, Zollner S, Gustaffson A, Wiebe V, Laan M, Lundeberg J, Uhlén M, Pääbo S: Extensive LD in small human populations in Eurasia. Am J Hum Genet. 2002, 70: 673-685. 10.1086/339258.View ArticlePubMedPubMed CentralGoogle Scholar
- Laan M, Paabo S: Demographic history and LD in human population. Nat Genet. 1997, 17 (4): 435-438. 10.1038/ng1297-435.View ArticlePubMedGoogle Scholar
- Laan M, Wiebe V, Khusnutdinova E, Remm M, Paabo S: X-Chromosome as a marker for population history: linkage disequilibrium and haplotype study in Eurasian populations. Eur J Hum Genet. 2005, 13 (4): 452-462. 10.1038/sj.ejhg.5201340.View ArticlePubMedPubMed CentralGoogle Scholar
- Katoh T, Mono S, Ikuta T, Munkhbat B, Tounai K, Ando H, Munkhtuvshin N, Imanishi T, Inoko H, Tamiya G: Genetic isolates in East Asia: a study of linkage disequilibrium in the X chromosome. Am J Hum Genet. 2002, 71: 395-400. 10.1086/341608.View ArticlePubMedPubMed CentralGoogle Scholar
- Bellis C, Cox HC, Ovcaric M, Begley KN, Lea RA, Quinlan S, Burgner D, Heath SC, Blangero J, Griffiths LR: Linkage disequilibrium analysis in the genetically isolated Norfolk Island population. Heredity. 2008, 100: 366-373. 10.1038/sj.hdy.6801083.View ArticlePubMedGoogle Scholar
- Branco CC, Cabrol E, Bento MS, Gomes CT, Cabral R, Vicente AM, Pacheco PR, Mota-Vieira L: Evaluation of linkage disequilibrium on the Xq13.3 region: Comparison between the Azores islands and mainland Portugal. Am J Hum Biol. 2008, 20 (3): 364-366. 10.1002/ajhb.20734.View ArticlePubMedGoogle Scholar
- Lautenberger JA, Stephens JC, O'Brien SJ, Smith MW: Significant admixture linkage disequilibrium across the FY locus in African Americans. Am J Hum Genet. 2000, 66: 969-978. 10.1086/302820.View ArticlePubMedPubMed CentralGoogle Scholar
- Lewontin RC: The interaction of selection and linkage. General considerations; Heterotic Models. Genetics. 1964, 49: 49-67.PubMedPubMed CentralGoogle Scholar
- Efron B, Tibshirani RJ: An introduction to the bootstrap. 1993, Chapman and Hall, New YorkView ArticleGoogle Scholar
- Raymond M, Rousset F: An exact test for population differentiation. Evolution. 1995, 49: 1280-1283. 10.2307/2410454.View ArticleGoogle Scholar
- Nei M: Molecular Evolutionary Genetics. 1987, Columbia University Press, New YorkGoogle Scholar
- Tenesa A, Wright AF, Knott SA, Carothers AD, Hayward C, Angius A, Persico I, Maestrale G, Hastie ND, Pirastu M, Visscher PM: Extent of linkage disequilibrium in a Sardinian sub-isolate: sampling and methodological considerations. Hum Mol Genet. 2004, 13: 25-33. 10.1093/hmg/ddh001.View ArticlePubMedGoogle Scholar
- Falchi A, Piras IS, Vona G, Amoros JP, Calò CM, Giovannoni L, Varesi L: Cholesteryl ester transfer protein gene polymorphisms are associated with coronary artery disease in Corsican population (France). Exp Mol Pathol. 2007, 83: 25-29. 10.1016/j.yexmp.2006.12.007.View ArticlePubMedGoogle Scholar
- The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1471-2350/9/73/prepub
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.