Lack of direct evidence for natural selection at the candidate thrifty gene locus, PPARGC1A

Cadzow, Murray; Merriman, Tony R.; Boocock, James; Dalbeth, Nicola; Stamp, Lisa K.; Black, Michael A.; Visscher, Peter M.; Wilcox, Phillip L.

doi:10.1186/s12881-016-0341-z

Published: 15 November 2016

BMC Medical Genetics volume 17, Article number: 80 (2016)

Abstract

Background

The gene PPARGC1A, in particular the Gly482Ser variant (rs8192678), has been proposed to be subject to natural selection, particularly in recent progenitors of extant Polynesian populations. Reasons include high levels of population differentiation, increased frequencies of the derived type 2 diabetes (T2D) risk 482Ser allele, and association with body mass index (BMI) in a small Tongan population. However, no direct statistical tests for selection have been applied.

Methods

Using a range of Polynesian populations (Tongan, Māori, Samoan) we re-examined evidence for association of Gly482Ser with T2D and BMI as well as gout. Additionally, using Asian, European, and African 1000 Genomes Project samples, a range of statistical tests for selection (F_ST, integrated haplotype score (iHS), cross population extended haplotype homozygosity (XP-EHH), Tajima’s D and Fay and Wu’s H) were conducted on the PPARGC1A locus.

Results

No statistically significant evidence for association between Gly482Ser and any of BMI, T2D or gout was found. Population differentiation (F_ST) was smallest between Asian and Pacific populations (New Zealand Māori ≤ 0.35, Samoan ≤ 0.20). When compared to European (New Zealand Māori ≤ 0.40, Samoan ≤ 0.25) or African populations (New Zealand Māori ≤ 0.80, Samoan ≤ 0.66) this differentiation was larger. We did not find any strong evidence for departure from neutral evolution at this locus when applying any of the other statistical tests for selection. However, using the same analytical methods, we found evidence for selection in specific populations at previously identified loci, indicating that lack of selection was the most likely explanation for the lack of evidence of selection in PPARGC1A.

Conclusion

We conclude that there is no compelling evidence for selection at this locus, and that this gene should not be considered a candidate thrifty gene locus in Pacific populations. High levels of population differentiation at this locus, and the reported absence of the derived 482Ser allele in some Melanesian populations, can alternatively be explained by multiple out-of-Africa migrations by ancestral progenitors, and subsequent genetic drift during colonisation of Polynesia. Intermediate 482Ser allele frequencies in extant Western Polynesian populations could therefore be due to recent admixture with Melanesian progenitors.

Background

The thrifty gene theory is based on the hypothesis that alleles causing increased weight, insulin resistance and type 2 diabetes (T2D) in contemporary populations were an adaptation to fluctuating food availability [1], for example from extreme environmental events and extended sea travel, in ancestral populations. Any genuine thrifty gene variant would have been selectively advantageous and therefore subject to selection during human evolution. Thrifty genetic variants arising early in hominid evolution, for example the uricase knockout [2], would be expected to be monomorphic and shared by all human populations. However, candidate thrifty variants arising more recently in human evolution and polymorphic in contemporary populations should exhibit signatures of selection in the immediate genomic vicinity.

The PPARGC1A transcriptional regulator has a central role in insulin signalling and mitochondrial regulation [3]. The functional Gly482Ser (rs8192678) variant (reviewed in [4]) in the PPARGC1A gene is associated with body mass index (BMI) in a Tongan (Polynesian) population [4]. This led to the hypothesis that the Gly482Ser variant represents a candidate ‘thrifty variant’ in Pacific populations. This was based largely on the observation that the postulated thrifty (derived) 482Ser allele exhibited the highest prevalence world-wide in Polynesian populations, who also have high levels of T2D and BMI. In contrast, low T2D prevalence levels have been recorded in Papuan populations lacking the 482Ser allele [5]. This 482Ser allele observation is also supported by a previous report of extreme F_ST values (a measure of inter-population difference at specific genetic variants) at the PPARGC1A locus [5]. To date, however, this hypothesis has not been formally tested in a range of Polynesian populations using more direct assessments of natural selection.

The Māori and non-indigenous Polynesian populations of Aotearoa New Zealand (NZ) have a high prevalence of obesity, T2D and related metabolic conditions such as gout [6–8]. The reasons for this appear to be complex and are the result of a combination of poor environmental dietary exposure (contributed to by socioeconomic status) and inherited genetic variants [9]. However, extremely little is known about the aetiology of obesity and T2D in NZ Māori and Pacific populations. If the PPARGC1A gene were associated with weight and subject to selection in Pacific populations, then this would be important knowledge on the aetiology of these conditions in contemporary populations. Furthermore, evidence for selection during Polynesian population history would aid in the destigmatisation of obesity. Here, we tested for association between Gly482Ser (rs8192678) and BMI, T2D and gout in several Polynesian populations. We also examined evidence for recent positive selection at the derived 482Ser allele and in the region of PPARGC1A, by applying a recently developed analytical pipeline [10] to genomic data from the various Polynesian and 1000 Genomes Project world-wide populations.

Methods

Populations

New Zealand (NZ) Māori, Cook Island Māori, Samoan, Tongan and NZ Caucasian sample sets were examined (Table 1). Samples were collected as part of the “Genetics of Gout in Aotearoa” case–control study [11] and ethical approval was obtained from the New Zealand Multi-region Ethics committee (MEC/105/10/130). Each participant provided informed written consent. Patients with gout satisfied, by clinical examination, the American Rheumatism Association preliminary criteria for acute gout [12]. BMI and self-reported T2D status were recorded at time of recruitment. Subjects were classified into ancestry groups based on self-reported ancestry of grandparents. Table 1 provides the number of samples in each population, and the methods used for genotyping and data analyses. Tables 2 and 3 describe clinical data and ancestry information for the NZ sample sets.

Table 1 Populations, genotyping platforms and analyses used to evaluate evidence of association and selection in PPARGC1A

Table 2 Clinical and genetic information for sample sets used for association analysis with BMI, T2D and gout

Table 3 Clinical and ancestral information for sample sets used in tests for selection analysis

Five additional populations were obtained from the 1000 Genomes Project phase 3 data release (see [13]): two Asian populations: Han Chinese in Beijing, China (CHB), Southern Han Chinese, China (CHS); two of European ancestry: British in England and Scotland (GBR) and Utah residents with Northern and Western European ancestry (CEU); and one of African ancestry: Yoruba in Ibadan, Nigeria (YRI). The CHB and CHS populations are intermediate with regard to Gly482Ser frequencies between the African (YRI) and European (GBR, CEU) on the one hand and the Polynesian populations on the other [14].

Genotypic data

Two genotypic data sets were used for this study: (a) a SNP-specific assay corresponding to a Glycine (C allele) to Serine (T allele) substitution at amino acid position 482 in the PPARGC1A gene (dbSNP ID rs8192678, residing on chromosome 4 (GRCh37 chr4:23815662)) and genotyped on a subset of the above populations (see Table 1); and (b) chromosome 4-wide genotypic data. For (a), SNP rs8192678 was genotyped with TaqMan® assay ID C___1643192_20 (Life Technologies, Carlsbad, CA) using a LightCycler 480 Real-Time PCR System (Roche Applied Science, Indianapolis, IN). The MINI MELT program was run from within the LightCycler software, and genotypes were assigned based on clustering within the software. Clusters were visually checked, with misassigned genotypes corrected where appropriate or designated as unknown. As a quality control measure, 9% of samples were re-genotyped and cross-checked; all re-genotyped samples were in complete agreement with the original genotypes. For (b), genotypic data for whole chromosome analyses were obtained either directly from SNP microarrays or from whole genome sequencing. For the SNP microarrays, 71 individuals who self-reported four Māori grandparents were genotyped using an Affymetrix AXIOM genome-wide ASI array (Table 1). Similarly, 96 individuals self-reporting four Samoan grandparents were genotyped with an Illumina Human OmniExpress bead chip. The individuals genotyped by SNP microarray were a subset of those genotyped with the TaqMan® assay. Whole genome sequence (WGS) data from the 1000 Genomes Project phase 3 data release (via [13]) were used.

Association analyses

To determine associations between BMI and rs8192678 genotype, single marker linear regressions were undertaken using the R statistical software environment [15]. Hardy-Weinberg equilibrium exact tests were calculated using the R package HardyWeinberg [16]. For each population, three modes of gene action were tested: additive, dominant (both C and derived T allele), and over-dominant. Age, sex, gout affection status, T2D, and Structure-estimated ancestry proportions (calculated as described in [11]) were included as covariates for BMI. For T2D and gout affection status, logistic regression was performed with the same covariates. A meta-analysis of the Polynesian populations was undertaken by combining the NZ Māori, Cook Island Māori, Tongan and Samoan populations and repeating the marker trait regressions as described above with the added covariate of population.
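The modes of gene action described above can be illustrated with a small genotype-encoding sketch. This is not the authors' R code: the function name `encode_genotype` and the model labels are hypothetical, and the sketch only shows how a C/T genotype might be turned into a numeric predictor before a regression is fitted.

```python
# Hypothetical helper (not from the paper): numeric coding of rs8192678
# genotypes over the C (Gly) and T (Ser, derived) alleles.

def encode_genotype(genotype, model):
    """Return the predictor value for one genotype under a gene-action model."""
    t_count = genotype.count("T")  # copies of the derived T allele: 0, 1 or 2
    if model == "additive":
        return t_count                   # CC=0, CT=1, TT=2
    if model == "dominant_T":
        return 1 if t_count >= 1 else 0  # T dominant: CT and TT grouped together
    if model == "dominant_C":
        return 1 if t_count == 2 else 0  # C dominant: only TT differs
    if model == "over_dominant":
        return 1 if t_count == 1 else 0  # heterozygote differs from both homozygotes
    raise ValueError(f"unknown model: {model}")

genotypes = ["CC", "CT", "TT"]
for model in ("additive", "dominant_T", "dominant_C", "over_dominant"):
    print(model, [encode_genotype(g, model) for g in genotypes])
```

The encoded value would then enter the linear (BMI) or logistic (T2D, gout) regression alongside the covariates listed above.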

Selection analyses

Pairwise F_ST [17] was estimated for all populations for rs8192678. For populations where genome-wide genotypic data were available (i.e., the 1000 Genomes Project and Axiom/Omni-genotyped populations, Table 1), the following statistics were calculated: F_ST between sample sets, Tajima’s D, Fay and Wu’s H, and integrated haplotype score (iHS, [18]) for individual populations, and cross population extended haplotype homozygosity (XP-EHH, [19]) to estimate selection between populations. To calculate these statistics we used a customized analytical pipeline [10]. For these analyses we assumed that the 482Ser allele was the derived allele, based on low frequencies of this allele in African populations [5]. F_ST was calculated for the entire chromosome using the Weir and Cockerham method with negative values manually set to zero [17]. Quantiles of 2.5 and 97.5% were used to find the most extreme 5% of values. Tajima’s D [20] and Fay and Wu’s H [21] were calculated for the whole chromosome using bins of 1, 5 and 30 kbp. For Tajima’s D across the entire chromosome, the 2.5 and 97.5% quantiles were used as thresholds to establish the most extreme 5% of values.
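As an illustration of one of the site frequency spectrum statistics named above, the following is a minimal sketch of Tajima's D for a single window, using the standard formula from Tajima [20]. It is not the pipeline of [10]: the function name, the haplotype encoding ('0' ancestral, '1' derived) and the toy inputs are assumptions made for the example.

```python
import math

def tajimas_d(haplotypes):
    """Tajima's D for equal-length haplotype strings of '0' and '1' in one window."""
    n = len(haplotypes)
    if n < 4:
        return float("nan")  # the variance term degenerates for very small samples
    # Derived-allele count per site; keep segregating sites only.
    counts = [sum(h[i] == "1" for h in haplotypes) for i in range(len(haplotypes[0]))]
    seg = [k for k in counts if 0 < k < n]
    s = len(seg)
    if s == 0:
        return float("nan")
    # Mean number of pairwise differences (pi).
    pi = sum(2.0 * k * (n - k) / (n * (n - 1)) for k in seg)
    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / i ** 2 for i in range(1, n))
    b1 = (n + 1) / (3.0 * (n - 1))
    b2 = 2.0 * (n * n + n + 3) / (9.0 * n * (n - 1))
    c1 = b1 - 1.0 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1 ** 2
    e1 = c1 / a1
    e2 = c2 / (a1 ** 2 + a2)
    variance = e1 * s + e2 * s * (s - 1)
    if variance <= 0:
        return float("nan")
    return (pi - s / a1) / math.sqrt(variance)

# Toy windows: an excess of singletons versus intermediate-frequency alleles.
singletons = ["10000", "01000", "00100", "00010", "00001"] + ["00000"] * 5
balanced = ["11111"] * 5 + ["00000"] * 5
print(tajimas_d(singletons), tajimas_d(balanced))
```

An excess of rare alleles gives a negative D and an excess of intermediate-frequency alleles gives a positive D, which is why strongly positive values are read as possible balancing selection and strongly negative values as a possible sweep.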

The software package selscan [22] was used to calculate iHS and XP-EHH with values for both being normalised in frequency bins genome-wide. Cross population extended haplotype homozygosity was used to detect selection of alleles at or near fixation, and was calculated between populations as described in [19]. An iHS or XP-EHH absolute value of greater than 3.29 was used as a threshold to estimate the most extreme 1% of values from the score distribution. The iHS normalisation conformed to a standard Gaussian distribution for all populations. For XP-EHH, all pairwise comparisons were performed between the groups of populations that had been genotyped with genome-wide SNP arrays or via WGS.
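The idea of normalising scores within frequency bins can be sketched as follows. This is not selscan's implementation: the function name, the number of bins (`n_bins`) and the use of the population standard deviation are assumptions chosen for illustration only.

```python
from collections import defaultdict
from statistics import mean, pstdev

def normalise_in_bins(scores, freqs, n_bins=20):
    """Standardise each score against scores with a similar derived-allele frequency."""
    bins = defaultdict(list)
    for idx, f in enumerate(freqs):
        b = min(int(f * n_bins), n_bins - 1)  # a frequency of 1.0 falls in the last bin
        bins[b].append(idx)
    out = [float("nan")] * len(scores)
    for members in bins.values():
        vals = [scores[i] for i in members]
        mu, sd = mean(vals), pstdev(vals)
        for i in members:
            # Bins with no spread cannot be standardised and stay as NaN.
            out[i] = (scores[i] - mu) / sd if sd > 0 else float("nan")
    return out

# Toy input: two frequency bins with very different raw score scales.
raw = [1.0, 2.0, 3.0, 10.0, 20.0, 30.0]
freq = [0.11, 0.12, 0.13, 0.81, 0.82, 0.83]
z = normalise_in_bins(raw, freq, n_bins=10)
extreme = [i for i, v in enumerate(z) if abs(v) > 3.29]  # threshold used in the text
print(z, extreme)
```

Standardising within bins puts SNPs of different allele frequency on a common scale, so that a single absolute-value threshold can be applied genome-wide.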

Fourteen combinations of populations and genes (corresponding to six genes) previously reported by Voight et al. [18] as showing evidence of selection were analysed as positive controls to provide insight into the overall power of detecting signatures of selection using the methods described above (Additional file 1: Table S1).

Results

Association analyses

Tests for association between BMI and rs8192678 genotype did not reveal any statistically significant association for any of the gene action models (Table 4). Similarly, no relationship was observed between rs8192678 genotype and either gout affection or T2D status, for any of the modes of genetic action (Table 4). There were also no statistically significant associations observed for the meta-analysis of the Polynesian populations between BMI, or gout affection, or T2D status and rs8192678 under any of the modes of genetic action.

Table 4 Results from a single marker generalised linear model for rs8192678 and BMI, T2D and gout

Selection analyses

Intra-population tests for selection – Tajima’s D, Fay and Wu’s H and iHS

Estimates of Tajima’s D and Fay and Wu’s H for windows surrounding the Gly482Ser position were within the +/− 2.5% thresholds for all populations and window lengths (Tables 5 and 6). Only the two Chinese populations (CHB and CHS) had strong positive values for Tajima’s D in the 1 kbp region surrounding the variant, suggesting possible balancing selection; however, estimates decayed to values close to the chromosome average for the 30 kbp window size (Table 5), and were well within the +/− 2.5% thresholds. For Tajima’s D in 1 kbp windows that overlapped the PPARGC1A-encoding region, all populations had windows that exceeded both the +2.5 and −2.5% thresholds (Additional file 2: Figure S1c), but for the 5 kbp windows only the Māori population had windows exceeding the lower threshold (Additional file 2: Figure S1b), indicating the possibility of a selective sweep. However, no population had any window in the PPARGC1A genic region exceeding either threshold for the 30 kbp windows for Tajima’s D (Additional file 2: Figure S1a). Moreover, there did not appear to be a clear pattern within any population that was consistent with a selective sweep: estimates for the different window sizes were typically dispersed around the chromosome mean for each population rather than showing a pattern where values consistently exceeded the chromosome thresholds. Similarly, for Fay and Wu’s H, no windows overlapping the PPARGC1A region exceeded the +/− 2.5% thresholds with windows of 30 kbp (Additional file 3: Figure S2a). Only the Samoan population had a window that exceeded the threshold in the PPARGC1A genic region for the 5 kbp windows (Additional file 3: Figure S2b). For the 1 kbp windows, both the Māori and Samoan populations had regions that exceeded the lower thresholds, indicating the possibility of an excess of high frequency derived alleles (Additional file 3: Figure S2c). However, for the region containing rs8192678, no window exceeded the +/− 2.5% thresholds in any population, irrespective of window size (Table 6). Neither of these site frequency spectra-based tests therefore revealed evidence of selection in the regions containing the Gly482Ser substitution.
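The use of chromosome-wide +/− 2.5% thresholds can be sketched as an empirical-quantile outlier check. The helper names and the linear-interpolation quantile are assumptions; the source does not state how quantiles were computed.

```python
def empirical_quantile(values, q):
    """Quantile (0 <= q <= 1) with linear interpolation between order statistics."""
    ordered = sorted(values)
    pos = q * (len(ordered) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(ordered) - 1)
    return ordered[lo] + (pos - lo) * (ordered[hi] - ordered[lo])

def flag_outlier_windows(window_values, lower_q=0.025, upper_q=0.975):
    """Indices of windows below the lower or above the upper chromosome-wide quantile."""
    lower = empirical_quantile(window_values, lower_q)
    upper = empirical_quantile(window_values, upper_q)
    return [i for i, v in enumerate(window_values) if v < lower or v > upper]

# Toy chromosome: 101 window statistics with values 0..100.
toy_windows = [float(v) for v in range(101)]
print(flag_outlier_windows(toy_windows))
```

Under this scheme a window is only called extreme relative to the rest of the same chromosome in the same population, so the check is a ranking rather than a formal significance test.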

Table 5 Tajima’s D at PPARGC1A calculated with window sizes of 1 kbp, 5 kbp, and 30 kbp

Table 6 Fay’s and Wu’s H at PPARGC1A calculated using window sizes of 1 kbp, 5 kbp, and 30 kbp

We also calculated integrated haplotype score (iHS) statistics for rs8192678 for each population using a genetic map. The iHS is a statistic to detect evidence of recent positive selection at a locus and is based on the differential levels of linkage disequilibrium surrounding a positively selected allele compared to the other allele at the same position. Only the Samoan population had |iHS| > 3.29 when computed using a genetic map (corresponding to the most extreme 1% of |iHS| values) (Table 7, Fig. 1). Plots of iHS +/− 1 Mbp of the PPARGC1A-encoding region showed relatively few locations within this region that exceeded the 3.29 threshold in any of the populations investigated (Fig. 1).
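The haplotype-based idea behind iHS can be illustrated with extended haplotype homozygosity (EHH), the quantity that iHS integrates separately for the two alleles at a core SNP. The sketch below is a simplified illustration, not the selscan algorithm: the function names, the '0'/'1' haplotype strings and index-based positions are assumptions.

```python
from collections import Counter

def ehh(haplotypes, core, pos):
    """Probability that two random haplotypes are identical from `core` to `pos`."""
    n = len(haplotypes)
    if n < 2:
        return float("nan")
    lo, hi = min(core, pos), max(core, pos)
    counts = Counter(h[lo:hi + 1] for h in haplotypes)
    return sum(c * (c - 1) for c in counts.values()) / (n * (n - 1))

def ehh_by_core_allele(haplotypes, core, pos):
    """EHH computed separately for carriers of the derived ('1') and ancestral ('0') allele."""
    derived = [h for h in haplotypes if h[core] == "1"]
    ancestral = [h for h in haplotypes if h[core] == "0"]
    return ehh(derived, core, pos), ehh(ancestral, core, pos)

# Toy data: derived-allele carriers share one long haplotype, ancestral carriers do not.
toy_haps = ["1111", "1111", "1111", "0100", "0010", "0001"]
print(ehh_by_core_allele(toy_haps, core=0, pos=3))
```

An allele that has risen in frequency quickly tends to sit on long, shared haplotypes, so its EHH decays more slowly with distance than that of the alternative allele; iHS summarises that contrast as a log ratio of the two integrated EHH curves.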

Table 7 iHS and XP-EHH statistics for rs8192678

We also evaluated six genes that had previously been reported as exhibiting evidence of selection by Voight et al. (2006) in specific ancestries. Here, there were 14 combinations of genes and populations (Additional file 1: Table S1) since two ancestries, Asian and Caucasian, were each represented by two populations in this study (CHS and CHB for Asians, and CEU and GBR for Caucasians; see Additional file 1: Table S1). Seven of the 14 gene-population combinations showed evidence of selection in the 1000 Genomes population samples used for this study (where we define evidence of selection as >1 SNP exceeding the 1% threshold of iHS values, Additional file 1: Table S1). Overall, the analytical methods used in this study detected evidence of selection at half of the tested loci previously shown to be under selection [18].

Inter-population tests for selection – F_ST and XP-EHH

Population differentiation at rs8192678 was investigated by calculating F_ST statistics for all pairwise combinations of all populations that had been genotyped with the TaqMan® assay. Derived allele frequencies from the WGS and chip datasets were very similar to those found in Myles et al. [4]: Polynesian (NZ Māori 0.83, Samoan 0.72), Asian (CHB 0.37, CHS 0.45), European (CEU 0.37, GBR 0.33) and African (YRI 0.04). Results showed strong differentiation between NZ Caucasian and all of the Polynesian populations, with NZ Māori and Caucasian being the most differentiated (0.332, Table 8). Tongan and Samoan populations were less differentiated from the NZ Caucasian population (0.160 and 0.210 respectively). East–West Polynesia differentiation [23] was also apparent, with F_ST between NZ Māori and Samoan (0.033) and NZ Māori and Tongan (0.069) populations greater than between NZ Māori and CI Māori (0.000). The Samoan and Tongan populations had relatively small differentiation between them (F_ST = 0.003). Population differentiation was also estimated for the Gly482Ser variant in the genome-wide data sets (Table 9). For the Gly482Ser substitution, Māori were more strongly differentiated than the Samoan population from all of the 1000 Genomes populations, with the strongest differentiation from the Yoruban population, followed by the European and Chinese populations. A similar but less extreme trend was observed for the Samoan population: F_ST was highest for the Yoruban population, with decreasing levels of differentiation for the European and Chinese populations respectively. Lower F_ST between Samoan and Caucasian populations compared to the Māori and Caucasian populations was observed in both the genome-wide genotyped populations and the TaqMan®-genotyped populations.
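To make the link between allele frequencies and differentiation concrete, the sketch below computes a simple heterozygosity-based F_ST, (H_T − H_S)/H_T, for a single biallelic SNP from the derived allele frequencies quoted above. This is deliberately not the Weir and Cockerham estimator used in the study (it ignores sample sizes and genotype counts), so its values are not expected to match Tables 8 and 9; the function name and equal population weighting are assumptions.

```python
def fst_two_pops(p1, p2):
    """Simple F_ST = (H_T - H_S) / H_T for one biallelic SNP, equal population weights."""
    p_bar = (p1 + p2) / 2.0
    h_t = 2.0 * p_bar * (1.0 - p_bar)                              # pooled heterozygosity
    h_s = (2.0 * p1 * (1.0 - p1) + 2.0 * p2 * (1.0 - p2)) / 2.0    # mean within-population
    return 0.0 if h_t == 0 else (h_t - h_s) / h_t

# Derived (482Ser) allele frequencies as quoted in the text.
freqs = {"NZ Maori": 0.83, "Samoan": 0.72, "CHS": 0.45, "CEU": 0.37, "YRI": 0.04}
for other in ("CHS", "CEU", "YRI"):
    print("NZ Maori vs", other, round(fst_two_pops(freqs["NZ Maori"], freqs[other]), 3))
```

Even this crude measure reproduces the qualitative ordering reported for Māori, with differentiation increasing from the Chinese to the European to the Yoruban comparison, because F_ST at a single SNP is driven by the size of the allele frequency difference.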

Table 8 Single marker F_ST for rs8192678

Table 9 Chromosome 4 F_ST for rs8192678 in WGS and imputed chip data using 1 bp windows

We also calculated F_ST for 5 Mbp windows surrounding the Gly482Ser location to determine the (genomic) extent of population differentiation (Table 10) over a wider window. Compared with the estimates at the rs8192678 locus itself, 5 Mbp window-based F_ST estimates were lower but still revealed the same trend of most differentiation between the Polynesian and Yoruban populations, with European populations the next most differentiated, and Chinese populations the least differentiated (Table 9). However, F_ST estimates between Māori and other genome-wide genotyped populations differed little from estimates between Samoan and other populations.

Table 10 Chromosome 4 F_ST for WGS and imputed chip data using 5 Mbp windows

Cross population extended haplotype homozygosity was calculated across chromosome 4 for all combinations of populations that had been resequenced or genotyped genome-wide. There was no combination of populations where XP-EHH estimates exceeded the |3.29| threshold in the PPARGC1A region (Table 7). Cross population extended haplotype homozygosity values were also plotted across the aforementioned region by pairwise population combination (Fig. 1).

Discussion and conclusions

The primary aim of this study was to directly test the hypothesis that the PPARGC1A locus, in particular the Gly482Ser substitution, has been subject to natural selection in the progenitors of contemporary Polynesian populations. The appropriate basis for concluding a specific gene or region is subject to natural selection has been defined by Vitti et al. [24] as ‘A combination of genomic and functional evidence constitutes the current standard for the field’. In this study we combined association analyses between the Gly482Ser genotype and traits either directly (BMI) or indirectly (gout and T2D, which are correlated with BMI) to identify a potential functional role of Gly482Ser. To identify genomic evidence we used a recently-developed analytical pipeline to test selection by a combination of site frequency spectrum-based statistics (Tajima’s D, Fay and Wu’s H) as well as haplotype-length based measures that examine selection within populations (iHS) or between populations (XP-EHH, [10]). We also estimated population differentiation (F_ST), which has also been used as an indicator of selection [25], as well as departure from expected Hardy-Weinberg equilibrium. Of these various approaches, we were able to detect departures from the expected neutral model for F_ST only; no other statistics indicated consistent functional or genomic evidence for selection. However, reasons other than selection, such as local co-ancestry, can lead to outlier values for F_ST [26]; therefore the F_ST results alone are not sufficient to conclude the presence of selection.

Lack of evidence of selection in other statistical tests may be a consequence of lack of power. In this regard, two factors are important in interpreting results from such analyses: (1) the nature of positive selection being investigated, and (2) the power of each of the statistical tests to reject the null hypothesis of no departure from the neutral model. Regarding (1), two intra-specific selection scenarios are frequently examined: ‘hard sweeps’ within populations based on selective advantages arising from a de novo mutation with strong positive effects on fitness, and ‘soft sweeps’ based on extant variants underpinning heritable characteristics that are typically under polygenic control. Power (i.e., (2), above) to detect evidence of selection for the various methods used here differs depending upon these scenarios [27]. For the 482Ser allele investigated in this study, putative selection could be either via hard or soft sweeps. On the one hand, the previously reported absence of the derived 482Ser allele in some African and New Guinean populations [5] implies selection on a de novo mutation arising prior to ancestors of modern Homo sapiens migrating out of Africa. On the other hand, there is a relative dearth of evidence of hard sweeps in humans [28]. In addition, if PPARGC1A were indeed under selection, it would have been subject to concomitant selection in multiple Polynesian and other populations (e.g., [29]), acting on extant variation at this locus. This suggests soft sweeps are a more likely scenario, especially as Gly482Ser is possibly one of very many small effect loci impacting T2D and correlated conditions such as BMI [30, 31].

Of the methods used in this study to directly examine evidence of selection, none indicated that selection has occurred at this locus (Tables 4, 5, 6 and 7, and Fig. 1). The power of Tajima’s D, Fay and Wu’s H, and iHS to detect evidence of selection has recently been evaluated by Ferrer-Admetlla et al. [27] for both hard and soft sweeps. For Tajima’s D they reported variable power across a wide range of selection coefficients, window sizes, and final (derived) allele frequencies for hard sweeps. In hard sweeps where ending allele frequencies were similar to those observed in Māori populations (Table 4), moderate-high power was reported, but in soft sweeps power was relatively low (<0.2). Even if such tests are underpowered, the high number of populations, some of which are from the same ancestry (i.e., CEU and GBR being Caucasian, and CHB and CHS being Asian), would likely have detected selection should a hard sweep at this locus be occurring. Moreover, it is reasonable to assume that if the 482Ser allele were a genuine thrifty gene variant, selection would be occurring in all populations rather than being restricted to the Polynesian populations, as most ancestral populations would have experienced periodic limitations in food availability. For iHS, Ferrer-Admetlla et al. [27] reported power was generally high (>0.75) irrespective of whether hard or soft sweeps had occurred, and was similarly robust to different ending allele frequencies, which ranged from 0.5 to 0.9. The derived 482Ser allele frequency estimates in all of the Polynesian sample sets investigated in this study fall within this range.

To provide further insights into whether or not the lack of evidence of selection according to these statistics was due to inadequate sample sizes, we also analysed 14 population/gene combinations that had previously been identified as showing evidence of selection [18]. In our population samples, seven of these 14 combinations had >1 SNP exceeding the 1% iHS threshold, demonstrating that our methods could identify loci previously shown to be under selection (Additional file 1: Table S1). Moreover, in all of the seven cases, multiple SNPs exceeded this threshold, and for two genes, there was evidence of selection in both population samples representing the same ancestry (i.e., evidence of selection in both CEU and GBR populations for LCT, and in both CHB and CHS for SLC445A5, see Additional file 1: Table S1). This indicates that population samples of this size have approximately 50% power to detect selection in the 1000 Genomes populations, none of which were case–control studies, and that when selection is present, there should be multiple SNPs exceeding the threshold. Based on the above results, and treating the two populations as independent, we therefore expect a probability of approximately 1 − 0.5² = 0.75 that at least one of the Samoan and Māori populations used in this study would show evidence of selection using the iHS test. However, the (case–control) Samoan and Māori cohorts had (only) one and zero SNPs, respectively, exceeding the 1% iHS threshold. We also contend that, if anything, the use of a case–control design would be more likely to increase the frequency of haplotypes carrying the selected allele(s), and thus more likely to improve power should this locus be subject to selection. Therefore the lack of a statistically significant difference in 482Ser allele frequencies between the case–control Māori cohort used in this study and the previously reported estimate by Myles et al. [5], from a cross-sectional study of a single Māori tribal group located on the East Coast of the North Island of NZ, is consistent with the lack of evidence of natural selection at this locus.
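The expected-detection arithmetic in this paragraph can be written out as a short calculation. The sketch assumes, as the calculation in the text implicitly does, that detection in each population is an independent event with the same per-population power; the function name is hypothetical.

```python
def prob_at_least_one(per_population_power, n_populations):
    """P(at least one population shows a signal), assuming independent detections."""
    return 1.0 - (1.0 - per_population_power) ** n_populations

# Power estimated from the positive controls: 7 of 14 combinations detected.
power = 7 / 14
expected = prob_at_least_one(power, 2)  # Samoan and Maori samples
print(expected)
```

If the two Polynesian samples share much of their ancestry, detections would be positively correlated and the true probability would fall somewhere between the single-population power and this independent-case value.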

A further consideration is the elevated derived 482Ser allele frequency in all non-African populations compared to the African populations [4, 5]. These show a progressive differentiation of populations at this locus. Thus, if PPARGC1A were a thrifty gene candidate that has been subjected to natural selection, selection would likely have occurred in all non-African ancestral populations due to factors such as climatic extremes, competition with other human groups, and/or the need to adapt to newly colonised environments - all of which are likely to contribute to periodic deprivations in food availability in progenitors of all populations studied.

The highest derived allele frequencies and strongest differentiation from extant African populations occur in the Cook Islands and NZ Māori populations. Therefore, if this locus is indeed a thrifty gene, recent migration histories would suggest that these populations would be likely to show evidence of association and/or selection, as ancestors of these populations colonised East Polynesia less than 1500 years ago [32]. Thus we would expect repeated evidence of selection at this locus in most, if not all, of the populations examined. However, the reverse is the case: there is no strong evidence of positive selection at this locus according to the iHS statistic, Tajima’s D or Fay and Wu’s H. Similarly, for the NZ Māori population, evidence of possible differential selection indicated by Tajima’s D in the PPARGC1A-encoding region exceeding the lower 2.5% quantile (Additional file 2: Figure S1a-c) was not reflected in either Fay and Wu’s H (Additional file 3: Figure S2a-c) or iHS (Fig. 1). Moreover, if selection on the derived 482Ser allele were occurring then this would be revealed by XP-EHH for at least some population pairs. However, none of the XP-EHH values exceeded the 1% threshold in any of the population pairs, nor were there any obvious differences in XP-EHH between pairs of populations that share ancestry (i.e., the CEU – GBR pair, and the CHB – CHS pair) compared to population pairs with different ancestries. Further, differences in derived 482Ser allele frequency within Polynesia, particularly those between Western Polynesian (i.e., Tongan and Samoan) and Eastern Polynesian (i.e., NZ and Cook Islands Māori) populations, are not explained by oceanic voyaging: the geographic distance between Tonga/Samoa and the Cook Islands is significantly less than that between the Cook Islands and NZ, yet the Cook Islands Māori 482Ser frequency differs from Samoa and Tonga and is virtually identical to NZ Māori. These differences are also reflected in the F_ST values (Table 8).

In addition, oral histories of Māori migrations make no reference to extensive loss of life on vaka/waka due to starvation during migration voyages [32], but do describe decisions by tribal groups, whose members were generally closely related (e.g., [33]), to migrate due to food competition and/or ongoing conflicts with other tribal groups - effectively increasing the possibility of genetic drift. We therefore conclude that the lack of evidence for selection in any of the statistical tests used in any of the populations examined in this study is likely an accurate biological reflection for the populations examined rather than a lack of statistical power to detect selective events at this locus with the methods used.

Existing 482Ser allele frequency distributions in the Pacific can be explained by a combination of (a) migration out of Africa by Homo sapiens progenitors and possibly Denisovans who either lacked the 482Ser allele or lost it via genetic drift (whose descendants include modern-day Melanesians such as Papuans), (b) later migration out of Africa by Homo sapiens progenitors of Polynesians subsequent to the mutation giving rise to the 482Ser allele, which drifted to increasingly higher frequencies in repeated migrations across the Pacific, and (c) subsequent admixture between these Melanesian progenitors and ancestors of modern Polynesians [34]. The higher derived allele frequencies and genetic differentiation in Eastern Polynesians are likely the result of genetic drift in the ancestral population, possibly as a result of founder effects. This model is consistent with the absence of the 482Ser allele in Denisovan DNA sequences; hence any derived populations with Melanesian admixture, such as Tongan populations, would have lower frequencies than Eastern Polynesian populations such as Cook Islands and NZ Māori. Genetic drift therefore could account for the progressive increase in allele frequency in modern non-African human populations, with founder effects arising from successive colonisation of islands within Polynesia resulting in the increase in 482Ser allele frequency.

Single marker regression analyses did not reveal any evidence of association between rs8192678 and BMI, gout affection status, or T2D in any of the populations investigated. Although each Polynesian population was small and likely to be underpowered for validating associations between small effect genes and conditions, no associations were found even when Polynesian populations were combined. We also found no evidence for sex-specific effects for any of the three traits in the combined Polynesian populations (data not shown). Our results contrast with those of Myles et al. [4] who reported associations for Tongan populations (n = 184, P = 0.014–0.037), but are consistent with the lack of association in a NZ Māori tribe (Ngāti Rakaipaaka, n = 110, P > 0.8) that was also reported by Myles et al. [4] as well as a small Tongan population reported by Kimura et al. [14]. The results from these multiple studies when considered together provide no repeated evidence for a functional role of this locus for traits such as T2D, BMI or gout in extant Māori or Samoan populations. However, it is possible that non-additive interaction with unmeasured environmental exposures may obscure evidence for main effect association of Gly482Ser with metabolic phenotypes.

Based on the lack of evidence of association between the Gly482Ser variant and BMI and correlated diseases, as well as the lack of evidence from any of the tests for selection, we conclude that this study does not support natural selection at either the Gly482Ser variant or the PPARGC1A region in general. More extreme frequencies of the derived allele in the Polynesian populations can alternatively be explained by genetic drift associated with ancestral population bottlenecks during colonisation. Further, the lack of robust direct candidate gene-based evidence for the thrifty gene hypothesis, together with increasing evidence for the selective influence of infectious disease-causing agents on genome composition [35] and of other factors such as diet, responses to climate, and skin colour, collectively increases doubt regarding the ongoing validity of this hypothesis. This is supported by the absence of global evidence for selection at 65 T2D loci, with nominal evidence for selection at individual loci driven in equal measure by T2D protective and risk haplotypes [31].

Abbreviations

BMI: Body mass index
CEU: Utah residents with Northern and Western European ancestry
CHB: Chinese in Beijing, China
CHS: Southern Han Chinese, China
CI: Cook Island
GBR: British in England and Scotland
iHH: Integrated haplotype homozygosity
iHS: Integrated haplotype score
NZ: New Zealand
SNP: Single nucleotide polymorphism
T2D: Type 2 diabetes
WGS: Whole genome (re)sequence
XP-EHH: Cross population extended haplotype homozygosity
YRI: Yoruba in Ibadan, Nigeria

References

1. Neel JV. Diabetes mellitus: a “thrifty” genotype rendered detrimental by “progress”? Am J Hum Genet. 1962;14:353–62.
2. Kratzer JT, Lanaspa MA, Murphy MN, Cicerchi C, Graves CL, Tipton PA, et al. Evolutionary history and metabolic insights of ancient mammalian uricases. Proc Natl Acad Sci U S A. 2014;111:3763–8.
3. Handschin C. The biology of PGC-1α and its therapeutic potential. Trends Pharmacol Sci. 2009;30:322–9.
4. Myles S, Lea RA, Ohashi J, Chambers GK, Weiss JG, Hardouin E, et al. Testing the thrifty gene hypothesis: the Gly482Ser variant in PPARGC1A is associated with BMI in Tongans. BMC Med Genet. 2011;12:10.
5. Myles S, Hradetzky E, Engelken J, Lao O, Nürnberg P, Trent RJ, et al. Identification of a candidate genetic variant for the high prevalence of type II diabetes in Polynesians. Eur J Hum Genet. 2007;15:584–9.
6. Sundborn G, Metcalf PA, Gentles D, Scragg R, Dyall L, Black P, et al. Overweight and obesity prevalence among adult Pacific peoples and Europeans in the Diabetes Heart and Health Study (DHAHS) 2002–2003, Auckland New Zealand. N Z Med J. 2010;123:30–42.
7. Winnard D, Wright C, Taylor WJ, Jackson G, Te Karu L, Gow PJ, et al. National prevalence of gout derived from administrative health data in Aotearoa New Zealand. Rheumatology. 2012;51:901–9.
8. Winnard D, Wright C, Jackson G, Gow P, Kerr A, McLachlan A, et al. Gout, diabetes and cardiovascular disease in the Aotearoa New Zealand adult population: co-prevalence and implications for clinical practice. N Z Med J. 2013;126:53–64.
9. Merriman TR. Population heterogeneity in the genetic control of serum urate. Semin Nephrol. 2011;31:420–5.
10. Cadzow M, Boocock J, Nguyen HT, Wilcox P, Merriman TR, Black MA. A bioinformatics workflow for detecting signatures of selection in genomic data. Front Genet. 2014;5:1–8.
11. Hollis-Moffatt J, Phipps-Green A, Chapman B, Jones G, van Rij A, Gow P, et al. The renal urate transporter SLC17A1 locus: confirmation of association with gout. Arthritis Res Ther. 2012;14:R92.
12. Wallace SL, Robinson H, Masi AT, Decker JL, McCarty DJ, Yü TF. Preliminary criteria for the classification of the acute arthritis of primary gout. Arthritis Rheum. 1977;20:895–900.
13. 1000 Genomes Project FTP. 2015. ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/release/20130502/. Accessed 18 Aug 2015.
14. Kimura R, Ohashi J, Matsumura Y, Nakazawa M, Inaoka T, Ohtsuka R, et al. Gene flow and natural selection in oceanic human populations inferred from genome-wide SNP typing. Mol Biol Evol. 2008;25:1750–61.
15. R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. 2015. URL https://www.R-project.org/.
16. Graffelman J. Exploring Diallelic Genetic Markers: The HardyWeinberg Package. J Stat Softw. 2015;64:1–22.
17. Weir BS, Cockerham CC. Estimating F-Statistics for the Analysis of Population Structure. Evolution (N Y). 1984;38:1358–70.
18. Voight BF, Kudaravalli S, Wen X, Pritchard JK. A map of recent positive selection in the human genome. PLoS Biol. 2006;4:e72.
19. Tang K, Thornton KR, Stoneking M. A new approach for using genome scans to detect recent positive selection in the human genome. PLoS Biol. 2007;5:e171.
20. Tajima F. Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics. 1989;123:585–95.
21. Fay JC, Wu CI. Hitchhiking under positive Darwinian selection. Genetics. 2000;155:1405–13.
22. Szpiech ZA, Hernandez RD. Selscan: an efficient multithreaded program to perform EHH-based scans for positive selection. Mol Biol Evol. 2014;31:2824–7.
23. Phipps-Green AJ, Hollis-Moffatt JE, Dalbeth N, Merriman ME, Topless R, Gow PJ, et al. A strong role for the ABCG2 gene in susceptibility to gout in New Zealand Pacific Island and Caucasian, but not Māori, case and control sample sets. Hum Mol Genet. 2010;19:4813–9.
24. Vitti JJ, Grossman SR, Sabeti PC. Detecting natural selection in genomic data. Annu Rev Genet. 2013;47:97–120.
25. Xue Y, Zhang X, Huang N, Daly A, Gillson CJ, MacArthur DG, et al. Population Differentiation as an Indicator of Recent Positive Selection in Humans: An Empirical Evaluation. Genetics. 2009;183:1065–77.
26. Bierne N, Roze D, Welch JJ. Pervasive selection or is it …? why are F_ST outliers sometimes so frequent? Mol Ecol. 2013;33:2061–4.
27. Ferrer-Admetlla A, Liang M, Korneliussen T, Nielsen R. On detecting incomplete soft or hard selective sweeps using haplotype structure. Mol Biol Evol. 2014;31:1275–91.
28. Fu W, Akey JM. Selection and adaptation in the human genome. Annu Rev Genomics Hum Genet. 2013;14:467–89.
29. Ségurel L, Austerlitz F, Toupance B, Gautier M, Kelley JL, Pasquet P, et al. Positive selection of protective variants for type 2 diabetes from the Neolithic onward: a case study in Central Asia. Eur J Hum Genet. 2013;21:1146–51.
30. Lohmueller KE, Pearce CL, Pike M, Lander ES, Hirschhorn JN. Meta-analysis of genetic association studies supports a contribution of common variants to susceptibility to common disease. Nat Genet. 2003;33:177–82.
31. Ayub Q, Moutsianas L, Chen Y, Panoutsopoulou K, Colonna V, Pagani L, et al. Revisiting the thrifty gene hypothesis via 65 loci associated with susceptibility to type 2 diabetes. Am J Hum Genet. 2014;94:176–85.
32. Buck PH. The Coming of the Māori by Te Rangi Hiroa. New Zealand: Thomas Avery & Sons; 1929.
33. Stafford DM. Te Arawa: A history of the Arawa people. Reed; 1967.
34. Kayser M, Brauer S, Cordaux R, Casto A, Lao O, Zhivotovsky LA, et al. Melanesian and Asian origins of Polynesians: mtDNA and Y chromosome gradients across the Pacific. Mol Biol Evol. 2006;23:2234–44.
35. Cagliani R, Sironi M. Pathogen-driven selection in the human genome. Int J Evol Biol. 2013;2013:1–6.

Acknowledgements

This work was supported by contract C04X0804 from the New Zealand Ministry of Business, Innovation and Enterprise, and by the New Zealand Health Research Council. PW was also supported by a sabbatical research fellowship from New Zealand Forest Research Institute Limited. Supercomputing resources provided by the New Zealand National eScience Infrastructure (NeSI, see https://www.nesi.org.nz/) were used to run the analytical pipeline. The authors also acknowledge helpful comments on an earlier version of the manuscript made by an anonymous reviewer.

Funding

This research was funded by the Health Research Council of New Zealand (data generation) and the New Zealand Ministry of Business, Innovation and Enterprise via contract C04X0804 (data analyses and interpretation).

Availability of data and materials

The datasets used and/or analysed during the current study are available from the corresponding author upon reasonable request.

Authors’ contributions

MC carried out the genotyping, selection analyses and drafting of the manuscript. JB helped with the selection analysis and drafting of the manuscript. PW and TM conceived the study and participated in the design, interpretation of results, and coordination and helped draft the manuscript. MB participated in the statistical analysis and drafting of the manuscript. ND, LS and PV contributed samples and to the drafting of the manuscript. All authors read and approved the final manuscript.

Competing interests

The authors declare that they have no competing interests.

Consent for publication

Not applicable. There is no requirement for consent to publish from the researchers' host organisation, the funder, or any other organisation.

Ethics approval and consent to participate

Ethical approval was obtained from the New Zealand Multi-region Ethics committee (MEC/105/10/130). Each participant provided informed written consent.

Author information

Authors and Affiliations

Department of Biochemistry, University of Otago, Dunedin, New Zealand
Murray Cadzow, Tony R. Merriman, James Boocock, Michael A. Black & Phillip L. Wilcox
Virtual Institute of Statistical Genetics (www.visg.co.nz), Dunedin, New Zealand
Murray Cadzow, Tony R. Merriman, James Boocock, Michael A. Black, Peter M. Visscher & Phillip L. Wilcox
formerly Scion (New Zealand Forest Research Institute Ltd), 49 Sala Street, Rotorua, New Zealand
Phillip L. Wilcox
Department of Medicine, University of Auckland, Auckland, New Zealand
Nicola Dalbeth
Department of Medicine, University of Otago, Christchurch, New Zealand
Lisa K. Stamp
The Queensland Brain Institute, University of Queensland, Brisbane, Australia
Peter M. Visscher
University of Queensland Diamantina Institute, University of Queensland, Translational Research Institute (TRI), Brisbane, Australia
Peter M. Visscher
Department of Mathematics and Statistics, University of Otago, Science III Building, 730 Cumberland St, Dunedin, 9016, New Zealand
Phillip L. Wilcox

Corresponding author

Correspondence to Phillip L. Wilcox.

Additional files

Additional file 1: Table S1.

Fourteen previously detected combinations of populations and genes from Voight et al. (2006) were analysed using the selscan software package (Szpiech and Hernandez 2014) as positive controls, to provide insight into the likely power to detect signatures of selection. Seven of the 14 previously detected associations were replicated, where we defined evidence of selection as >1 SNP exceeding the 5% threshold of iHS values. Two genes (LCT in both CEU and GBR populations, and SLC44A5 in CHS and CHB populations) showed evidence of selection in populations with similar ancestry (Caucasian and Asian, respectively). Evidence of selection for one other gene (SNTG1) was observed in only one of two populations with the same ethnicity. (DOCX 25 kb)
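The criterion stated above (more than one SNP exceeding the 5% threshold of iHS values) could be sketched roughly as follows. This is an illustrative Python sketch, not the authors' pipeline, which used selscan. The function names are hypothetical. It assumes the 5% threshold means the top 5% of absolute standardised iHS values in a background set (for example, the rest of the chromosome), which the text does not specify.

```python
def top_fraction_cutoff(background_scores, top_fraction=0.05):
    """Smallest |score| that still falls in the top `top_fraction` of the background."""
    ranked = sorted((abs(s) for s in background_scores), reverse=True)
    if not ranked:
        raise ValueError("background_scores must not be empty")
    n_top = max(1, int(len(ranked) * top_fraction))
    return ranked[n_top - 1]


def shows_ihs_signal(gene_scores, background_scores, top_fraction=0.05, min_snps=2):
    """True if at least `min_snps` SNPs in the gene reach the background cutoff.

    min_snps=2 encodes the ">1 SNP" rule; treating the cutoff as inclusive
    is an assumption of this sketch.
    """
    cutoff = top_fraction_cutoff(background_scores, top_fraction)
    n_extreme = sum(1 for s in gene_scores if abs(s) >= cutoff)
    return n_extreme >= min_snps
```

Absolute values are used so that extreme negative and extreme positive iHS values are treated alike; a one-sided version would drop the `abs` calls.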

Additional file 2: Figures S1a-c.

Tajima’s D calculated across chromosome 4 by population using sliding windows of 30 kbp (Figure S1a), 5 kbp (Figure S1b) and 1 kbp (Figure S1c). In each figure, chromosome 4:22.7–24.9 Mbp is shown with the chromosome mean (blue) and the 2.5% and 97.5% quantiles (purple) from Table 4 marked. The location of PPARGC1A is marked in red and rs8192678 is marked by a red dashed line. (ZIP 1473 kb)
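For readers unfamiliar with the statistic plotted in these figures, the following Python sketch computes Tajima's D [20] per window from derived-allele counts. It is not the code used in this study. The function names are hypothetical, and the step size between windows is an assumption, since only the window widths are stated.

```python
import math


def tajimas_d(derived_counts, n_chromosomes):
    """Tajima's D for one window, given the derived-allele count at each site."""
    n = n_chromosomes
    # Keep only sites that are polymorphic in the sample.
    counts = [k for k in derived_counts if 0 < k < n]
    s = len(counts)
    if n < 4 or s == 0:
        return None  # D is undefined (zero variance or no segregating sites)
    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / i ** 2 for i in range(1, n))
    b1 = (n + 1) / (3.0 * (n - 1))
    b2 = 2.0 * (n * n + n + 3) / (9.0 * n * (n - 1))
    c1 = b1 - 1.0 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1 ** 2
    e1 = c1 / a1
    e2 = c2 / (a1 ** 2 + a2)
    # Mean pairwise differences (pi), summed over sites.
    pairs = n * (n - 1) / 2.0
    pi = sum(k * (n - k) for k in counts) / pairs
    return (pi - s / a1) / math.sqrt(e1 * s + e2 * s * (s - 1))


def windowed_tajimas_d(positions, derived_counts, n_chromosomes, window_bp, step_bp):
    """(window_start, D) pairs for windows of `window_bp` moved in steps of `step_bp`."""
    if not positions:
        return []
    results = []
    start, last = min(positions), max(positions)
    while start <= last:
        in_window = [k for pos, k in zip(positions, derived_counts)
                     if start <= pos < start + window_bp]
        results.append((start, tajimas_d(in_window, n_chromosomes)))
        start += step_bp
    return results
```

Windows without segregating sites yield `None` rather than zero, so they can be excluded when computing a chromosome-wide mean or quantiles.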

Additional file 3: Figures S2a-c.

Fay and Wu’s H calculated across chromosome 4 using sliding windows of 30 kbp (Figure S2a), 5 kbp (Figure S2b) and 1 kbp (Figure S2c). In each figure, chromosome 4:22.7–24.9 Mbp is shown with the chromosome mean (blue) and the 2.5% and 97.5% quantiles (purple) from Table 4 marked. The location of PPARGC1A is marked in red and rs8192678 is marked by a red dashed line. (ZIP 841 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

About this article

Cite this article

Cadzow, M., Merriman, T.R., Boocock, J. et al. Lack of direct evidence for natural selection at the candidate thrifty gene locus, PPARGC1A. BMC Med Genet 17, 80 (2016). https://doi.org/10.1186/s12881-016-0341-z

Received: 20 October 2015
Accepted: 01 November 2016
Published: 15 November 2016
DOI: https://doi.org/10.1186/s12881-016-0341-z
