Lack of direct evidence for natural selection at the candidate thrifty gene locus, PPARGC1A

Background The gene PPARGC1A, in particular the Gly482Ser variant (rs8192678), had been proposed to be subject to natural selection, particularly in recent progenitors of extant Polynesian populations. Reasons include high levels of population differentiation and increased frequencies of the derived type 2 diabetes (T2D) risk 482Ser allele, and association with body mass index (BMI) in a small Tongan population. However, no direct statistical tests for selection have been applied. Methods Using a range of Polynesian populations (Tongan, Māori, Samoan) we re-examined evidence for association between Gly482Ser with T2D and BMI as well as gout. Using also Asian, European, and African 1000 Genome Project samples a range of statistical tests for selection (F ST, integrated haplotype score (iHS), cross population extended haplotype homozygosity (XP-EHH), Tajima’s D and Fay and Wu’s H) were conducted on the PPARGC1A locus. Results No statistically significant evidence for association between Gly482Ser and any of BMI, T2D or gout was found. Population differentiation (F ST) was smallest between Asian and Pacific populations (New Zealand Māori ≤ 0.35, Samoan ≤ 0.20). When compared to European (New Zealand Māori ≤ 0.40, Samoan ≤ 0.25) or African populations (New Zealand Māori ≤ 0.80, Samoan ≤ 0.66) this differentiation was larger. We did not find any strong evidence for departure from neutral evolution at this locus when applying any of the other statistical tests for selection. However, using the same analytical methods, we found evidence for selection in specific populations at previously identified loci, indicating that lack of selection was the most likely explanation for the lack of evidence of selection in PPARGC1A. Conclusion We conclude that there is no compelling evidence for selection at this locus, and that this gene should not be considered a candidate thrifty gene locus in Pacific populations. High levels of population differentiation at this locus and the reported absence of the derived 482Ser allele in some Melanesian populations, can alternatively be explained by multiple out-of-Africa migrations by ancestral progenitors, and subsequent genetic drift during colonisation of Polynesia. Intermediate 482Ser allele frequencies in extant Western Polynesian populations could therefore be due to recent admixture with Melanesian progenitors. Electronic supplementary material The online version of this article (doi:10.1186/s12881-016-0341-z) contains supplementary material, which is available to authorized users.


Background
The thrifty gene theory is based on the hypothesis that alleles causing increased weight, insulin resistance and type 2 diabetes (T2D) in contemporary populations were an adaptation to fluctuating food availability [1], for example from extreme environmental events and extended sea travel, in ancestral populations. Any genuine thrifty gene variant would have been selectively advantageous and therefore subject to selection during human evolution. Thrifty genetic variants arising early in hominid evolution, for example the uricase knockout [2], would be expected to be monomorphic and shared by all human populations. However, candidate thrifty variants arising more recently in human evolution and polymorphic in contemporary populations should exhibit signatures of selection in the immediate genomic vicinity.
The PPARGC1A transcriptional regulator has a central role in insulin signalling and mitochondrial regulation [3]. The functional Gly482Ser (rs8192678) variant (reviewed in [4]) in the PPARGC1A gene is associated with body mass index (BMI) in a Tongan (Polynesian) population [4]. This led to the hypothesis that the Gly482Ser variant represents a candidate 'thrifty variant' in Pacific populations. This was based largely on the observation that the postulated thrifty (derived) 482Ser allele exhibited the highest prevalence world-wide in Polynesian populations, who also have high levels of T2D and BMI. In contrast, low T2D prevalence levels have been recorded in Papuan populations lacking the 482Ser allele [5]. This 482Ser allele observation also supported by a previous report of extreme F ST values (a measure of inter-population difference at specific genetic variants) at the PPARGC1A locus [5]. To date however, this hypothesis has not been formally tested in a range of Polynesian populations using more direct assessments of natural selection.
The Māori and non-indigenous Polynesian populations of Aotearoa New Zealand (NZ) have a high prevalence of obesity, T2D and related metabolic conditions such as gout [6][7][8]. The reasons for this appear to be complex and are the result of a combination of poor environmental dietary exposure (contributed to by socioeconomic status) and inherited genetic variants [9]. However, extremely little is known about the etiology of obesity and T2D in NZ Māori and Pacific. If the PPARGC1A gene were associated with weight and subject to selection in Pacific populations, then this would be important knowledge on the aetiology of these conditions in contemporary populations. Furthermore, evidence for selection during Polynesian population history would aid in the destigmatisation of obesity. Here, we tested for association between Gly482Ser rs8192678, BMI, T2D and gout in several Polynesian populations. We also examined evidence for recent positive selection at the derived 482Ser allele and in the region of PPARGC1A, by applying a recently developed analytical pipeline [10] to genomic data from the various Polynesian and 1000 Genomes Project world-wide populations.

Populations
New Zealand (NZ) Māori, Cook Island Māori, Samoan, Tongan and NZ Caucasian sample sets were examined (Table 1). Samples were collected as part of the "Genetics of Gout in Aotearoa" case-control study [11] and ethical approval was obtained from the New Zealand Multiregion Ethics committee (MEC/105/10/130). Each participant provided informed written consent. Patients with gout satisfied, by clinical examination, the American Rheumatism Association preliminary criteria for acute gout [12]. BMI and self-reported T2D status was recorded at time of recruitment. Subjects were classified into ancestry groups based on self-reported ancestry of grandparents. Table 1 provides the number of samples in each population, and the methods used for genotyping and data analyses. Tables 2 and 3 describe clinical data and ancestry information for the NZ sample sets. Five additional populations were obtained from the 1000 Genomes Project phase 3 data release (see [13]): two Asian populations: Han Chinese in Beijing, China (CHB), Southern Han Chinese, China (CHS); two of European ancestry: British in England and Scotland (GBR) and Utah residents with Northern and Western European ancestry (CEU); and one of African ancestry: Yoruba in Ibadan, Nigeria (YRI). The CHB and CHS populations are intermediate with regard to Gly482Ser frequencies between the African (YRI) and European (GBR, CEU) on the one hand and the Polynesian populations on the other [14].

Genotypic data
Two genotypic data sets were used for this study: (a) a SNP-specific assay corresponding to a Glycine (C allele) -Serine (T allele) substitution at amino acid position 482 in the PPARGC1A gene (dbSNP ID rs8192678 residing on chromosome 4 (GRCh37 chr4:23815662) and genotyped on a subset of the above populations (see Table 1); and (b) chromosome 4-wide genotypic data. For (a) SNP rs8192678 was genotyped with TaqMan® assay ID C___1643192_20 (Life Technologies, Carlsbad, CA) using a LightCycler 480 Real-Time PCR System (Roche Applied Science, Indianapolis, IN). The MINI MELT program was run from within the LightCycler software, and genotypes were assigned based on clustering within the software. Clusters were visually checked with misassigned genotypes corrected where appropriate or designated as unknown. 9% of samples were re-genotyped and cross checked as a quality control measure. All regenotyped samples had complete agreement with the original genotypes. For (b) genotypic data for whole chromosome analyses were obtained either directly from SNP microarrays or whole genome sequencing. For the SNP microarrays, 71 individuals who self-reported four Māori grandparents were genotyped using an Affymetrix AXIOM genome-wide ASI array (Table 1). Similarly, 96 individuals self-reporting four Samoan grandparents were genotyped with an Illumina Human OmniExpress bead chip. The individuals genotyped by SNP microarray were a subset from those genotyped with the Taqman® assay. Whole genome sequence (WGS) from 1000 Genomes Project phase 3 data release (via [13]) was used.

Association analyses
To determine associations between BMI and rs8192678 genotype, single marker linear regressions were undertaken using the R statistical software environment [15]. Hardy-Weinberg equilibrium exact tests were calculated using the R package HardyWeinberg [16]. For each population, three modes of gene action were tested: additive, dominant (both C and derived T allele), and over-dominant. Age, sex, gout affection status, T2D, and Structure-estimated ancestry proportions (calculated as described in [11]) were included as covariates for BMI. For T2D and gout affection status, logistic regression was performed with the same covariates. A metaanalysis of the Polynesian populations was undertaken by combining the NZ Māori, Cook Island Māori, Tongan and Samoan populations and repeating the marker trait regressions as described above with the added covariate of population.

Selection analyses
Pairwise F ST [17] was estimated for all populations for rs8192678. For populations where genome-wide genotypic data were available (i.e., the 1000 Genomes Project and Axiom/Omni-genotyped populations, Table 1), the following statistics were calculated: F ST between sample sets, Tajima's D, Fay and Wu's H, and integrated haplotype score (iHS, [18]) for individual populations, and cross population haplotype homozygosity (XP-EHH, [19]) to estimate selection between populations. To calculate these statistics we used a customized analytical pipeline [10]. For these analyses we assumed that the 482Ser allele was the derived allele, based on low frequencies of this allele in African populations [5]. F ST was calculated for the entire chromosome using the Weir and Cockerham method with negative values manually set to zero [17]. Quantiles of 2.5 and 97.5% were used to find the most extreme 5% of values. Tajima's D [20] and Fay and Wu's H [21] were calculated for the whole chromosome using bins of 1, 5 and 30 kbp. The Tajima's D for the entire chromosome with thresholds of the 2.5 and 97.5% quantiles used to establish the most extreme 5% of values.
The software package selscan [22] was used to calculate iHS and XP-EHH with values for both being normalised in frequency bins genome-wide. Cross population extended haplotype homozygosity was used to detect selection of alleles at or near fixation, and was calculated between populations as described in [19]. An iHS or XP-EHH absolute value of greater than 3.29 was used as a threshold to estimate the most extreme 1% of values from the score distribution. The iHS normalisation conformed to a standard Gaussian distribution for all populations. For XP-EHH, all pairwise comparisons were performed between the groups of populations that had been genotyped with genome-wide SNP arrays or via WGS. Fourteen combinations of populations and genes (corresponding to six genes) previously reported by Voight et al. [18] as showing evidence of selection were analysed as positive controls to provide insight into the overall power of detecting signatures of selection using the methods described above (Additional file 1: Table S1).

Association analyses
Tests for association between BMI and rs8192678 genotype did not reveal any statistically significant association for any of the gene action models (Table 4). Similarly, no relationship was observed between rs8192678 genotype and either gout affection or T2D status, for any of the modes of genetic action (Table 4). There were also no statistically significant associations observed for the meta-analysis of the Polynesian populations between BMI, or gout affection, or T2D status and rs8192678 under any of the modes of genetic action.

Intra-population tests for selection -Tajima's D, Fay and Wu's H and iHS
Estimates of Tajima's D and Fay and Wu's H for windows surrounding the Gly482Ser position, were within +/− 2.5% limits for all populations and window lengths (Tables 5 and 6). Only the two Chinese populations (CHB and CHS) had strong positive values for Tajima's D in the 1kbp region surrounding, suggesting possible balancing selection, however estimates decayed to values close to those of the chromosome average for the 30kbp window size (Table 5), and were well within +/− 2.5% thresholds. For Tajima's D in 1 kb windows that overlapped the PPARGC1A-encoding region, all populations had windows that exceeded both +2.5 and −2.5% thresholds (Additional file 2: Figure S1c), but for the 5 kb windows only the Māori population had windows exceeding the lower threshold (Additional file 2: Figure  S1b), indicating the possibility of a selective sweep. However, no population had any window in the PPARGC1A genic region exceeding either threshold for the 30kbp region for Tajima's D (Additional file 2: Figure S1a). Moreover, there did not appear to be a clear pattern within any population that was consistent with a selective sweep: estimates for the different window sizes were typically dispersed around the chromosome mean for each population rather than showing a pattern where values consistently exceeded the chromosome thresholds. Similarly, for Fay and Wu's H, no windows overlapping the PPARGC1A region that exceeded +/− 2.5% thresholds were observed with windows of 30 kbp (Additional file 3: Figure S2a). Only the Samoan population had a window that exceeded the threshold in the PPARGC1A genic region for the 5 kb windows (Additional file 3: Figure S2b). For the 1 kb windows, both the Māori and Samoan populations had regions that exceeded lower thresholds, indicating the possibility of an excess of high frequency derived alleles (Additional file 3: Figure S2c). However, for the region containing rs8192678, no window exceeded the +/− 2.5% thresholds in any population, irrespective of window size ( Table 6). Neither of these site frequency spectra-based tests therefore revealed evidence of selection in the regions containing the Gly482Ser substitution.
We also calculated integrated haplotype homozygosity score (iHS) statistics for rs8192678 for each population using a genetic map. The iHS is a statistic to detect evidence of recent positive selection at a locus and is based on the differential levels of linkage disequilibrium surrounding a positively selected allele compared to the other allele at the same position. Only the Samoan population had |iHS| > 3.29 when computed using a genetic map (corresponding to the most extreme 1% of |iHS| values) ( Table 7, Fig. 1). Plots of iHS +/− 1 Mbp of the PPARGC1A-encoding region showed relatively few locations within this region that exceeded the 3.29 threshold in any of the populations investigated (Fig. 1).  We also evaluated six genes that had previously been reported as exhibiting evidence of selection by Voight et al. (2006) in specific ancestries. Here, there were 14 combinations of genes and populations (Additional file 1: Table S1) since two ancestries -Asian and Caucasianwere each represented by two populations each in this study (CHS and CHB for Asians, and CEU and GBR for Caucasianssee Additional file 1: Table S1). Seven of the 14 genepopulation combinations showed evidence of selection in the 1000 Genomes populations samples used for this study (where we define evidence of selection as >1 SNP exceeding the 1% threshold of iHS values, Additional file 1: Table S1). Overall, the analytical methods used in this study detected evidence of selection at half of the loci previously shown to be under selection [18] that were tested.

Inter-population tests for selection -F ST and XP-EHH
Population differentiation at rs8192678 was investigated by calculating F ST statistics for all pairwise combinations all populations that had been genotyped with the Taq-Man® assay. Derived allele frequencies from the WGS and chip datasets were very similar to those found in Myles et al. [4] with Polynesian (NZ Māori 0.83, Samoan 0.72), Asian (CHB 0.37, CHS 0.45), European (CEU 0.37, GBR 0.33) and African (YRI 0.04). Results showed strong differentiation between NZ Caucasian and all of the Polynesian populations, with NZ Māori and Caucasian being the most differentiated (0.332, Table 8). Tongan and Samoan populations were less differentiated from the NZ Caucasian population (0.160 and 0.210 respectively). East -West Polynesia differentiation [23] was also apparent, with F ST between NZ Māori and Samoan (0.033) and NZ Māori and Tongan (0.069) populations greater than NZ Māori and CI Māori (0.000). The Samoan and Tongan populations had relatively small differentiation between them (F ST = 0.003). Population differentiation was also estimated for the Gly482Ser variants in the genome-wide data sets (Table 9). For the Gly482Ser substitution, Māori were more strongly differentiated than the Samoan population from all of the 1000 Genomes populations, with strongest differentiation occurring in the Yoruban population followed by the European and Chinese populations. A similar but less extreme trend was observed for the Samoan We also calculated F ST for 5Mbp windows surrounding the Gly482Ser location to determine the (genomic) extent of population differentiation (Table 10) over a wider window. Compared with the estimates at the rs8192678 locus itself, 5Mbp window-based F ST estimates were lower but still revealed the same trend of most differentiation between the Polynesian and Yoruban populations, with European populations the next most differentiated, and Chinese populations the least differentiated (Table 9). However, F ST estimates between Māori and other genome-wide genotyped populations differed little from estimates between Samoan and other populations.
Cross population extended haplotype homozygosity was calculated across chromosome 4 for all combinations of populations that had been resequenced or genotyped genome-wide. There was no combination of populations where XP-EHH estimates exceeded the |3.29| threshold in the PPARGC1A region (Table 7). Cross population extended haplotype homozygosity values were also plotted across the aforementioned region by pairwise population combination (Fig. 1).

Discussion and conclusions
The primary aim of this study was to directly test the hypothesis that the PPARGC1A locus, in particular the Gly482Ser substitution, has been subject to natural selection in the progenitors of contemporary Polynesian populations. The appropriate basis for concluding a specific gene or region is subject to natural selection has been defined by Vitti et al. [24] as 'A combination of genomic and functional evidence constitutes the current standard for the field'. In this study we combined association analyses between the Gly482Ser genotype and traits either directly (BMI) or indirectly (gout and T2D, which are correlated with BMI) to identify a potential functional role of Gly482Ser. To identify genomic evidence we used a recently-developed analytical pipeline to test selection by a combination of site frequency spectra based statistics, (Tajima's D, Fay's and Wu's H) as well as haplotype-length based measures that examine selection within populations (iHS) or between populations (XP-EHH, [10]). We also estimated population differentiation (F ST ), which has also been used as an indicator of selection [25], as well as departure from expected HW equilibrium. Of these various approaches, we were able to detect departures from the expected neutral selection model for F ST onlyno other statistics indicated consistent functional or genomic evidence for selection. However, reasons other than selection such as local coancestry can lead to outlier values for F ST [26] therefore the F ST results alone are not sufficient to conclude the presence of selection. Lack of evidence of selection in other statistical tests may be a consequence of lack of power. In this regard, two factors are important in interpreting results from such analyses: (1) the nature of positive selection being investigated, and (2) power of each of the statistical tests to reject the null hypothesis of no departure from neutral model of selection. Regarding (1), two intra-specific selection scenarios were frequently examined: 'hard sweeps' within populations based on selective advantages  arising from a de novo mutation with strong positive effects on fitness, and 'soft sweeps' based on extant variants underpinning heritable characteristics that are typically under polygenic control. Power (i.e., (2), above) to detect evidence of selection for the various methods used here differs depending upon these scenarios [27].
For the 482Ser allele investigated in this study, putative selection could be either via hard or soft sweeps. On the one hand, the previously reported absence of the derived 482Ser allele in some African and New Guinean populations [5] imply selection on a de novo mutation arising prior to ancestors of modern Homo sapiens migrating out Africa. On the other hand however, there is a relative dearth of evidence of hard sweeps in humans [28]. In addition, if PPARGC1A were indeed under selection, it would have been subject to concomitant selection in multiple Polynesian and other populations (e.g., [29]) and acting on extant variation at this locus. This suggests soft sweeps are a more likely scenario, especially as Gly482Ser is possibly one of very many small effect loci impacting the T2D and correlated conditions such as BMI [30,31]. Of the methods used in this study to directly examine evidence of selection, none indicate selection has possibly occurred at this locus (Tables 4, 5, 6 and 7, and Fig. 1). The power of Tajima's D, Fay's and Wu's H, and iHS to detect evidence of selection has recently been evaluated by Ferrer-Admetlla et al. [27] for both hard and soft sweeps. For Tajima's D they reported variable power across a wide range of selection coefficients, window sizes, and final (derived) allele frequencies for hard sweeps. In hard sweeps where ending allele frequencies were similar to those observed in Māori populations (Table 4), moderate-high power was reported, but in soft sweeps power was relatively low (<0.2). Even if such tests are underpowered, the high number of populations, some of which are from the same ancestry (i.e., CEU and GBR being Caucasian, and CHB and CHS being Asian) would likely have detected selection should a hard sweep at this locus be occurring. Moreover, it is reasonable to assume that if the 482Ser allele were a genuine thrifty gene variant, selection would be occurring in all populations rather than being restricted to the Polynesian populations, as most ancestral populations would have experienced periodic limitations in food availability. For iHS, Ferrer-Admetlla et al. [27] reported power was generally high (>0.75) irrespective of whether hard or soft sweeps had occurred, and was similarly robust to different ending allele frequencieswhich ranged from 0.5 to 0.9. The derived 482Ser allele frequency estimates in all of the Polynesian sample sets investigated in this study fall within this range.
To provide further insights into whether or not the lack of evidence of selection according to these statistics was due to inadequate sample sizes, we also analysed 14 population/gene combinations that had previously been identified as showing evidence of selection [18]. In our population samples, seven of these 14 combinations had >1 SNP exceeding the 1% iHS threshold, demonstrating that our methods could identify loci previously shown to be under selection (Additional file 1: Table S1). Moreover, in all of the seven cases, multiple SNPs exceeded this threshold, and for two genes, there was evidence of selection in both population samples representing the same ancestry (i.e., evidence of selection in both CEU and GBR populations for LCT, and in both CHB and CHS for SLC445A5, see Additional file 1: Table S1). This indicates that population samples of this size have approximately 50% power to detect an association in the 1000 Genomes populationsnone of which were casecontrol studies -and that when selection is present, there should be multiple SNPs exceeding the threshold. Based on the above results, we therefore expect approximately (1-0.5 2 ) ≈ 0.75 probability of at least one of the Samoan and Māori populations used in this study to show evidence of selection using the iHS test. However the (case-control) Samoan and Māori cohorts had (only) one and zero SNPs, respectively, exceeding the 1% iHS threshold. We also contend that, if anything, the use of a case-control design would be more likely to increase the frequency of haplotypes carrying the selected allele(s), and thus more likely to improve power should this locus be subject to selection. Therefore the lack of a statistically significant difference in 482Ser allele frequencies in the case control Māori cohort used in this study and the previously reported estimate by Myles et al. [5] from a cross-sectional study of a single Māori tribal group located in the East Coast of the North Island of NZ, is further evidence of the lack of evidence of natural selection at this locus. A further consideration is the elevated derived 482Ser allele frequency in all non-African populations compared to the African populations [4,5]. These show a progressive differentiation of populations at this locus. Thus, if PPARGC1A were a thrifty gene candidate that has been subjected to natural selection, selection would likely have occurred in all non-African ancestral populations due to factors such as climatic extremes, competition with other human groups, and/or the need to adapt to newly colonised environments -all of which are likely to contribute to periodic deprivations in food availability in progenitors of all populations studied.
The highest derived allele frequencies and strongest differentiation from extant African populations occurs in the Cook Islands and NZ Māori populations. Therefore if this locus is indeed a thrifty gene, recent migration histories would suggest that these populations would be likely to show evidence of association and/or selection as ancestors of these populations colonised East Polynesia less than 1500 years ago [32]. Thus we would expect repeated evidence of selection at this locus in most -if not all -of the populations examined. However, the reverse is the case: there is no strong evidence of positive selective at this locus according the iHS statistic, nor Tajima's D nor Fay and Wu's H. Similarly, for the NZ Māori population, evidence of possible differential selection indicated by Tajima's D in the PPARGC1A-encoding region exceeding the lower 97.5% quantile, (Additional file 2: Figure S1a-c) was not reflected in either Fay and Wu's H (Additional file 3: Figure S2a-c) or iHS (Fig. 1). Moreover, if selection on the derived 482Ser allele were occurring then this would be revealed by XP-EHH -for at least some population pairs. However, none of the XP-EHH exceeded threshold 1% values in any of the population pairs, nor were there any obvious differences in XP-EHH between pairs of populations that share ancestry (i.e., the CEU -GBR pair, and the CHB -CHS pair) compared to population pairs with different ancestries. Further, differences in derived 482Ser allele frequency within Polynesiaparticularly those between Western Polynesian (i.e., Tongan and Samoan) and Eastern Polynesian (i.e., NZ and Cook Islands Māori)are not explained by oceanic voyaging: the geographic distances between Tonga/Samoa and the Cook Islands is significantly less than the Cook Islands and NZ, yet the Cook Islands Māori 482Ser frequency differs from Samoa and Tonga and is virtually identical to NZ Māori. These differences are also reflected in the F ST values (Table 8).
In addition, oral histories of Māori migrations make no reference to extensive loss of life on vaka/waka due to starvation during migration voyages [32], but do describe decisions by tribal groups, whose members were generally closely related (e.g., [33]), to migrate due to food competition and/or ongoing conflicts with other tribal groups -effectively increasing the possibility of genetic drift. We therefore conclude that the lack of evidence for selection in any of the statistical tests used in any of the populations examined in this study is likely an accurate biological reflection for the populations examined rather than a lack of statistical power to detect selective events at this locus with the methods used.
Existing 482Ser allele frequency distributions in the Pacific can be explained by a combination of (a) migration out of Africa by Homo sapiens progenitors and possibly Denisovians who either lacked the 482Ser allele or lost it via genetic drift (whose descendants include modern-day Melanesians such as Papuans), (b) later migration out of Africa by Homo sapiens progenitors of Polynesians subsequent to the mutation giving rise to the 482Ser allele, which drifted to increasingly higher frequencies in repeated migrations across the Pacific, and (c) followed by subsequent admixture between these Melanesian progenitors with ancestors of modern Polynesians [34]. The higher derived allele frequencies and genetic differentiation in Eastern Polynesians are likely the result of genetic drift in the ancestral population, possibly as a result of founder effects. This model is consistent with the absence of the 482Ser allele in Denisovian DNA sequences; hence any derived populations with Melanesian admixturesuch as Tongan populationswould have lower frequencies than in Eastern Polynesian populations such as Cook Islands and NZ Māori. Genetic drift therefore could account for the progressive increase in allele frequency in modern non-African human populations, with founder effects arising from successive colonisation of islands within Polynesia resulting in the increase in 482Ser allele frequency.
Single marker regression analyses did not reveal any evidence of association between rs8192678 and BMI, gout affection status, or T2D in any of the populations investigated. Although each Polynesian population was small and likely to be underpowered for validating associations between small effect genes and conditions, no associations were found even when Polynesian populations were combined. We also found no evidence for sex-specific effects for any of the three traits in the combined Polynesian populations (data not shown). Our results contrast with those of Myles et al. [4] who reported associations for Tongan populations (n = 184, P = 0.014-0.037), but are consistent with the lack of association in a NZ Māori tribe (Ngāti Rakaipaaka, n = 110, P > 0.8) that was also reported by Myles et al. [4] as well as a small Tongan population reported by Kimura et al. [14]. The results from these multiple studies when considered together provide no repeated evidence for a functional role of this locus for traits such as T2D, BMI or gout in extant Māori or Samoan populations. However, it is possible that non-additive interaction with unmeasured environmental exposures may obscure evidence for main effect association of Gly482Ser with metabolic phenotypes.
Based on the lack of evidence of association between the Gly482Ser variant with BMI and correlated diseases, as well as lack of evidence of association with any of the tests for selection, we conclude that this study does not support natural selection at either the Gly482Ser variant, or the PPARGC1A region in general. More extreme frequencies of the derived allele in the Polynesian populations can alternatively be explained by genetic drift associated with ancestral population bottlenecks during colonisation. Further, the lack of robust direct candidate gene-based evidence for the thrifty gene hypothesis in light of increasing evidence for selective influence of infectious disease-causing agents on genome composition [35] and other factors such as diet, responses to climate, and skin colour collectively increase doubt regarding ongoing validity of this hypothesis. This is supported by no global of evidence for selection at 65 T2D loci with nominal evidence for selection at individual loci driven by an equal measure of T2D protective and risk haplotypes [31].

Additional files
Additional file 1: Table S1. Mbp is shown with chromosome mean (blue) and 2.5%, 97.5% quantiles (purple) from Table 4 marked. Location of PPARGC1A is marked in red. Rs8192678 is marked by a red dashed line. Figure S1b Tajima's D calculated across chromosome 4 using a 5 kbp sliding window by population. Chromosome 4:22.7-24.9 Mbp is shown with chromosome mean (purple) and 2.5%, 97.5% quantiles (purple) from Table 4 marked. Location of PPARGC1A is marked in red. Rs8192678 is marked by a red dashed line. Figure S1c Tajima's D calculated across chromosome 4 using a 1 kbp sliding window by population. Chromosome 4:22.7-24.9 Mbp is shown with chromosome mean (blue) and 2.5%, 97.5% quantiles (purple) from Table 4  Mbp is shown with chromosome mean (blue) and 2.5%, 97.5% quantiles (purple) from Table 4 marked. Location of PPARGC1A is marked in red. Rs8192678 is marked by a red dashed line. Figure S2b Fay and Wu's H calculated across chromosome 4 using a 5 kbp sliding window. Chromosome 4:22.7-24.9 Mbp is shown with chromosome mean (blue) and 2.5%, 97.5% quantiles (purple) from Table 4 marked. Location of PPARGC1A is marked in red. Rs8192678 is marked by a red dashed line. Figure S2c Fay and Wu's H calculated across chromosome 4 using a 1 kbp sliding window. Chromosome 4:22.7-24.9 Mbp is shown with chromosome mean (blue) and 2.5%, 97.5% quantiles (purple) from Table 4