Polymorphism in microRNA-binding site in HNF1B influences the susceptibility of type 2 diabetes mellitus: a population based case–control study

Background Recent genome-wide association studies (GWAS) have identified many SNPs associated with type 2 diabetes mellitus (T2DM). However, the functional roles for most of the SNPs have not been elucidated. MicroRNAs (miRNAs) are key regulators of gene expression involved in the development and progression of various diseases including T2DM. In this study, we investigated whether commonly occurring SNPs modulate miRNA-directed regulation of gene expression, and whether such SNPs in miRNA-binding sites are associated with the susceptibility for T2DM. Methods Genotypes of eleven 3′ untranslated region (UTR) SNPs of seven susceptibility genes for T2DM were determined in 353 T2DM patients and 448 control subjects. In addition, the interactions of miRNAs with the 3′UTR in the hepatocyte nuclear factor 1β (HNF1B) gene were investigated using luciferase reporter assays. Results One 3′UTR SNP (rs2229295) in the HNF1B gene was significantly associated with T2DM, and the frequency of an A allele (rs2229295) in T2DM patients was decreased compared with that in controls. Luciferase reporter assays showed that the SNP (rs2229295) altered the binding of two miRNAs (hsa-miR-214-5p and hsa-miR-550a-5p). Conclusions We have detected the interactions of hsa-miR-214-5p/hsa-miR-550a-5p and the 3′UTR SNP of the HNF1B gene by in vitro luciferase reporter assays, and propose that the binding of such miRNAs regulates the expression of the HNF1B gene and the susceptibility of T2DM. Electronic supplementary material The online version of this article (doi:10.1186/s12881-015-0219-5) contains supplementary material, which is available to authorized users.


Background
Type 2 diabetes mellitus (T2DM) is a common heterogeneous and complex disease that is characterized by hyperglycemia resulting from impaired pancreatic β-cell function and a decreased action of insulin on target tissues. A combination of multiple genetic and environmental factors is considered to contribute to the pathogenesis of this disease. Patients with T2DM are at greater risk of developing cardiovascular diseases, renal failure, neurological conditions, and retinopathy [1][2][3] Recent genome-wide association studies (GWAS) have successfully identified over 65 susceptibility loci associated with T2DM and related metabolic traits [4][5][6]. GWAS have been a powerful approach to identify single nucleotide polymorphisms (SNPs) associated with disease risk. However, most of the SNPs in susceptibility genes for T2DM identified in previous studies were located within non-translated regions, such as introns, 3′untranslated regions (3′UTRs), and 5′UTRs. Therefore, functional roles for many of the SNPs in susceptibility genes have not been elucidated.
MicroRNAs (miRNAs) are endogenous noncoding RNAs (19)(20)(21)(22)(23)(24)(25) nucleotides in length) that induce the translational repression and degradation of target mRNAs by complementarily binding to their 3′UTR [7]. By silencing their target gene expression, miRNAs are involved in a variety of biological processes, as well as the development and progression of human diseases including cancer and T2DM [8][9][10][11][12][13]. Previous studies showed that SNPs within or proximal to miRNAbinding sites in target genes have the potential to either create or destroy binding sites, which affects the efficiency of miRNA binding on target sites. Thus, SNPs in miRNA-binding sites may modulate expression and protein levels of target genes, and ultimately contribute to phenotypic variations, including disease susceptibility and important traits [9][10][11]14].
In this study, we investigated whether commonly occurring SNPs modulate miRNA-directed regulation of gene expression, and whether such SNPs in miRNA-binding sites are associated with the susceptibility for T2DM.

Subjects
The participants recruited for this study were Japanese who underwent a routine medical check-up at a medical center near the University of Shizuoka. We selected men under 65 years of age as subjects in this study. The case subjects with T2DM (n = 353) were diagnosed as T2DM by physicians according to the World Health Organization (WHO) diagnostic criteria for T2DM [15]. Of these, 251 T2DM patients (71.1 %) were under oral medication for diabetes. The control subjects (n = 448) were randomly selected according to the following criteria to exclude persons with potential glucose intolerance: (1) fasting plasma glucose levels were under 100 mg/dL (5.6 mmol/L), and (2) HbA1c levels were under 6.2 %. All subjects provided written informed consent to participate in this study, and the study was approved by the Ethics Committee of the University of Shizuoka (Approval No.  After overnight fasting, blood was collected from each subject. The clinical characteristics of the subjects were determined according to the medical check-up protocol (Table 1).
Genomic DNA was isolated from peripheral leukocytes by the phenol extraction method. The genotypes of the SNPs were determined for each subject using the PCR-restriction fragment length polymorphism method.

HNF1B 3′UTR reporter gene construction
Two SNPs (rs2229295 C > A, rs1800929 A > G) lie next to each other in the microRNA binding sites in the 3′UTR of the hepatocyte nuclear factor 1B (HNF1B) gene. The HNF1B 3′UTR (920 bp) was amplified using PrimeSTAR® HS DNA Polymerase (Takara Bio Inc., Otsu, Japan) from the genomic DNA of the homozygote for major alleles of the two SNPs (C for rs2229295, A for rs1800929). The primer sequences are listed in Additional file 1.

Statistical analyses
The associations of genotypes of the eleven 3′ UTR SNPs in seven T2DM susceptibility genes and T2DM were examined. The genotype specific odds ratios (ORs) with 95 % confidence intervals (CIs) and p-values for T2DM were calculated using logistic regression analysis, adjusting for age and BMI.
In the luciferase reporter assay, the differences in the luciferase activity between four kinds of constructs (CA, CG, AA and AG) were examined by Tukey-Kramer multiple comparisons test. All statistical analyses were performed using the JMP software package (SAS Institute, Cary, NC, USA). The power to detect an association between each SNP and T2DM was estimated under current sample size and minor allele frequency observed in this study using "Quanto" [26], assuming OR = 1.2, α level = 0.05 (one-sided), and additive model.
For association between T2DM and each SNP, p < 0.0045 (0.05/11) was considered as significant by applying a Bonferroni correction.

Results
We analyzed the relationships between T2DM and genotypes of eleven 3′UTR SNPs in seven T2DM susceptibility genes that were previously detected by GWAS. The genotype distributions of these 11 SNPs were in Hardy-Weinberg equilibrium (P > 0.05). Table 2 shows the associations between T2DM and these SNPs. The ORs and p-values were adjusted for age and BMI in logistic regression analysis. One 3′UTR SNP (rs2229295) in the HNF1B gene was significantly associated with T2DM, and the frequency of CA and AA genotypes of rs2229295 in T2DM patients was decreased compared with that in controls (OR = 0.66 (95 % CI: 0.50-0.88), 0.44 (95 % CI: 0.25-0.77), respectively)) ( Table 2). These data indicate that the A allele of 3′UTR SNP (rs2229295) in the HNF1B gene can be a protective allele for T2DM. The other ten 3′UTR SNPs in the susceptibility genes were not associated with T2DM.
To investigate the functional impact of the SNP (rs2229295) in the HNF1B gene, we next searched miR-NAs whose binding could be affected by the base substitution due to this SNP (rs2229295) using online databases (MirSNP, PolymiRTS, and miRNASNP). We identified four candidate miRNAs whose seed sequences correspond with complementary sequences around the SNP (rs2229295) (Fig. 1). In this region, two SNPs (rs2229295 C > A, rs1800929 A > G) are located next to each other. In addition, the seed sequences of these four miRNA contain complementary sequences to the minor alleles of two SNPs (A for rs2229295, G for rs1800929) of the HNF1B gene (Fig. 1).
Next, we tested whether the binding of these four miRNAs to the 3′UTR of the HNF1B gene was affected Fig. 1 Predicted miRNAs whose binding are possibly affected by the base substitutions due to SNPs r22229295 and rs1800929. The four miRNAs were predicted as candidate miRNAs in at least two of three online databases (MirSNP, PolymiRTS, and miRNASNP) [20][21][22][23][24][25]. Seed sequences of each miRNA were indicated by bold. The complemetary sequences of 3′UTR of the HNF1B gene were shown by underlined. The red color showed sites for SNPs (rs2229295 and rs1800929) by the two SNPs. We generated four kinds of luciferase reporter constructs and one reference construct as described in Methods (Fig. 2a). The constructs were each co-transfected in parallel with the four predicted candidate miRNA mimics into HEK293 cells, and luciferase activity was compared. When hsa-miR-214-5p or hsa-miR-550a-5p mimics were co-transfected with the reporter construct, significant suppression of luciferase activity was observed in constructs containing AA or AG sequences for the two SNPs (rs2229295, rs1800929) compared with the construct containing CA sequence, which presumably does not bind miRNAs (Fig. 2b). When the other two miRNA mimics (hsa-miR-550a-3-5p, hsa-miR-1271-3p) were co-transfected with each reporter construct, there were no differences in luciferase activity among reporter construct (Additional file 2). Furthermore, there were no differences in luciferase activities among reporter constructs when they were transfected into HEK293 cells without miRNA mimics (Additional file 3).
These data indicate that the substitution of C > A due to SNP (rs2229295) induces a decrease of luciferase activity.
However, A > G substitution due to SNP (rs1800929) did not affect luciferase activity. The results of luciferase reporter assays showed that the SNP (rs2229295) actually alters the binding of two miRNAs (hsa-miR-214-5p and hsa-miR-550a-5p), and A allele carrying constructs were specifically regulated by the two miRNAs, while the adjacent SNP (rs1800929) did not affect the binding of the miRNAs to HNF1B 3′UTR.

Discussion
Previous studies have demonstrated that genetic variations within miRNA-binding sites could modulate gene expression and protein levels, and affect phenotypes or cause disease [8][9][10]. In this study, we identified an SNP (rs2229295) in the 3′UTR of the HNF1B gene that could affect miRNA binding and that was associated with the risk of T2DM. Two SNPs (rs2229295, rs1800929) lie next to each other in this region. In silico analysis predicted that substitutions C > A in rs2229295 and A > G in rs1800929 create a new potential miRNA-binding site Fig. 2 Effect of the base substitutions due to SNPs rs2229295 and rs1800929 on miRNA binding. a Schematic representation of reporter constructs used in the luciferase reporter assay. Plasmid construct containing TC sequence, which was selected randomly, was used as a reference. Major allele (C for rs2229295) is shown in blue and minor allele (A for rs2229295) is shown in red. b Relative luciferase activity of each reporter construct. Luciferase activity was normalized to Renilla luciferase levels. Luciferase activities relative to the reference vector (TC vector) are shown as mean ± S.E. from three independent transfection experiments with triplicate assays. The luciferase activities among four constructs were compared using the Turkey-Kramer method (*p < 0.05, **p < 0.01) in the 3′UTR of the HNF1B gene (Fig. 1). It was ascertained one SNP (C > A in rs2229295) could affect the binding of two miRNAs (hsa-miR-214-5p, hsa-miR-550a-5p) by luciferase reporter assay. However, the other SNP (rs1800929) and two miRNAs (hsa-miR-550a-3-5p, hsa-miR-1271-3p) did not influence the luciferase activity. Many potential miRNA target sites can be predicted in 3′ UTRs of many genes by in silico analysis. However, the binding of miRNAs and target genes have considerable flexibility and therefore in silico analysis is not sufficient to define 3′UTR SNPs related to susceptibility of common diseases.
Some GWAS revealed that several tag SNPs in the HNF1B gene were associated with the susceptibility of T2DM, and such associations were well replicated in many countries [36][37][38]. However, the SNP (rs 2229295) that was associated with the risk of T2DM in this study was not a tag SNP for the HNF1B gene. There is no report for the association of this SNP (rs 2229295) and T2DM. We could not observe significant linkage disequilibrium (LD) between the SNP (rs 2229295) and a tag SNP (rs7501939) of the HNF1B gene (Additional file 4).
Recently, Kornfeld and colleagues found that obesityinduced overexpression of miR-802 causes glucose intolerance, impairs insulin signaling, and promotes hepatic gluconeogenesis in the liver through direct silencing of HNF1B, and showed an important role for HNF1B in the control of hepatic insulin sensitivity and glucose metabolism in vivo [39].
We have detected the interactions of hsa-miR-214-5p/ hsa-miR-550a-5p and the 3′UTR of the HNF1B gene by in vitro luciferase reporter assays, and our results suggest that binding of hsa-miR-214-5p and hsa-miR-550a-5p may also regulate the expression of the HNF1B gene. Unfortunately, we could not examine the interactions between such miRNAs and the endogenous HNF1B gene. Because the genomic sequence of miRNA binding site of the HNF1B gene in HEK293 cells that we used in this study is C (rs2229295), this sequence does not bind hsa-miR-214-5p and hsa-miR-550a-5p. Furthermore, we have no data as to whether HNF1B mRNA and/or protein levels in vivo are affected by the genotype of the SNP (rs2229295).
The miR214 gene is located in an intronic region of the Dynamin-3 gene on human chromosome 1q24.3, and is expressed in the liver, kidney, pancreas, and osteoblasts involved in the development of pancreas and bone [40,41]. The miR-550 gene is located in the intronic region of the Znrf2 gene on human chromosome 7p14.3, and expressed in multiple cancers including hepatocellular carcinoma [42]. However, there is little information regarding the function and regulation of expression of miR-550 in normal cells and tissues. We need to know how the expressions of hsa-miR-214-5p and hsa-miR-550a-5p are regulated in vivo.
In this study, we found the possibility that the binding of two miRNAs to the 3′UTR of the HNF1B gene provided the protective effect for T2DM. In most patients with MODY5, the clinical phenotypes may be related to loss of function or dominant-negative mechanisms for HNF1B [28,[32][33][34][35]. However, a previous study reported a mutation that showed a gain-of function phenotype with increased transcript activity of the HNF1B gene [43]. Important roles of HNF1B for complex transcriptional networks in pancreatic β-cells and hepatocytes have been established [35,44,45]. There is a possibility that the dysregulated expression of the HNF1B gene due to nucleotide changes within the miRNA-binding site would lead to impair transcriptional networks related to HNF1B and the differences of susceptibility for T2DM. Further experiments are needed to ascertain roles for hsa-miR-214-5p and hsa-miR-550a-5p and HNF1B-dependent regulation of insulin secretion, glucose metabolism in vivo.

Conclusions
In this study, we found the 3′UTR SNP (rs2229295) in the HNF1B gene was associated with the susceptibility of T2DM. In addition, luciferase reporter assays indicate that the substitution of C > A due to SNP (rs2229295) induces the binding of hsa-miR-214-5p/hsa-miR-550a-5p to the 3′UTR of the HNF1B gene.
There is a possibility that the dysregulated expression of the HNF1B gene due to nucleotide changes within miRNA binding site lead the difference of susceptibility for T2DM.

Additional files
Additional file 1: Table S1. PCR primers used for subcloning and introduction of nucleotide changes in 3′UTR of HNF1B. (XLSX 11 kb) Additional file 2: Figure S1. Effect of miRNA (A:hsa-miR-550a-3-5p, B: hsa-miR1271-3p) binding to reporter constructs. There was no significant difference in luciferase activities among constructs containing CA, AA, or AG sequences (for SNP rs2229295 and rs1800929). Luciferase activities relative to reference vector (TC vector) were shown as mean ± S.E. from 3 independent transfection experiments with triplicate assays. The comparisons of luciferase activity among four constructs were using Turkey-Kramer method. (PDF 10 kb)