Identification of transcription factors and single nucleotide polymorphisms of Lrh1 and its homologous genes in Lrh1-knockout pancreas of mice
BMC Medical Genetics volume 15, Article number: 43 (2014)
To identify transcription factors (TFs) and single nucleotide polymorphisms (SNPs) of Lrh1 (also named Nr5a2) and its homologous genes in Lrh1-knockout pancreas of mice.
The RNA-Seq data GSE34030 were downloaded from the Gene Expression Omnibus (GEO) database, including 2 Lrh1 pancreas knockout samples and 2 wild type samples. All reads were processed with the TopHat and Cufflinks packages to calculate gene-expression levels. Then, the differentially expressed genes (DEGs) were identified via the non-parametric NOISeq method in an R package, and the homologous genes of Lrh1 among them were identified via BLASTN analysis. Furthermore, the TFs of Lrh1 and its homologous genes were selected based on the TRANSFAC database. Additionally, the SNPs were analyzed via the SAM tool to record the locations of mutant sites.
A total of 15683 DEGs were identified, of which 23 were Lrh1 homologous genes (3 up-regulated and 20 down-regulated). Fetoprotein TF (FTF) was the only TF of Lrh1 identified and the promoter-binding factor of FTF was CYP7A. The SNP annotations of Lrh1 homologous genes showed that 92% of the mutation sites occurred in intronic and upstream regions. Three SNPs of Lrh1 were located in introns, while 1819 SNPs of Phkb were located in introns and 1343 SNPs were located in the upstream region.
FTF combined with CYP7A might play an important role in the Lrh1-regulated pancreas-specific transcriptional network. Furthermore, the SNP analysis of Lrh1 and its homologous genes provided candidate mutant sites that might affect the Lrh1-related production and secretion of pancreatic fluid.
The pancreas is both an endocrine gland, producing insulin, glucagon, somatostatin, and pancreatic polypeptide, and an exocrine gland; the exocrine portion accounts for more than 98% of the pancreas and secretes pancreatic juice containing digestive enzymes. These digestive enzymes help to further break down the carbohydrates, proteins and lipids in the chyme and thus support the digestion and absorption of nutrients in the small intestine. In the past decades, many studies have focused on target genes and transcription factors (TFs) involved in the exocrine pancreas-specific transcriptional networks that are required for the production and secretion of pancreatic fluid. Currently, many exocrine pancreas-specific genes and transcription factors have been identified, which may promote the understanding of the effect of the exocrine pancreas on the digestive system.
Liver receptor homolog-1 (Lrh1; also called Nr5a2) is a nuclear receptor of the ligand-activated transcription factor family that acts in liver by binding as a monomer to DNA sequence elements with the consensus sequence 5′-Py-CAAGGPyCPu-3′. It has been suggested that Lrh1 is progressively expressed in both the endocrine and exocrine pancreas. Baquié M et al. have found that Lrh1 is expressed in human islets and protects β-cells against stress-induced apoptosis, an effect that may be mediated by increased glucocorticoid production that blunts the pro-inflammatory response of islets. Meanwhile, Fayard E et al. have demonstrated that Lrh1 and CEL (encoding carboxyl ester lipase) are co-expressed and confined to the exocrine pancreas. The identification of CEL as an Lrh1 target gene indicates that Lrh1 plays an important role in enterohepatic cholesterol homeostasis associated with the absorption of cholesteryl esters and the assembly of lipoproteins by the intestine. Besides, Lrh1 is a downstream target in the regulatory cascade of PDX-1 (whose deficiency leads to pancreas agenesis), which is activated only during early stages of pancreas development and governs pancreatic development, differentiation and function.
Recently, the rapid advent of next-generation sequencing has made this technology broadly available to researchers in various molecular and cellular biological fields. Holmstrom SR et al. have determined the cistrome and transcriptome for the nuclear receptor LRH-1 in exocrine pancreas and, based on chromatin immunoprecipitation (ChIP)-seq and RNA-seq analyses, revealed that Lrh1 directly induces expression of genes encoding digestive enzymes and secretory and mitochondrial proteins. Besides, Lrh1 cooperates with the pancreas transcription factor 1-L complex (PTF1-L) in the regulation of exocrine pancreas-specific gene expression. However, many potential target genes and TFs of Lrh1 have not yet been revealed from the RNA-seq data.
In the present study, we downloaded the raw RNA-seq data of Holmstrom SR et al. deposited in the National Center for Biotechnology Information (NCBI) database and analyzed them using multiple bioinformatics tools with the aim of finding specific TFs of Lrh1 and its homologous genes. Additionally, we annotated the SNPs of Lrh1 and its homologous genes to predict their mutant sites. Our study might improve the understanding of the regulatory network of Lrh1-related production and secretion of pancreatic fluid.
RNA-seq data acquisition
The RNA-seq data were downloaded from the NCBI (http://www.ncbi.nlm.nih.gov/) Gene Expression Omnibus (GEO) database (GEO accession: GSE34030), including 2 Lrh1 pancreas knockout samples and 2 wild type samples. RNA preparations were subjected to the Illumina RNA-seq protocol and the platform was GPL9185.
Data pre-processing, gene expression and homologous genes of Lrh1
The raw data were downloaded from the SRA (Sequence Read Archive) of NCBI and then converted to fastq reads using the fastq-dump program of the NCBI SRA Toolkit (−q 64) (http://trace.ncbi.nlm.nih.gov/Traces/sra/sra.cgi?view=std). Then, these reads were processed with the TopHat and Cufflinks packages to calculate gene-expression levels. All parameters were left at the default settings of TopHat and Cufflinks. The DEGs were identified via the non-parametric NOISeq method in an R package. The threshold was a False Discovery Rate (FDR) < 0.001. BLASTN analysis [13, 14] of the selected DEGs was used to identify the homologous genes of Lrh1. Homologous genes here refer to paralogous genes that share a high degree of sequence similarity with Lrh1 in mice (maximum expectation value was set to e−5).
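The homology filter described above can be sketched in a few lines. This is a minimal illustration, not the authors' script: it assumes the BLASTN hits are available in the standard 12-column tabular format, reads the stated cutoff as an E-value of 1e-5, and uses made-up gene IDs and scores (`geneA`, `geneB`). The function name `homolog_ids` is hypothetical.

```python
from typing import Iterable, Set

# Column positions in BLAST 12-column tabular output; this format is an
# assumption, since the paper does not state which output format was used.
SUBJECT_COL = 1
EVALUE_COL = 10


def homolog_ids(blast_lines: Iterable[str], query_id: str,
                max_evalue: float = 1e-5) -> Set[str]:
    """Return subject IDs whose hit against query_id has E-value <= max_evalue."""
    hits: Set[str] = set()
    for line in blast_lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.split("\t")
        if len(fields) < 12 or fields[0] != query_id:
            continue  # skip malformed rows and hits for other queries
        subject = fields[SUBJECT_COL]
        if subject == query_id:
            continue  # ignore the query's hit against itself
        if float(fields[EVALUE_COL]) <= max_evalue:
            hits.add(subject)
    return hits


# Toy records with made-up gene IDs and scores, for illustration only.
example = [
    "Nr5a2\tNr5a2\t100.0\t500\t0\t0\t1\t500\t1\t500\t0.0\t900",
    "Nr5a2\tgeneA\t85.0\t300\t45\t0\t1\t300\t1\t300\t2e-40\t250",
    "Nr5a2\tgeneB\t70.0\t60\t18\t0\t1\t60\t1\t60\t0.5\t30",
]
print(sorted(homolog_ids(example, "Nr5a2")))
```

In this toy input only `geneA` passes, because the hit to `geneB` has an E-value above the cutoff and the self-hit is skipped.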
Function annotation of Lrh1 homologous genes
For functional analysis of Lrh1 homologous genes, DAVID (Database for Annotation, Visualization and Integrated Discovery) was used for Gene Ontology (GO) function and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis.
Transcription factor (TF) of Lrh1 homologous genes
Using the TRANSFAC database, the TFs that regulate the transcription of Lrh1 and its homologous genes were identified. Then, the promoter-binding factors regulated by the selected TFs were analyzed based on the website (http://www.nursa.org/molecule.cfm?molType=receptor&molId=5A2).
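The idea behind term enrichment can be illustrated with a plain hypergeometric upper-tail test. This sketch is not DAVID's own scoring (DAVID reports a modified Fisher exact statistic); it only shows the underlying question of whether a term appears in a gene list more often than expected by chance. The function name and all counts used with it are hypothetical.

```python
from math import comb


def hypergeom_upper_tail(population: int, annotated: int,
                         sample: int, observed: int) -> float:
    """P(X >= observed) when `sample` genes are drawn without replacement
    from `population` genes, of which `annotated` carry the term."""
    total = comb(population, sample)
    upper = min(annotated, sample)
    # math.comb returns 0 when k exceeds n, so impossible draws add nothing.
    favourable = sum(
        comb(annotated, k) * comb(population - annotated, sample - k)
        for k in range(observed, upper + 1)
    )
    return favourable / total


# Toy numbers: 10 genes in the background, 3 carry the term,
# 4 genes in the list of interest, 2 of them carry the term.
p_value = hypergeom_upper_tail(population=10, annotated=3, sample=4, observed=2)
print(p_value)
```

A small p-value would suggest that the term is over-represented in the list; with real data the background would be the annotated mouse genome and the list would be the Lrh1 homologous genes.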
Screening of SNPs
The fastq reads were mapped to marker sequences using bowtie, and SNPs were called from the aligned reads using the SAM tool. In order to minimize the risk of false-positive SNP calls, calls were retained only if the ID was “*” with quality > 50, or the ID was not “*” with quality > 20. These SNPs were annotated via SnpEff to categorize the effects of variants in genome sequences. The identified SNPs were searched in the dbSNP database to distinguish disease-associated SNPs from de novo discovered SNPs.
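The two-tier quality rule stated above can be written as a small predicate. This is a sketch under assumptions: the record layout (`Call` with `position`, `ident`, `quality`) is hypothetical and does not reproduce any particular variant-caller output format; only the thresholds come from the text.

```python
from typing import List, NamedTuple


class Call(NamedTuple):
    """Hypothetical variant-call record; field names are illustrative."""
    position: int
    ident: str      # the "ID" field from the text; "*" marks the stricter class
    quality: float


def passes_quality(call: Call, star_min: float = 50.0,
                   other_min: float = 20.0) -> bool:
    """Keep a call if its quality exceeds the threshold for its ID class."""
    threshold = star_min if call.ident == "*" else other_min
    return call.quality > threshold


def filter_calls(calls: List[Call]) -> List[Call]:
    """Return only the calls that satisfy the two-tier rule."""
    return [c for c in calls if passes_quality(c)]


# Made-up calls, for illustration only.
calls = [
    Call(101, "*", 60.0),   # kept: "*" and quality > 50
    Call(202, "*", 50.0),   # dropped: not strictly greater than 50
    Call(303, "A", 25.0),   # kept: not "*" and quality > 20
    Call(404, "G", 15.0),   # dropped: quality too low
]
print([c.position for c in filter_calls(calls)])
```

The comparisons are strict, matching the "quality > 50" and "quality > 20" wording, so a call sitting exactly on a threshold is discarded.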
Identification and homology analysis of differentially expressed genes
After data processing, at FDR < 0.001, a total of 15683 DEGs were identified, including 10994 up-regulated and 4698 down-regulated genes. BLASTN analysis of the DEGs identified 23 Lrh1 homologous genes. Among them, 3 were up-regulated and 20 were down-regulated (Table 1).
Function and pathway annotation of Lrh1 homologous genes
To determine the function of Lrh1 homologous genes in pancreas, GO enrichment analysis and KEGG pathway enrichment analysis were applied to the up- and down-regulated Lrh1 homologous genes. In the function annotation, the genes were enriched in hexose metabolic process and monosaccharide metabolic process, both of which are involved in glycometabolism (Figure 1). Meanwhile, KEGG pathway enrichment analysis identified the insulin signaling pathway, indicating that the disorders of glycometabolism might result from insulin resistance and/or altered insulin secretion (Figure 2). PHKB, an Lrh1 homologous gene, was found to participate in both GO terms (hexose metabolic process and monosaccharide metabolic process) and in the KEGG insulin signaling pathway.
Potential TFs of Lrh1 homologous genes
Fetoprotein transcription factor (FTF) (ID: T04754) was the only TF of Lrh1 identified based on the TRANSFAC database. Meanwhile, the promoter-binding factor of Lrh1 was CYP7A (cholesterol 7α-hydroxylase).
SNPs of Lrh1 homologous genes
The annotation of SNPs of Lrh1 homologous genes showed that the majority of SNPs were located in intronic and upstream regions, accounting for nearly 92% of all SNPs (Tables 2 and 3). Three SNPs of Lrh1 were distributed in introns. Meanwhile, a total of 1819 SNPs of Phkb were located in introns and 1343 SNPs were located in the upstream region of Phkb.
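A share such as the roughly 92% reported above is simply the fraction of annotated SNPs whose region label falls in the categories of interest. The sketch below shows that arithmetic on made-up labels; the label strings and counts are hypothetical and do not reproduce the values in Tables 2 and 3 or the exact effect names emitted by SnpEff.

```python
from collections import Counter
from typing import Iterable, Tuple


def region_fraction(regions: Iterable[str],
                    of_interest: Tuple[str, ...] = ("intron", "upstream")) -> float:
    """Fraction of SNPs whose region label is one of `of_interest`."""
    counts = Counter(r.lower() for r in regions)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no SNP annotations supplied")
    return sum(counts[r] for r in of_interest) / total


# Made-up annotation labels, for illustration only.
toy_regions = ["intron"] * 6 + ["upstream"] * 3 + ["exon"]
print(region_fraction(toy_regions))
```

With the toy labels, 9 of 10 SNPs fall in intronic or upstream regions, giving a fraction of 0.9.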
In the present study, based on RNA-seq data of Lrh1-knockout pancreas samples, FTF was the only TF of Lrh1 identified in the TRANSFAC database, and it may regulate cholesterol catabolism into bile acids by activation of the promoter-binding factor CYP7A. Many studies have elucidated the function of Lrh1/Nr5a2/FTF/CYP7A experimentally [21–25].
FTF is highly expressed in the liver and intestine and is implicated in the regulation of cholesterol, bile acid and steroid hormone homeostasis. Nearly 50% of the body cholesterol is catabolized to bile acids via the bile acid biosynthetic pathway, in which cholic acid (hydroxylated at position 12) and chenodeoxycholic acid are the major primary bile acids and play an important role in cholesterol homeostasis. Chenodeoxycholic acid can repress FTF expression and is a more potent suppressor of HMG-CoA reductase and cholesterol 7α-hydroxylase/CYP7A1 (7α-hydroxylase) than cholic acid. It has been proposed that Lrh1, also known as CYP7A promoter-binding factor or FTF, is required for the transcription of the 7α-hydroxylase gene [19, 28]. The small heterodimer partner 1 (SHP), which acts downstream of the nuclear bile acid receptor FXR (farnesoid X receptor), can dimerize with FTF and diminish its activity on the 7α-hydroxylase promoter.
Although Lrh1 has been shown to function in the feedback regulation of CYP7A1 expression as part of the FXR-SHP-LRH-1 cascade, through which bile acids can inhibit their own synthesis, the mechanisms are not well understood. Out C et al. have suggested that CYP7A1 expression is increased rather than decreased under chow-fed conditions in Lrh1-knockdown mice, which coincides with a significant reduction in expression of intestinal Fgf15, a suppressor of CYP7A1. Besides, Noshiro M et al. have suggested that the circadian rhythm of CYP7A is regulated by multiple transcription factors, including DBP, REV-ERBα/β, LXRα, HNF4α, DEC2, E4BP4, and PPARα. Hepatocyte nuclear factor 4α (HNF4α) and FTF are two major TFs driving CYP7A1 promoter activity in lipid homeostasis. Bochkis IM et al. have shown that prospero-related homeobox (Prox1) directly interacts with both HNF4α and FTF and potently co-represses CYP7A1 transcription.
In the present study, we annotated the SNPs of Lrh1 and its homologous genes, showing that the majority were located in intronic and upstream regions. Quiles Romagosa MÁ has reported that a functional SNP located in the Lrh1 promoter is related to body mass index (BMI) and that such SNPs might play important roles in the obese phenotype. However, previous studies have mostly focused on SNPs associated with pancreatic cancer cell growth and proliferation. For example, a previous genome-wide association study identified five SNPs on 1q32.1 associated with pancreatic cancer that mapped to the Lrh1 gene and its upstream regulatory region.
In conclusion, FTF combined with CYP7A might play an important role in the Lrh1-regulated pancreas-specific transcriptional network. Furthermore, the SNP analysis of Lrh1 and its homologous genes provided candidate mutant sites that might affect the Lrh1-related production and secretion of pancreatic fluid. These common susceptibility loci for Lrh1 and its homologous genes need follow-up studies.
A total of 15683 DEGs were identified, of which 23 were Lrh1 homologous genes (3 up-regulated and 20 down-regulated).
Fetoprotein TF was the only TF of Lrh1 identified based on the TRANSFAC database, and the promoter-binding factor of fetoprotein TF was CYP7A.
The SNP annotations of Lrh1 homologous genes showed that 92% of mutation sites occurred in intronic and upstream regions. Three SNPs of Lrh1 were located in introns, while 1819 SNPs of Phkb were located in introns and 1343 SNPs were located in the upstream region.
Leung PS: Physiology of the pancreas. The Renin-Angiotensin System: Current Research Progress in The Pancreas: The RAS in the Pancreas, Volume 690. 2010, Netherlands: Springer, 13-27.
Whitcomb DC, Lowe ME: Human pancreatic digestive enzymes. Dig Dis Sci. 2007, 52: 1-17. 10.1007/s10620-006-9589-z.
Fernandez-Marcos PJ, Auwerx J, Schoonjans K: Emerging actions of the nuclear receptor LRH-1 in the gut. Biochimica et Biophysica Acta (BBA)-Molecular Basis of Disease. 2011, 1812: 947-955. 10.1016/j.bbadis.2010.12.010.
Rausa FM, Galarneau L, Bélanger L, Costa RH: The nuclear receptor fetoprotein transcription factor is coexpressed with its target gene HNF-3β in the developing murine liver intestine and pancreas. Mech Dev. 1999, 89: 185-188. 10.1016/S0925-4773(99)00209-9.
Baquié M, St-Onge L, Kerr-Conte J, Cobo-Vuilleumier N, Lorenzo PI, Moreno CMJ, Cederroth CR, Nef S, Borot S, Bosco D: The liver receptor homolog-1 (LRH-1) is expressed in human islets and protects β-cells against stress-induced apoptosis. Hum Mol Genet. 2011, 20: 2823-2833. 10.1093/hmg/ddr193.
Fayard E, Schoonjans K, Annicotte J-S, Auwerx J: Liver receptor homolog 1 controls the expression of carboxyl ester lipase. J Biol Chem. 2003, 278: 35725-35731.
Hui DY, Howles PN: Carboxyl ester lipase structure-function relationship and physiological role in lipoprotein metabolism and atherosclerosis. J Lipid Res. 2002, 43: 2017-2030. 10.1194/jlr.R200013-JLR200.
Annicotte J-S, Fayard E, Swift GH, Selander L, Edlund H, Tanaka T, Kodama T, Schoonjans K, Auwerx J: Pancreatic-duodenal homeobox 1 regulates expression of liver receptor homolog 1 during pancreas development. Mol Cell Biol. 2003, 23: 6713-6724. 10.1128/MCB.23.19.6713-6724.2003.
Holmstrom SR, Deering T, Swift GH, Poelwijk FJ, Mangelsdorf DJ, Kliewer SA, MacDonald RJ: LRH-1 and PTF1-L coregulate an exocrine pancreas-specific transcriptional network for digestive function. Genes Dev. 2011, 25: 1674-1679. 10.1101/gad.16860911.
Trapnell C, Pachter L, Salzberg SL: TopHat: discovering splice junctions with RNA-Seq. Bioinformatics. 2009, 25: 1105-1111. 10.1093/bioinformatics/btp120.
Trapnell C, Roberts A, Goff L, Pertea G, Kim D, Kelley DR, Pimentel H, Salzberg SL, Rinn JL, Pachter L: Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat Protoc. 2012, 7: 562-578. 10.1038/nprot.2012.016.
Robles JA, Qureshi SE, Stephen SJ, Wilson SR, Burden CJ, Taylor JM: Efficient experimental design and analysis strategies for the detection of differential expression using RNA-Sequencing. BMC Genomics. 2012, 13: 484. 10.1186/1471-2164-13-484.
Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997, 25: 3389-3402. 10.1093/nar/25.17.3389.
Crawford JE, Guelbeogo WM, Sanou A, Traoré A, Vernick KD, Sagnon NF, Lazzaro BP: De novo transcriptome sequencing in Anopheles funestus using Illumina RNA-seq technology. PLoS One. 2010, 5: e14202. 10.1371/journal.pone.0014202.
Huang DW, Sherman BT, Lempicki RA: Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2008, 4: 44-57. 10.1038/nprot.2008.211.
Hulsegge I, Kommadath A, Smits MA: Globaltest and GOEAST: Two different approaches for Gene Ontology Analysis. BMC Proceedings. 2009, 4: S10.
Matys V, Kel-Margoulis OV, Fricke E, Liebich I, Land S, Barre-Dirrie A, Reuter I, Chekmenev D, Krull M, Hornischer K: TRANSFAC® and its module TRANSCompel®: transcriptional gene regulation in eukaryotes. Nucleic Acids Res. 2006, 34: D108-D110. 10.1093/nar/gkj143.
Langmead B, Trapnell C, Pop M, Salzberg SL: Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 2009, 10: R25. 10.1186/gb-2009-10-3-r25.
Gerbod-Giannone M-C, del Castillo-Olivares A, Janciauskiene S, Gil G, Hylemon PB: Suppression of cholesterol 7α-hydroxylase transcription and bile acid synthesis by an α1-antitrypsin peptide via interaction with α1-fetoprotein transcription factor. J Biol Chem. 2002, 277: 42973-42980.
Cingolani P, Platts A, Coon M, Nguyen T, Wang L, Land SJ, Lu X, Ruden DM: A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly. 2012, 6: 80-92. 10.4161/fly.19695.
Mouzat K, Baron S, Marceau G, Caira F, Sapin V, Volle DH, Lumbroso S, Lobaccaro JM: Emerging roles for LXRs and LRH-1 in female reproduction. Molecular and cellular endocrinology. 2013, 368: 47-58. 10.1016/j.mce.2012.06.009.
Falender AE, Lanz R, Malenfant D, Belanger L, Richards JS: Differential expression of steroidogenic factor-1 and FTF/LRH-1 in the rodent ovary. Endocrinology. 2003, 144: 3598-3610. 10.1210/en.2002-0137.
Xu Z, Ouyang L, del Castillo-Olivares A, Pandak WM, Gil G: Alpha(1)-Fetoprotein Transcription Factor (FTF)/Liver Receptor Homolog-1 (LRH-1) Is an Essential Lipogenic Regulator. Biochimica et Biophysica Acta. 2010, 1801: 473-479.
del Castillo-Olivares A, Gil G: Alpha 1-fetoprotein transcription factor is required for the expression of sterol 12alpha -hydroxylase, the specific enzyme for cholic acid synthesis. Potential role in the bile acid-mediated regulation of gene transcription. J Biol Chem. 2000, 275: 17793-17799.
Out C, Hageman J, Bloks VW, Gerrits H, Sollewijn Gelpke MD, Bos T, Havinga R, Smit MJ, Kuipers F, Groen AK: Liver receptor homolog-1 is critical for adequate up-regulation of Cyp7a1 gene transcription and bile salt synthesis during bile salt sequestration. Hepatology. 2011, 53: 2075-2085. 10.1002/hep.24286.
Xu Z, Ouyang L, del Castillo-Olivares A, Pandak WM, Gil G: α1-Fetoprotein transcription factor (FTF)/liver receptor homolog-1 (LRH-1) is an essential lipogenic regulator. Biochimica et Biophysica Acta (BBA)-Molecular and Cell Biology of Lipids. 2010, 1801: 473-479. 10.1016/j.bbalip.2009.12.009.
Fakheri RJ, Javitt NB: Autoregulation of cholesterol synthesis: Physiologic and pathophysiologic consequences. Steroids. 2011, 76: 211-215. 10.1016/j.steroids.2010.10.003.
del Castillo-Olivares A, Gil G: Role of FXR and FTF in bile acid-mediated suppression of cholesterol 7α-hydroxylase transcription. Nucleic Acids Res. 2000, 28: 3587-3593. 10.1093/nar/28.18.3587.
Del Castillo-Olivares A, Campos JA, Pandak WM, Gil G: Role of FTF/LRH-1 on bile acid biosynthesis. A known nuclear receptor activator that can act as a suppressor of bile acid biosynthesis. J Biol Chem. 2004, 279 (16): 16813-16821.
Noshiro M, Usui E, Kawamoto T, Kubo H, Fujimoto K, Furukawa M, Honma S, Makishima M, Honma K-i, Kato Y: Multiple mechanisms regulate circadian expression of the gene for cholesterol 7α-hydroxylase (Cyp7a), a key enzyme in hepatic bile acid biosynthesis. J Biol Rhythms. 2007, 22: 299-311. 10.1177/0748730407302461.
Bochkis IM, Schug J, Diana ZY, Kurinna S, Stratton SA, Barton MC, Kaestner KH: Genome-wide location analysis reveals distinct transcriptional circuitry by paralogous regulators Foxa1 and Foxa2. PLoS Genetics. 2012, 8: e1002770. 10.1371/journal.pgen.1002770.
Quiles Romagosa MÁ: NR5A2: a regulator of glucose metabolism. 2011, http://repositorio.unican.es/xmlui/bitstream/handle/10902/555/%5B2%5D%20Quiles%20Romagosa%20MA.pdf?sequence=1
Petersen GM, Amundadottir L, Fuchs CS, Kraft P, Stolzenberg-Solomon RZ, Jacobs KB, Arslan AA, Bueno-de-Mesquita HB, Gallinger S, Gross M: A genome-wide association study identifies pancreatic cancer susceptibility loci on chromosomes 13q22.1, 1q32.1 and 5p15.33. Nat Genet. 2010, 42: 224-228. 10.1038/ng.522.
The pre-publication history for this paper can be accessed here: http://www.biomedcentral.com/1471-2350/15/43/prepub
This study was supported by grants from the National Natural Science Foundation of China (No. 81200320, 81300350), Shanghai Science and Technology Commission (No. 11JC1410000), Fund of Shanghai Health Bureau (No. 20114315) and Training Plan of Excellent Academic Researcher of Shanghai Tenth People’s Hospital (No. 12XSGG105, No. 04.01.13037).
The authors declare that they have no competing interests.
MT, XM and CL participated in the design of this study, and they both performed the statistical analysis. RJ, GYH and YZ carried out the study, together with LQ, collected important background information, and drafted the manuscript. HL, XPW and ZS conceived of this study, and participated in the design and helped to draft the manuscript. All authors read and approved the final manuscript.
Maochun Tang and Li Cheng contributed equally to this work.
Tang, M., Cheng, L., Jia, R. et al. Identification of transcription factors and single nucleotide polymorphisms of Lrh1 and its homologous genes in Lrh1-knockout pancreas of mice. BMC Med Genet 15, 43 (2014). https://doi.org/10.1186/1471-2350-15-43
- Lrh1-knockout pancreas
- Lrh1 homologous gene
- Transcription factor
- Single nucleotide polymorphisms