Hypervariable intronic region in NCX1 is enriched in short insertion-deletion polymorphisms and showed association with cardiovascular traits

Background Conserved non-coding regions (CNR) have been shown to harbor gene expression regulatory elements. Genetic variations in these regions may potentially contribute to complex disease susceptibility. Methods We targeted CNRs of cardiovascular disease (CVD) candidate gene, Na(+)-Ca(2+) exchanger (NCX1) with polymorphism screening among CVD patients (n = 46) using DHPLC technology. The flanking region (348 bp) of the 14 bp indel in intron 2 was further genotyped by DGGE assay in two Eastern-European CVD samples: essential hypertension (HYPEST; 470 cases, 652 controls) and coronary artery disease, CAD (CADCZ; 257 cases, controls 413). Genotype-phenotype associations were tested by regression analysis implemented in PLINK. Alignments of primate sequences were performed by ClustalW2. Results Nine of the identified NCX1 variants were either singletons or targeted by commercial platforms. The 14 bp intronic indel (rs11274804) was represented with substantial frequency in HYPEST (6.82%) and CADCZ (14.58%). Genotyping in Eastern-Europeans (n = 1792) revealed hypervariable nature of this locus, represented by seven alternative alleles. The alignments of human-chimpanzee-macaque sequences showed that the major human variant (allele frequency 90.45%) was actually a human-specific deletion compared to other primates. In humans, this deletion was surrounded by other short (5-43 bp) deletion variants and a duplication (40 bp) polymorphism possessing overlapping breakpoints. This indicates a potential indel hotspot, triggered by the initial deletion in human lineage. An association was detected between the carrier status of 14 bp indel ancestral allele and CAD (P = 0.0016, OR = 2.02; Bonferroni significance level alpha = 0.0045), but not with hypertension. The risk for the CAD development was even higher among the patients additionally diagnosed with metabolic syndrome (P = 0.0014, OR = 2.34). Consistent with the effect on metabolic processes, suggestive evidence for the association with heart rate, serum triglyceride and LDL levels was detected (P = 0.04). Conclusions Compared to SNPs targeted by large number of locus-specific and genome-wide assays, considerably less attention has been paid to short indel variants in the human genome. The data of genome dynamics, mutation rate and population genetics of short indels, as well as their impact on gene expressional profile and human disease susceptibility is limited. The characterization of NCX1 intronic hypervariable non-coding region enriched in human-specific indel variants contributes to this gap of knowledge.


Background
Cardiovascular disease (CVD) is a complex disorder affecting heart and blood vessels, which develops from the interaction between life style patterns and genetic susceptibility to the disease. Western societies face high and increasing rates of CVD (such as coronary artery disease, hypertension, arteriosclerosis, heart failure and arrhytmia etc.), which is considered a number one cause of premature death and disability. Although CVD has been shown to have significant heritability, pinpointing of the genes and variants associated with the elevated risk to the disease has been challenging [1,2]. The focus has slowly switched from DNA variants located in genic regions causing direct changes in the encoded protein to the regulatory variants affecting gene expression. Noncoding variants potentially contributing to the susceptibility to complex diseases are localized in promoters and enhancers, introns or 5'-and 3'-UTRs, and may affect binding of the gene expression regulators, such as transcription and splicing factors or miRNAs. Comparative genetics studies have noted several essential gene regulatory elements that are conserved among species [3,4]. Thus, targeting evolutionarily conserved non-coding regions (CNR) in candidate genes for CVD may pinpoint regulatory elements directing the gene expression profile. Genetic variation in these regions may contribute to the susceptibility to CVD. Based on these hypotheses we aimed to target human CVD candidate gene Na(+)-Ca(2+) exchanger (NCX1; SLC8A1) with polymorphism screening in CNRs and to test associations of identified variants with CVD and related metabolic traits in two Eastern-European populations. Na + /Ca 2+ exchange participates in the regulation of vascular function and thus, disturbances in this process contribute to the development of CVD. Na + /Ca +2 exchanger (NCX1) is a bidirectional calcium transporter, responsible for calcium homeostasis in cardiac myocytes and in other cell types by catalyzing the exchange of one Ca 2+ ion for three Na + ions across plasma membrane [5]. Altered Na + /Ca 2+ exchange activity has been observed in arrhythmias, heart failure [6], and salt-sensitive essential hypertension [7,8]. Ncx1 -/mice showed complete lack of Na + /Ca 2+ exchange activity in heart leading to the defects in heart development and embryonic lethality [9].
NCX1 gene (498 908 bp) is located in chromosome 2p22.1 and consists of 12 alternatively spliced exons [10]. Alternative splicing of NCX1 produces several tissuespecific isoforms [11] differing in their regulatory properties [12,13]. NCX1 alternative isoforms respond differently to potential therapeutic agents such as polyunsaturated fatty acids [14] and specific NCX1 inhibitors [9]. Currently, genetic studies targeting the association of NCX1 polymorphisms with CVD are limited. Resequencing of the entire coding and promoter regions in Japanese population identified 15 polymorphisms, two of these variants located >23 kb upstream of the mRNA transcription start site were associated with hypertension [8].
We have conducted a polymorphism screening in NCX1 non-coding regions. The most potential genetic variant to affect gene function, a 14 bp indel, localized in an intronic hypervariable region was characterized in detail in cardiovascular and metabolic traits in two European populations.

Methods
In silico analysis of conserved non-coding regions (CNR) of NCX1 We screened NCX1 (also known as SLC8A1) locus for the presence of Conserved Non-coding Regions (CNRs) using the web-based VISTA software [15] with the proposed default parameters (cutoff criteria: 100 bp sliding window; sequence identity 70%; comparison with rat and mouse). The analyzed NCX1 locus (in total 420,181 bp) spanned from 10 kb downstream to 10  were excluded as potential coding regions. For polymorphism discovery we selected 29 non-coding regions based on the following criteria: the (i) length 50-300 bp; (ii) location >200 bp from the nearest exons, and (iii) sequence identity >70% between human and both rodents (See additional file 1).

DHPLC screening of novel polymorphisms in NCX1 noncoding regions
The selected NCX1 conserved non-coding segments were targeted to polymorphism screening by Denaturing High-Performance Liquid Chromatography method (DHPLC; Wave Technologies Inc. USA). During the design and experimental screening process of DHPLC products the recommendations of the manufacturer were followed. Details of DHPLC assay and running conditions with appropriate PCR primers and fragment characteristics are given in additional file 2. Among the total 29 CNR-s selected, 16 intronic regions entered the DHPLC screening (See additional file 1) phase. 13 regions were excluded before the laboratory experiments due to failure in DHPLC primer design, inappropriate length of the PCR and CNR fragment (too long >700 bp or short <50 bp), or more than two different Tm melting points for the region of interest. The average length of the screened CNR segments was 163 bp (SD: 64 bp, range: 70-287 bp) and PCR fragments was 334 bp (SD: 87 bp, range 170-489 bp). Polymorphism screening was performed with 15 different DNA pools, each consisting of DNAs of three patients with cardiovascular disease originating from two Eastern European sample sets (n = 22 from HYPEST and n = 24 from CADCZ study; detailed description is given below). Individual DNAs in the pools exhibiting evidence for the presence of a polymorphism were sequenced at least twice on both forward and reverse orientations.

DGGE genotyping assay
Genotyping of the identified 14 bp indel (rs11274804, NCBI dbSNP database) in NCX1 intron 2 was performed by standard Denaturing Gradient Gel Electrophoresis (DGGE) (Ingeny, Goes, Netherlands). The manufacturer's recommendations were followed in the design of the DGGE assay and in choosing the conditions for the experimental setup. Detailed information of the assay is given in additional file 2. To initially validate the reliability of the DGGE assay, all DNA samples with alternative genotypes previously detected by DHPLC were re-genotyped at the DGGE platform. To further assure DGGE gel typing system's quality in each assay, double positive (product containing 14 bp indel) and negative controls were used. All ambiguous genotypes in DGGE analysis were re-genotyped twice and/or sequenced on both DNA strands by an ABI 377 Prism automated DNA sequencer using ReproGel 377 gels (Amersham Biosciences Inc., USA). The sequences of all novel variants were verified by resequencing twice on both forward and reverse orientation.

Subjects for association studies with cardiovascular disease
Two Eastern-European sample collection, HYPEST and CADCZ were used to conduct association analysis of rs11274804 with cardiovascular traits and serum biomarkers (  (December 1996) and is published elsewhere [17]. The studies were carried out in compliance with the Helsinki Declaration and all the participants have given their written informed consent. These sample collections have been recruited to target the genetic-epidemiological component of cardiovascular disease in Estonian and Czech populations, respectively. HYPEST subjects were recruited across Estonia during 2004-2007 (1823 individuals, age range 18-85 years) with the aim to evaluate risk factors for essential hypertension and related cardiovascular disease. Details of the recruitment are given in additional file 2. CADCZ subjects (n = 893; n = 296 coronary artery disease patients, n = 597 controls) were recruited by the Cardiology Department of the 2 nd Clinic of Internal Medicine, Faculty Hospital Královské Vinohrady in Prague Czech Republic and Czech heath clinics in years 1998-2000 [17]. In order to exclude obesity and age-related risks, the current study included individuals with BMI <35 kg/m 2 and age <65 years. From HYPEST individuals 470 hypertensive patients and 652 normotensive controls, and from CADCZ samples 257 CAD patients and 413 controls, were analyzed. The control group for both studies consisted of matched healthy individuals with no personal history of CAD, essential hypertension, MI, peripheral arterial disease, or stroke. As no population differentiation was detected among HYPEST and CADCZ study subjects previously [18], the controls of the two studies were pooled in order to address the association of rs11274804 with cardiovascular traits in general Eastern-European population.

Cardiovascular phenotype
For all subjects in the HYPEST and CADCZ studies resting blood pressure (BP) and heart rate were measured by trained clinicians during recruitment. In both studies BP measurements per subject were obtained after a rest in a sitting position using a standard mercury column sphygmomanometer and size-adjusted cuffs. All HYPEST individuals possessed a documented history of multiple SBP and DBP readings (on average 4.31 readings per individual, range 2-29) during mean 3.17 years (range 1-17 years). To compensate for the variability in heaviness of data per study subjects, we used for the analysis the median across the longitudinal BP readings as well as the median of the subject's age during the readings. Definition of essential hypertension among HYPEST subjects is given in additional file 2. For CADCZ subjects three blood pressure measurements were documented and the median value was recorded.
Coronary artery disease (CAD) in CADCZ study was diagnosed according to WHO criteria, and one or more large stenosis of a major coronary vessel was confirmed by coronarography in all patients details of which have been published elsewhere [17]. Carotid wall intima media thickness (IMT) and the presence of carotid plaque, recorded in the CADCZ subjects were determined by ultrasonography using linear exploring coil 7,5 MHz on the distant interior wall about 1-2 cm distally from the bifurcation. The measurement was performed on the right and the left carotid 5-10 times on each side. Diagnosis of metabolic syndrome was defined based on the criteria appointed by International Diabetes Federation [19].

Laboratory measurements
Altered serum lipid profile is considered as a cardiovascular risk factor -a condition that is associated with an increased risk of developing CVD affecting the heart and blood vasculature. In the current study, lipid measurements (total-cholesterol, HDL-cholestrerol, LDLcholesterol and triglycerides, TG) were determined from fasting venous blood samples in the HYPEST and CADCZ subjects. For HYPEST total-cholesterol, HDLcholesterol, LDL-cholesterol, and triglycerides in the serum were measured by standardized assays (Cobas Integra 800® analytical platform, Roche Diagnostics, Inc., USA) at the United Laboratories, Tartu

Statistical analysis
For all identified polymorphisms, the deviation from Hardy-Weinberg equilibrium and differences in allele frequencies between populations were tested using an exact test implemented in Genepop web Version 3.4 [22]. Four rare population-specific polymorphisms showed differences in allele frequencies between the studied individuals from HYPEST (n = 22) and CADCZ (n = 24) (Fisher's exact test, p < 0.05; data not shown). There was no significant difference between the HYPEST (n = 1122) and CADCZ (n = 670) study groups (Fisher's exact test, p > 0.05; data not shown) for the distribution of the 14 bp indel polymorphism.
The significance of the associations between the NCX1 14 bp indel and cardiovascular traits was tested, and odds ratios/effect sizes and confidence intervals were obtained using linear (quantitative traits) and logistic (case-control analysis) regression analyses implemented in the PLINK software [23]. In all study stages the association analyses were performed under additive and dominant genetic models. Additive genetic models assume a trend per copy of the minor allele to contribute to the trait or disease susceptibility on genotype categories, whereas dominant genetic models assume that heterozygotes have the same increased risk as minor homozygous genotypes. Recessive genetic model was not applied, as it requires a large sample size to reach a reasonable statistical power. Meta-analysis was performed using inverse variance method with fixed effect model of both sample sets. Association tests were performed with age, sex and BMI as co-variates. Pvalues <0.05 were considered statistically significant. Two additional corrections were used in the quantitative parameter analysis: in the serum lipid biomarker analysis, a correction according to Jun Wu was implemented to all of the individuals obtaining lipid-lowering medications [24], and in the blood pressure (SBP and DBP) association test, a correction described by Martin Tobin was used for all subjects receiving antihypertensive treatment [25]. For the multiple comparisons (11 traits Cases: age at the onset of the essential hypertension (HYPEST) or coronary artery disease (CADCZ); Controls: age at the recruitment mmHg -millimeters of mercury; bpm -beats per minute; NA -not available tested) a simple Bonferroni correction was used (p < 0.05/11 = 0.0045). Multiple sequence alignment of the human, common chimpanzee and rhesus macaque DNA sequences of the orthologous regions adjacent to 14 bp indel within NCX1 intron 2 was performed with web-based analysis tool ClustalW2 [26]. LD structure (based on the Hap-Map variation data) for the analyzed NCX1 genomic region was performed with the Haploview package [27] (See in additional file 3).

DHPLC screening of polymorphisms in NCX1 non-coding regions
We analyzed human cardiovascular candidate gene, NCX1 (Na(+)-Ca(2+) exchanger, 2p22.1), for the conserved non-coding regions using VISTA genome browser [15] with default parameters. In total, we identified 365 conserved non-coding regions between human and mouse and rat (May 2004, NCBI Build 33). These conserved non-coding regions (CNR) covered 15.1% of the analyzed genomic region (gene ± 10 kb). Based on the length (50-300 bp), location (>200 bp from the nearest exon) and sequence conservation (>70% between human and both rodents), 29 regions were selected for further polymorphism screening among 46 East-European cardiovascular disease patients (HYPEST n = 22; CADCZ n = 24). Due to the limitations of the technology and/or complex structure of the DNA sequence, 13 of the selected NCX1 regions did not qualify for the DHPLC screening. The remaining 16 regions were subjected to polymorphism discovery among cardiovascular phenotype patients. Detailed descriptions of the selected regions as well as inclusion/exclusion criteria for the analysis are given in additional file 1. In total, ten genetic variants (including three novel variants) were identified within the 16 analyzed regions ( Table 2). Most of the variations were located within the second intron of the gene (first intron relative to ATG initiation codon), including six SNPs, and one 14 bp indel (rs11274804, NCBI dbSNP database). Two SNPs were detected in the NCX1 intron 10 and one SNP in intron 4. Among the screened HYPEST samples (n = 22) three common (minor allele frequency, MAF>10%) and four rare (MAF<10%) variants were identified. Five polymorphisms detected in the studied CADCZ patients (n = 24) were uncommon (MAF<10%) and four were common (MAF>10%). Four variants were specific to one of the studied sample sets (either HYPEST or CADCZ).

Characterization of the intronic hypervariable region
Nine of the identified variants in screened NCX1 noncoding regions were either singletons or targeted already by commercial platforms and thus included in large number of studies ( Table 2). The current study focused on the 14 bp indel (rs11274804), which was represented with substantial frequency (6.82% in HYPEST; 14.58% in CADCZ) in both study populations. In addition, the location of rs11274804 indel variant in the second intron of NCX1 (the first intron in the coding region) raised the hypothesis about its potential effect on the gene transcription as several gene expression regulatory Major and minor alleles are indicated with capital and lower case letters, respectively. 4 Number of number of homozygotes/heterozygotes (hom/het) of each identified polymorphism among the screened individuals 5 Commercial genotyping platforms, which include the described variants (according to NCBI database) 6 the 14 bp indel: CATTCCCTCTCCAT/-ND -not detected; NA -not available elements have been mapped within the first introns [28][29][30]. Thus, this polymorphism as well as its flanking region was selected for further characterization. The region flanking the 14 bp indel (348 bp) in NCX1 intron 2 was targeted for the larger-scale genotyping using Denaturing Gradient Gel Electrophoresis (DGGE). Our study samples, HYPEST (n = 1122) and CADCZ (n = 670), displayed nine different genotype variants of this intronic segment (representing seven novel alternative alleles), detected by DGGE and confirmed by sequencing ( Figure 1). The analyzed region appeared to be highly polymorphic in both sample collections. In addition to 14 bp indel, a SNP (C/G), a duplication of 40 bp segment, and four alternative deletions (10 bp, 43 bp and 5 bp) were localized within the 348 bp region (Figure 1, Table 3). The breakpoints of several detected variants co-localized with the 14 bp indel (Figure 2b). The allele frequencies of the detected variants in the full genotyped sample (n = 1792) varied from singletons (10 bp deletion; 40 bp duplication) to common polymorphisms with allele frequencies up to 8.51% (14 bp indel). The 43 bp deletion was enriched in HYPEST samples (20 subjects in HYPEST versus 1 in CADCZ). One HYPEST subject appeared to be a compound heterozygote for 14 bp indel/43 bp deletion. In total, 18.21% of the genotyped HYPEST subjects and 20.49% of CADCZ subjects were carrying alternative variants of the studied NCX1 intron 2 segment (Table 3). Active genome dynamics of the analyzed region is supported by its location between two LD-blocks within the NCX1 gene (See in additional file 3).

Identified 14 bp indel originates from a human-specific deletion compared to ancestral primate sequence
In order to determine the ancestral primate variant of the studied 348 bp region, the consensus sequences of human, common chimpanzee and rhesus macaque were aligned. In contrast to the expectations, multiple sequence alignment (ClustalW2) revealed that the ancestral primate variant is actually the minor human allele carrying the 14 bp sequence motif. The major human variant (among East-European subjects) has evolved through a 14 bp human-specific deletion when compared to sister-species chimpanzee and rhesus macaque (Figure 2b).
Other identified short insertion-deletion variants within the studied region have occurred in human lineage on the chromosomal variant carrying the humanspecific 14 bp deletion (Figure 2b).

Association of the 14 bp indel with cardiovascular traits
Association of NCX1 intronic 14 bp indel (rs11274804) with cardiovascular disease was studied in two Eastern European sample sets: essential hypertension (HYPEST, Estonia; n = 1122; cases n = 470/controls = 652) and coronary artery disease, CAD (CADCZ, Czech; n = 670; cases n = 257/controls = 413). Associations were assessed using logistic regression under additive and dominant effect models (age, sex, and BMI as covariates; Table 4). NCX1 intronic 14 bp indel region revealed strong association with the diagnosis of CAD (P = 0.0016, OR = 2.02; P = 0.0018, OR = 2.07; additive and dominant models, respectively). As there are various clinical conditions that comprise CAD, additional casecontrol analysis was performed using patients diagnosed with CAD as well as metabolic syndrome (n = 88). Despite a three fold reduction in sample size compared to the full CADCZ patient group, the analysis of CAD patients with metabolic syndrome revealed highly significant association with increased effect size (P = 0.0014, OR = 2.34; P = 0.0016, OR = 2.41; additive and dominant models, respectively). These associations remained

Association of the 14 bp indel with quantitative cardiovascular parameters
Associations of the 14 bp indel with quantitative cardiovascular parameters [systolic (SBP) and diastolic (DBP) blood pressure, heart rate, Intima-Media Thickness (IMT)], and serum lipid biomarkers [total cholesterol, HDL, LDL, triglycerides (TG)] were evaluated by linear regression under additive and dominant models ( Table  5, Table 6). A marginal negative correlation was detected with heart rate (P = 0.04, beta = -1.6; additive model) and LDL (P = 0.04, beta = -0.26; dominant model) among healthy Eastern-European subjects ( Table  5, Table 6). Association of the 14 bp indel with serum triglyceride levels reached marginal significance in the CADCZ sample (P = 0.04, beta = 0.25; both models) and a non-significant trend for association in the joint meta-analysis with HYPEST data (P = 0.08, beta = 0.18; P = 0.07, beta = 0.19; additive and dominant models, respectively). No significant association was detected with other studied cardiovascular and serum lipid parameters in separate sample sets and in meta-analysis.

Discussion
We subjected the human NCX1 (Na + /Ca 2+ exchanger) gene to polymorphism screening in conserved non-coding regions with the aim to identify novel potential regulatory variants, which may contribute to the development of cardiovascular disease (CVD). So far, fine-scale polymorphism discovery in the coding, and promoter regions of the NCX1 gene have been carried out only among Japanese individuals, where twopromoter SNPs were shown to be associated with essential hypertension [8] as one of the major risk factor for several CVDs (i.e. CAD). We conducted the polymorphism discovery in NCX1 non-coding conserved regions using CVD patients from two Eastern-European sample collections (HYPEST, essential hypertension; CADCZ, coronary artery disease). Among the ten identified variants, the genomic context of the 14 bp indel located in NCX1 gene intron 2 and its association with CVD was studied in detail. First introns have previously been indicated to contain essential regulatory elements and therefore, may contribute to the transcriptional regulation processes [31] and splicing. For example, in human CFTR gene a regulatory intronic DNase I hypertensive site (DHS) was shown to be required for the normal expression levels in the intestinal epithelium in vivo [28]. The expression profiles of human angiotensin II type 2 receptor AGTR2 and erythroid-specific GATA-1 are affected by regulatory elements in intron 1 containing transcription factor binding sites [29,30].
Genotyping of the 14 bp indel locus revealed the hypervariable nature of the studied genomic segment within NCX1 intron 2. Among the screened Eastern-Europeans (n = 1792) the analyzed 348 bp region was represented by seven different alleles (Figure 1, Figure  2). The alignments of human-chimpanzee-macaque sequences revealed that the major human variant (allele frequency 90.45%) was actually a human-specific deletion compared to other primates. The most common alternative variant, the 14 bp indel, appeared to have the ancestral status among primates. Both, chimpanzee and rhesus macaque possess this common 14 bp sequence tract in their NCX1 intron 2 (Figure 2b). Either natural selection or genetic drift may have contributed to the enrichment of the novel 14 bp deletion variant among humans. The data suggests that the novel deletion variant may carry a selective advantage among humans as it was found to be associated with decreased risk for CAD and elevated serum triglyceride levels. In primate evolution, emergence of such short indel variants and indel-related transcriptional and translational changes may have provided an additional source for the flexible response of genomes to the changing life-style and environmental conditions. As a supportive observation, an enrichment of indels in immunity-associated loci has been found as a possible response to variable virus infections (i.e. HIV) in human and chimpanzee [32].
The human-specific deletion variant is surrounded by an abundance of other short (5-43 bp) deletion variants and a duplication (40 bp) polymorphism, which possess overlapping breakpoints (Figure 2b). This observed high variation refers to a potential indel hotspot, which may have been triggered by the initial 14 bp deletion in human lineage. This scenario is consistent with a recent report revealing the mutagenic role of the indel heterozygosity to its surrounding sequences [33]. The state of indel heterozygosity is expected to affect localized chromosome pairing during meiosis. Regions with indel heterozygosity might be prone to double stranded DNA breaks and are thus targeted to mutational repair, which in turn leads to higher mutation rate [33,34]. A consensus sequence motif GTAAG has been reported with the high prevalence within genomic regions prone to insertion/deletion events [35,36]. The sequence (CATTCCCTCTCCAT) of the 14 bp indel identified in this study contains the inverted sequence of this previously described hotspot motif on the reverse strand (GTAAG vs. CATTC). In addition, the studied  For association analysis with SBP and DBP regression testing for a linear trend of marker alleles was performed with age, sex and BMI as covariates. Association analysis with heart rate was performed with sex as covariate, and intima-media thickness without covariates. 2 Correction for antihypertensive treatment was implemented to all treated patients as described [25]. 3 Pooled HYPEST and CADCZ control subjects, who had no personal history of cardiovascular disease, including essential hypertension, myocardial infarction, coronary artery disease, stroke, and had never been prescribed cardiovascular medications. Previously, no population differentiation was detected between HYPEST and CADCZ study subjects [18] n -number of individuals; IMT -Intima-media thickness; Significant differences have been highlighted in bold, p < 0.05 hypervariable NCX1 intronic region harboured two further CATTC motifs (Figure 2b). Low LD in this region additionally refers to active genome dynamics (See in additional file 3).
High prevalence of small indels (<100 bp) has been found to be preferably associated with alternatively spliced genes, where partial inclusion and deletion of genic regions may broaden gene expression profiles in different tissue types [32,36]. Among the 12 exons coded by the human NCX1 gene there are six alternative untranslated 5'-exons denoted 1a-1f. Different combinations of these six exons may result in up to 32 different Na + /Ca 2+ exchanger mRNA transcripts [37]. The hypothesis that preferences in splice-site selection among NCX1 exons 1a-1f (and other alternative exons) may be affected by the genetic composition in the hypervariable intron 2 requires experimental proof.
The knowledge about the involvement of small indels (<100 bp) in increasing/decreasing susceptibility to the development of complex disease is still scarce. A wellknown example is 32 bp deletion in human the CCR5 gene that results in a frameshift and premature termination [38,39]. This variant, which is common among Europeans (MAF 9.2%) and almost absent among Africans contributes to the resistance against HIV-1 infection. It was speculated that a 10 bp direct repeat that flanks the deleted region promoted a recombination event leading to the 32 bp deletion [39]. The effect of indels in noncoding regions is less understood. A recent study identified a 25 bp deletion in intron 32 of the human MYBPC3 gene leading to the loss of exon 33. The carriers of this deletion develop heritable cardiomyopathies and have increased risk of heart failure in Indian populations [40].
The current study identified a strong association between the carrier status of NCX1 intronic 14 bp indel and increased risk to coronary artery disease (CAD) in the East European population. The risk was the highest among the CAD patients with metabolic disease. Consistent with the effect on metabolic processes, the 14 bp indel was also associated with serum triglyceride levels. Several recent studies have highlighted the role of metabolic factors and metabolic syndrome in determining the extent of CAD and the risk for new vascular events [41,42]. In perspective, the association study of rs11274804 as well as other identified indel variants in NCX1 intron 2 with CAD and related metabolic factors is to be extended to other populations. The expression of NCX1 alternative transcripts in vascular muscle (NCX1.3 isoform) and in heart (NCX1.1 isoform) further supports the potential involvement of NCX1 genetic variants in susceptibility For association analysis with serum lipids regression testing for a linear trend of marker alleles was performed with age, sex and BMI as covariates. 2 Correction for the treatment with lipid-lowering medication was implemented as described [24]. 3 Pooled HYPEST and CADCZ control subjects with available records for serum lipids. The individuals had no personal history of cardiovascular disease, including essential hypertension, myocardial infarction, coronary artery disease, stroke, and had never been prescribed cardiovascular medications. Previously, no population differentiation was detected between HYPEST and CADCZ study subjects [18] n -number of individuals; Significant differences have been highlighted in bold, p < 0.05 to coronary artery disease [7]. Functional studies would reveal whether the carrier status of alternative NCX1 intronic indel variants affects the alternative transcript profile of the gene.

Conclusions
In summary, compared to SNPs targeted by large number of locus-specific and genome-wide assays, considerably less attention has been paid to short insertion-deletions (indels) variants in the human genome. The data of genome dynamics, mutation rate and population genetics of short indels, as well as their impact on gene expressional profile and human disease susceptibility is limited. The characterization of NCX1 intronic hypervariable region enriched in human-specific indel variants contributes to this gap of knowledge.