Abstract
A variable number of tandem repeats (VNTR) located in the insulin gene (INS) control region may be involved in the development of type 2 diabetes (T2D). The TH01 microsatellite is located close to INS and has previously been suggested to be involved its regulation. Therefore, this observational study investigated whether the TH01 microsatellite and INS VNTR, as assessed via the surrogate marker single nucleotide polymorphism (SNP) rs689, are associated with T2D in the Mexican population. Logistic regression models were used to calculate the risk conferred by TH01 and INS VNTR loci for T2D development. TH01 alleles 6, 8, 9 and 9.3 and allele A of rs689 were independently associated with T2D; differences were found between age at T2D diagnosis and sex. Larger alleles of TH01 (≥8 repeats) conferred an increased risk for T2D in males when compared with smaller alleles (≤7 repeats) (odds ratio, ≥1.46; 95% confidence interval, 1.1–1.95). In females, larger alleles conferred a 1.5-fold higher risk for T2D when diagnosed at ≥46 years whereas they conferred protection when diagnosed at ≤45 years. Both TH01 and SNP rs689 were associated with T2D in the same groups; the association remained significant for both loci in multivariate models. The median fasting plasma insulin concentration was significantly higher in patients with T2D versus controls, and in those diagnosed at ≤45 versus ≥46 years. TH01 larger alleles or the A allele of rs689 may potentiate insulin synthesis in males, but not females, without T2D, a process that is disabled in those with T2D.
Introduction
Type 2 diabetes (T2D) is associated with insulin resistance in peripheral tissues and a deficiency in insulin production by the pancreas [1, 2], which may be due to poor gene expression [3], mRNA maturation [4], or abnormal insulin secretion [5] by pancreatic β-cells. Single nucleotide polymorphisms (SNPs) located in genes involved in insulin maturation and production, such as WFS1 [6] and IGF2BP2 [7], or the regulation of insulin gene (INS) expression, such as TCF7L2, confer an increased risk for T2D [8]. The SNP rs149483638, which confers a lower risk for T2D, is located in the INS-IGF2 gene located near the INS gene [9]. These data appear to suggest that the INS control region could be influenced directly or indirectly by proteins encoded either by these or other genes, or by nearby polymorphisms. In the INS control region, which begins upstream of the INS gene transcription start site, there is a variation in the number of tandem repeats (VNTR) polymorphism at 390 base pairs (bp) from the start of INS transcription (Figure 1) that is associated with type 1 diabetes [10–12]. The INS VNTR consists of a sequence of 14–15 nucleotides (5′-ACAGGGGTGTGGGG-3′) repeated in tandem [13] and has been shown to participate in the regulation of the INS in healthy individuals [14–16]. INS VNTR alleles are categorized according to the number of repeats (class I, 26–63 repetitions, average 570 bp; class II, 64–140 repetitions, average 1200 bp; class III, 141–209 repetitions, average 2200 bp). While class I INS VNTR alleles are considered to confer a risk of developing type 1 diabetes, class III alleles are generally associated with dominant protection [13]. Unlike class III alleles, class I alleles have been associated with increased insulin production [17, 18].
Although the INS VNTR has been associated with T2D [19–22], the results remain largely controversial. Moreover, because most studies were conducted in European populations, there is little evidence in Latin Americans. There could be different associations between INS VNTR and T2D in the latter population, as polymorphisms in SLC16A11, INS-IGF2, and HNF1A genes have previously been reported to have strong associations with T2D in Mexicans, but not in Caucasians [23, 24].
The INS VNTR is complex; therefore, a surrogate marker, the SNP rs689, has been frequently used [22]. This marker is found within intron 1 of INS and is in complete linkage disequilibrium with INS VNTR in White populations (allele A is associated with class III alleles; allele T, with class I alleles). In addition, the tyrosine hydroxylase (TH)01 microsatellite, a tetranucleotide (AATG) repeated 3–11 times in tandem, is also located relatively close to INS (approximately 9,800 bp). It is located within the TH gene that is in partial linkage imbalance with the INS VNTR in White [10, 25] and Japanese [26] populations. In addition, in vitro experiments have demonstrated that TH01 has enhancer functions, i.e., it can regulate gene expression at a distance, a common attribute for microsatellites [27], and could potentially influence INS expression [28].
Interestingly, although TH01 is associated with obesity [29], hypertension [30, 31], coronary heart disease [32], metabolic syndrome and high triglyceride levels [33, 34], its association with T2D has not been explored. Therefore, this study was conducted to investigate whether the TH01 microsatellite and INS VNTR (through SNP rs689) are associated with T2D and the concentration of fasting plasma insulin in the Mexican population. The main study conducted was a case–control study that used univariate (ULR) and multivariate (MLR) logistic regression models to calculate the risk (OR, odd’s ratio) conferred of each loci for T2D in a Mexican population; a clinical replica case–control study and a cross-sectional study were also included.
Results
Participants and demographic characteristics
The main case–control study, replica case–control study, and cross-sectional study included 1986, 1188, and 1172 participants, respectively (Table S1); 53.3%, 60.6%, and 73.5% of participants were female, respectively. In both case–control studies, nearly 50% of the participants were T2D cases (i.e., 10.5% [main case–control study] and 3.2% [replica case– control study] of participants were diagnosed during recruitment, while the remaining cases had been previously diagnosed). In contrast, in the cross-sectional study, the prevalence of T2D was 5.4%; the majority (92/104) were diagnosed during recruitment, while only 12 cases had been previously diagnosed. At the time of enrollment, the mean ± standard deviation (SD) age of controls vs cases was similar across the studies (main, 59.2 ± 11.3 vs 55.6 ± 11.7 years; replica, 54.1 ± 10.1 vs 57.1 ± 9.5 years; cross-sectional, 49.1 ± 10.8 vs 49.2 ± 11.4 years), and the mean age at the time of T2D diagnosis was also similar across the studies (ranging from 44.8 to 49.2 years). The mean duration of disease was 9.5 ± 8.7 years in the main study, 12.3 ± 7.7 years in the replica case–control study, and approximately 0 years in the cross-sectional study (Table S1).
Distribution of TH01 alleles stratified by sex and age at T2D diagnosis
TH01 alleles were only assessed in the main study (n = 3972 chromosomes). The TH01 allele frequency between cases and controls is summarized in Table 1. Though we identified 9 of the 11 alleles reported for the TH01 microsatellite [35], only five (6, 7, 8, 9 and 9.3) had a frequency ≥5% in cases or controls. Whereas allele 6 was observed more frequently in controls than in cases (36.6% vs 30.2%, P < 0.0001), alleles 8 (5.2% vs 3.9%, P < 0.05) and 9.3 (21.0% vs 17.3%, P < 0.01) were more frequent among cases than controls. The difference between controls vs cases for allele 6 was greater in males (38.3% vs 29.4%; P < 0.0001) than in females (35.1% vs 30.9%; P < 0.05). Differences between cases vs controls for alleles 8 and 9.3 were only significant among males (5.8% vs 3.2%; P < 0.01 and 21.4% vs 17%; P < 0.05, respectively).
When stratified by sex and age at diagnosis, the difference in the frequency of allele 6 increased in the older controls, irrespective of sex. In the older group, the difference in allele 8 frequency was significantly increased in male cases vs controls. Moreover, the difference of allele 9.3 frequency between controls and cases was significant in older females and in males regardless of age at T2D diagnosis. Similarly, allele 9 was more frequent in younger female controls vs cases. The large (L) alleles, those with ≥8 repeats (R; alleles 8, 9, 9.3, and 11) were predominant in older male and female cases, as well as in younger male cases, while the small (S) alleles, those with ≤7R (alleles 7, 6, 5, 4, and 3) predominated in the younger female cases.
Association of TH01 with T2D using regression models
The association of TH01 alleles 6, 8, 9, and 9.3 (n = 3972 chromosomes) with T2D is presented in Table S2. Allele 6 had a significantly protective effect towards T2D diagnosed at ≥46 years, with a 32.0% reduced risk in females and 44.0% in males, and allele 9 for T2D in females diagnosed at ≤45 years (54.0%). The opposite effect was observed for allele 8, which conferred a 2.26-times greater risk for T2D diagnosed at ≥46 years in males, and allele 9.3, which conferred a 1.43-times increased risk in females diagnosed at ≥46 years and a 1.34-times increased risk in males diagnosed at either age cutoff. The alleles with ≥8R conferred a similarly increased risk for T2D diagnosed at ≥46 years in both males (1.57 times) and females (1.46 times). Moreover, while they conferred a 1.46-times increased risk for T2D diagnosed at ≤45 years in males, they offered protection (25% reduction) in females diagnosed at ≤45 years.
The association of TH01 genotypes (n = 1986 chromosomes) with T2D stratified by age at T2D diagnosis and sex is highlighted in Table 2. The associations of T2D with genotype were of greater magnitude than observed with alleles; the homozygous L/L genotypes conferred a much higher risk than heterozygous L/S genotypes. The risk in males increased from an odds ratio (OR) of 1.48 (95% confidence interval [CI], 1.13–1.95; P = 0.005) when heterozygous to 2.26 (95% CI, 1.41–3.64; P = 0.0007) when homozygous, indicating an additive effect with the number of alleles with ≥8R in the genotype.
Frequency of alleles and genotypes of SNP rs689 and its association with T2D
Table 3 compares the frequency of rs689 alleles (n = 8682) between the controls and cases. Allele A was significantly more frequent in cases than in controls (23.8% vs 20.0%; P < 0.0001). Although this difference was observed in females, it was markedly greater among males. This difference also significantly increased in male cases diagnosed at ≤45 years than at ≥46 years. In contrast, the frequency of allele A was also higher in cases than in controls among females diagnosed at ≥46 years, but not in the younger (≤45 years) group. The frequency of rs689 genotypes (n = 4341) between controls and cases are listed in Table S3, and similar differences were found in the frequency of genotypes AT and AA between all cases and controls.
Logistic regression analysis demonstrated that allele A confers a significantly higher risk for T2D (OR, 1.25; 95% CI, 1.1–1.4; P < 0.0001), with a comparatively higher risk in males (OR, 1.37; 95% CI, 1.2–1.6; P < 0.001) than in females (OR, 1.15; 95% CI, 1–1.3; P = 0.043). This risk was even higher in males diagnosed with T2D at ≤45 years (OR, 1.45; 95% CI, 1.2–1.9, P < 0.001) than at ≥46 years (OR, 1.25; 95% CI, 1–1.6; P = 0.069). This allele confers an increased risk for T2D only in females diagnosed at ≥46 years (OR, 1.24; 95% CI, 1–1.5; P = 0.032).
The association of SNP rs689 genotypes (n = 4341) are shown in Table 4. Both genotypes were found to confer a risk for T2D only in males diagnosed early, with an additive effect; risk was much greater for AA (OR, 2.2; 95% CI, 1.2–4.1; P = 0.013) than AT (OR, 1.53; 95% CI, 1.2–2; P = 0.002). In those diagnosed at ≥46 years, a higher risk was conferred only by AA in females (OR, 2.44; 95% CI, 1.4–4.2; P = 0.0015) and AT in males (OR, 1.38; 95% CI, 1–1.9; P = 0.036).
The allelic and genotypic frequencies of SNP rs689 alleles (n = 3962) and genotypes (n = 1981) in the main study are presented in Table S4, and their associations using regression models in Table S5. Similar observations in the replica case–control study (alleles, n = 2376; genotypes n = 1188) are shown in Table S6, and their associations using regression models in Table S7; those for the cross-sectional study (alleles, n = 2344; genotypes, n = 1172) are in Table S8 and Table S9, respectively. Despite the similar differences across the three studies in the allelic and genotypic frequencies, the risk conferred by allele A, or genotypes AT and AA, towards T2D diagnosis were observed mainly in males diagnosed at an earlier age in the two replica studies.
Linkage disequilibrium analysis
The linkage disequilibrium between the TH01 and SNP rs689 alleles (n = 3682) is summarized in Table 5. Interestingly, the alleles of TH01 with 3 to 7 repeats were in linkage disequilibrium with allele T of SNP rs689. Alleles 6 and 7 of TH01 were inherited with allele T of SNP rs689 98.5% (Tajima’s D [D’] = 0.9307) and 94.8% (D’ = 0.7585) of the time, respectively. The differences in the frequency of haplotypes 6/T vs 6/A and 7/T vs 7/A completely departed from the expected random distribution if they were in linkage equilibrium (P < 0.00001). In contrast, all TH01 alleles with 8 to 11 repeats, except allele 9, were in partial linkage disequilibrium with allele A of SNP rs689. Allele 9.3 of TH01 was highly linked and inherited with allele A for 83.0% of the time (D’ = 0.7829), whereas alleles 8 and 11 were partially linked to allele A and segregated together for 52.9% and 66.7% of the time, respectively, and their respective D’ values were lower (0.3882 and 0.5753). Similarly, the frequencies of haplotypes 9.3/A, 8/A, and 11/A were much more frequent than haplotypes with a T than if they were in equilibrium and randomly distributed (P < 0.01). Allele 9 was in linkage disequilibrium with allele T, and they were inherited together 90.1% of the time (D’ = 0.5814).
Multivariate logistic regression of TH01 and SNP rs689
The associations of the TH01 and SNP rs689 alleles (n = 3952) with T2D are shown in Table 6. Both markers were associated with T2D in the same groups as reported in ULR models; the association remained significant for both loci in MLR models. Despite being in linkage disequilibrium, the ligation between some alleles of these two markers is incomplete, suggesting that both loci contribute independently to T2D in the MLR model. In fact, the value of R2 increased from the first (TH01) to the second (rs689) block introduced in the MLR model, supporting the additive contribution of the second marker in the model, which could be indicative of a cooperative effect by which both loci contribute to T2D.
The association of TH01/rs689 haplotypes (n = 3952) with T2D, using the univariate model, is shown in Table S10. In males, both haplotypes were associated with T2D with haplotype ≥8R/T conferring a higher risk than haplotype ≥8R/A. This finding indicates a predominant effect of the microsatellite TH01 over that of rs689 alleles. When explored individually, the T allele of SNP rs689 confers protection, although in females the effect differed between the age groups. Here, in females ≥8R/T was protective for T2D development at ≤45 years while ≥8R/A had no effect; the opposite effect was observed among the group diagnosed with T2D at ≥46 years of age.
Association of insulin concentration with age and the alleles of TH01 and rs689
Table S11 depicts the concentrations of fasting plasma insulin observed in the main study, as stratified by age at T2D diagnosis, sex, and TH01 and rs689 alleles (n = 2564). The median (interquartile range) fasting plasma insulin concentration (mIU/mL) was significantly higher among cases than in controls: 9.5 (5.5–16.2) vs 6.8 (4.8–10); P < 0.0001. Interestingly, insulin concentrations were higher in cases diagnosed at ≤45 years than at ≥46 years; this difference was greater in males than in females. In fact, insulin concentrations decreased with the age at T2D diagnosis (r = −0.111; P < 0.0001). The decrease was much greater in males (r = −0.189; P < 0.0001) than female (r = −0.051; P > 0.05) and was independent of allele type, suggesting it is likely related to age.
Insulin concentration significantly decreased with participant age for the entire main study population (r = −0.046; P = 0.02). The correlations between fasting plasma insulin concentration and age of controls and cases for the alleles of the two markers are shown for males in Figure 2 and females in Figure 3. In males with diabetes, insulin decreased significantly with age in those with alleles that had ≥8R of TH01 (r = −0.203; P = 0.009) or the rs689 allele A (r = −0.250; P = 0.004). With the presence of both alleles, there was a significant increase in the negative correlation (r = −0.423; P = 0.007), which occurred only with ≥8R/A and not ≥8R/T. On the contrary, in males in the control group, an inverse correlation was observed for those who have these alleles whereby the fasting plasma insulin concentration increased with age (Figure 2). In contrast, no significant correlations between insulin concentration and age were observed in cases with T2D or females in the control group for the alleles of the two markers (Figure 3).
DISCUSSION
Our results indicate an association between TH01 microsatellite and SNP rs689 with T2D and with plasma concentrations of fasting plasma insulin. The degree of association varied with age at T2D diagnosis and sex. TH01 alleles with ≥8R and rs689 A allele conferred an increased risk of developing T2D at any age in males and at ≥46 years in females; there was a protective effect for developing T2D at ≤45 years among females. TH01 alleles with ≤7R have either an inverse (allele 6) or a neutral (allele 7) effect. Fasting plasma insulin decreased linearly with age in male cases who had alleles with ≥8R or rs689 A allele but increased in controls with those alleles. In contrast, fasting plasma insulin remained constant in male cases and controls who had alleles with ≤7R or rs689 T allele. These findings are in line with those reported by Le Stunff et al who found that, when using SNP rs689 as a surrogate marker of INS VNTR, T/T genotypes (VNTR I/I) were associated with a higher level of fasting plasma insulin [36]. On the contrary, these results suggest that the decrease of insulin concentration with age in females is not influenced by TH01 or rs689.
Linkage disequilibrium and multivariate analysis suggest that the TH01 marker is partially in linkage imbalance with SNP rs689 and that the risk or protection conferred by both loci for T2D appear to be independent of each other, even though there may be an additive effect. On analyzing SNP rs689 in isolation, the T allele conferred protection for T2D, but when linked to TH01 alleles with ≥8R, the haplotype (≥8R/T) conferred a risk of developing T2D in males at any age and in females aged ≥46 years. However, alleles with ≥8R conferred protection for T2D in females aged ≤45 years when linked to the T allele (≥8R/T), but not the A allele (≥8R/A).
Although inconsistent, most studies indicate that class I alleles stimulate the INS gene 1.5- to 3-times more than class III alleles [10, 14, 17, 18, 37–39]. A study that used the SNP rs689 as a surrogate marker of the INS VNTR noted an association between T/T genotypes (VNTR I/I) with a higher fasting plasma insulin level [36]. In contrast, some studies have found greater stimulation of allele III for insulin gene expression in in vitro experiments [15] or equal to allele I for plasma insulin secretion [16]. Allele A of SNP rs689, linked with allele III, influences alternative splicing of intron 1 of INS through differential recognition of its 3′ splice site, resulting in an increased production of mature transcripts and more proinsulin in culture supernatants than transcripts from allele T [16].
The influence of VNTR on gene activity or plasma insulin secretion in T2D remains unknown. However, studies have demonstrated the involvement of VNTR in the expression of INS, a mechanism that could be altered or affected in T2D in some populations [11, 16]. Our results with SNP rs689 support this hypothesis, as insulin concentration significantly decreased with age in male participants with T2D who were positive for allele A (class III), while it remained constant in those positive for allele T (class I).
We found that alleles with ≥8R were associated with a slight increase in fasting plasma insulin concentrations with age in controls, indicating a probable involvement of TH01 in regulating INS expression. However, because this association was also observed with allele A of SNP rs689, the association between TH01 microsatellite alleles and fasting plasma insulin levels could be due to a linkage effect with allele A. In cases with these alleles or with the ≥8R/A haplotype, fasting plasma insulin levels decreased with age, which could suggest that the regulation of INS expression could be progressively affected by age in the presence of class III (of VNTR INS) and/or ≥8R (of TH01) alleles. Because allele 6, which is completely linked to allele T of rs689, was found to be a protector for T2D and its frequency increased with age, it could have an inverse effect compared with the ≥8R/A haplotype on INS gene expression during aging.
Given alleles with ≥8R are protective for T2D development at ≤45 years and confer a greater risk at a later age (only in females), then this could indicate the crucial role of estrogen in this association. Epidemiological studies have demonstrated that estrogens are protective for T2D, whereas hormone therapy has been shown to reduce the incidence of diabetes by 35% in postmenopausal females with coronary heart disease [40]. This is in contrast to early menopause, during which there is a higher risk of T2D [41]. There is evidence that estrogen represses the expression of insulin mRNA in pancreatic β-cells through indirect genomic signaling [42] and can stimulate the degradation of misfolded proinsulin, thereby protecting the production of insulin and delaying the onset of diabetes [43]. Our analysis is limited by including only Mexican participants and therefore the study findings are not necessarily generalizable to other populations. Moreover, this study focused on the diagnosis of T2D; however, the extent of it and whether the investigated alleles pose a risk for treatment failure, remain unknown.
In conclusion, insulin increases with age in males when TH01 alleles with ≥8R or allele A of rs689 are present, suggesting an involvement in a mechanism that maintains insulin synthesis in individuals without T2D. In contrast, this is somehow disabled in patients with T2D, thus warranting further research.
Materials and Methods
Sample selection and study design
The individuals included in the main case–control study were part of the Diabetes in Mexico Study (DMS), the details of which have been previously described as part of the SIGMA Type 2 Diabetes Consortium [23, 24]. Briefly, participants were recruited between November 2009 and August 2013 from two tertiary level hospitals in Mexico City, Mexico; T2D was diagnosed based on the American Diabetes Association [44] criteria. A total of 988 cases (unrelated individuals, aged >20 years, with a previous diagnosis of T2D or fasting blood glucose >125 mg/dL) and 998 controls (healthy individuals aged >50 years with fasting blood glucose <100 mg/dL) were selected from the DMS database; all participants were of Mexican mestizo ancestry. Clinical information such as weight, height, waist and hip circumference, and parental history of T2D was collected during the initial interview. For fasting glucose measurements and DNA extraction, 10 mL of intravenous blood was collected; all participants were assessed for TH01 and rs689 markers.
For the replication of rs689, a case–control study including 593 cases of T2D and 595 controls (recruited between January 2014 and January 2015 from a tertiary care hospital in Mexico City, Mexico) was designed. Cases were individuals previously diagnosed with T2D according to the American Diabetes Association criteria, who agreed to participate, and were continuously recruited during their routine medical visits. Controls were individuals aged ≥50 years, who attended the same clinics for reasons other than T2D, had fasting blood glucose <100 mg/dL, and agreed to participate in the study. Clinical history and anthropometric and biochemical measurements were recorded for all study participants.
For another replica of rs689, a population-based cross-sectional study was conducted in 2005 patients (recruited between July and December 2017 from a hospital in Puebla, Mexico). Healthy individuals were invited to participate through flyers distributed in the hospital’s neighborhood. The participants were surveyed to assess T2D risk factors, and anthropometric measurements were collected. A blood sample was drawn to measure glycated hemoglobin (A1c) and assess DNA polymorphisms; individuals with A1c ≥6.5 were considered diabetic, and those with A1c <6.5 were considered non-diabetic. The SNP, rs689 was genotyped in all patients with T2D (n = 104), those with pre-diabetes (n = 529; A1c ≥5.7 to <6.5), and controls aged ≥40 years (n = 539; A1c <5.7). Those with pre-diabetes and controls were included in the non-diabetic group (n = 1068).
The associations of TH01 microsatellite and rs689 were investigated using linear regression models. For the analysis, the presence or absence of T2D was considered the dependent variable, and the values of the alleles or genotypes were considered the explanatory variables.
Ethics
The protocol for the main study was approved by the local ethics committee of each study site and the Federal Commission for the Protection Against Health Risks (COFEPRIS) (CAS/OR/CMN/113300410D0027-0577/2012). The clinical replica study was approved by the Ethics and Research Committees of the Comisión Nacional de Investigación Científica of the Instituto Mexicano del Seguro Social (IMSS R-2014-785-005), whilst the cross-sectional study was approved by the Ethics and Research Committees of the Hospital General de Puebla “Ignacio Romero Vargas” (68/ENS/INV/REV/2017). All three protocols complied with the Declaration of Helsinki and local ethical guidelines for clinical studies in Mexico. Written informed consent was obtained from all participants prior to their inclusion in the study.
TH01 and rs689 SNP genotyping
All DNA samples of the main study were genotyped for TH01 and all samples from the replica studies were genotyped for SNP rs689. The TH01 microsatellite was assessed by polymerase chain reaction (PCR) using a fluorescence-labelled primer (5′- GTGGGCTGAAAAGCTCCCGATTAT-3′) and an unlabeled primer (5′- ATTCAAAGGGTATCTGGGCTCTGG-3′). Subsequently, alleles were identified by number of repeats (R) identified during capillary electrophoresis using the SeqStudio Genetic Analyzer (ThermoFisher Scientific, Waltham, MA, USA). Genotyping of the rs689 SNP was performed using the allelic discrimination assay-by-design TaqMan® method (C_1223317_10) on 384-well plates analyzed on the QuantStudio™ 12 K Flex Real-Time PCR System (ThermoFisher Scientific). The genotypes were analyzed using Genotyper™ software v1.3 (ThermoFisher Scientific).
Statistical analyses
For the sample size calculation, MLR models were considered to have good performance when a baseline of 100 cases (i.e., the ULR models) and 10–15 additional cases for each variable were introduced into the model [45]. Given that only two variables were to be introduced in the MLR models, we calculated that a minimum number of 115 cases were needed in the comparison groups of the main case–control study in which both loci were studied.
Baseline characteristics and insulin concentration were summarized using mean ± SD or median and interquartile range (25%–75%). To assess the statistical significance of intergroup differences, the Student’s t-test was used for mean values, and the Mann–Whitney U test for median values. We investigated whether the frequency of genotypes was distributed according to the Hardy–Weinberg law based on the allelic frequency and the formula: (a + b)2 = a2 + 2ab + b2, where a and b are the allelic frequencies in the control group. The statistical significance of the differences in the distribution of genotypes between the observed and expected results was calculated using the chi-square test, and the linkage disequilibrium between the TH01 and rs689 SNP alleles was calculated using Arlequin (version 3.5.2.2; http://cmpg.unibe.ch/software/arlequin35/) [46]. The trend of fasting plasma insulin concentration was analyzed according to participant age; significance was calculated using the Pearson correlation test.
Univariate and multivariate logistic regression models were used to calculate the risk or protection conferred by the TH01 microsatellite and VNTR loci for T2D development. The analysis of both DNA polymorphism variables was stratified by sex and age at T2D diagnosis. For these subgroups, cases (≤45 and ≥46 years) and controls (≤54 and ≥55 years) were categorized by their median age at diagnosis. The risk conferred by each factor (explanatory variables) was calculated by comparing the cases and controls using univariate logistic regression models. The reference indicators of the explanatory variables TH01 and rs689, were TH01 alleles with ≤7R and rs689 = T. The association was expressed as the OR and 95% CI, and the contribution to the variability of T2D was expressed as Nagelkerke’s R2. The factors were included successively in the model in different blocks. The contribution of each factor to the model was assessed by the increase of R2 and the decrease in the −2 log likelihood ratio value from one block to the next; the Omnibus test was used to determine statistical significance between the successive blocks.
A post-hoc power analysis was performed for each logistic regression model using the G*Power software version 3.1.9.7 (https://www.psychologie.hhu.de/arbeitsgruppen/allgemeine-psychologie-und-arbeitspsychologie/gpower), and when considering the sample size, OR, the probability of the event in the control group, and an α = 0.05 [47]. In addition, for multivariate linear regression models, the value of the total r2 obtained at the end of the model was introduced for the power calculation. All statistical tests were two-sided, and the level of significance was set at P < 0.05. The statistical analyses were conducted using SPSS software version 25 (IBM Corp., Armonk, NY, USA).
Funding
This work was supported by the Carlos Slim Foundation, the Laboratorio Huella Génica, and the Faculty of Medicine of the National Autonomous University of Mexico (UNAM).
Data Availability
All data produced in the present study are available upon reasonable request to the authors.
Conflict of Interest Statement
All authors have nothing to disclose.
Abbreviations
- A1c,
- glycated hemoglobin
- CI
- confidence interval
- D’
- Tajima’s D
- DMS
- Diabetes in Mexico Study
- INS
- insulin
- L
- large
- MLR
- multivariate logistic regression
- OR
- odds ratio
- R
- repeats
- S
- small
- SD
- standard deviation
- SNP
- single nucleotide polymorphism
- T2D
- type 2 diabetes
- ULR
- univariate logistic regression
- VNTR
- variable number of tandem repeats
Acknowledgements
The authors wish to thank Aafreen Saiyed of Edanz (www.edanz.com) for providing editorial support for this manuscript, in accordance with Good Publication Practice (GPP3) guidelines (http://www.ismpp.org/gpp3).