Introduction

figure a

Associations between birthweight and later type 2 diabetes mellitus are well documented [1,2,3,4]. In 1991, Hales et al observed an inverse association between birthweight and the prevalence of type 2 diabetes among men aged 64 years in the UK [1]. By contrast, in 1994, we reported a non-linear association between birthweight and the prevalence of type 2 diabetes in Pima Indians aged 20–39 years, with greater type 2 diabetes prevalence observed in people with low or high birthweight (U shaped association) [2]. These earlier findings, and those of many subsequent studies [3, 4], however, were based largely on prevalence data obtained from adults (≥40 years in many cases) and thus may not help to explain the currently increasing incidence of type 2 diabetes in the younger population [5, 6].

The association between birthweight and later type 2 diabetes has been attributed to environmental factors [7] and genetic factors [2, 8,9,10,11]. Our previous report of the association between paternal diabetes and low birthweight provided early evidence on the potential influence of genetic factors on the association between birthweight and later type 2 diabetes [10]. In support of this observation, some type 2 diabetes susceptibility genetic variants were found to be associated with either low birthweight (e.g. ADCY5, CDKAL1 and HHEX-IDE) [12, 13] or high birthweight (e.g. ANK1, MTNR1B and ABCC8) [12, 14]. However, these genetic variants have thus far accounted for <5% of heritability for type 2 diabetes [13].

Pima Indians living in south-western USA have a high prevalence of type 2 diabetes [15, 16], and an increasing incidence of type 2 diabetes in youth (aged <25 years) [17, 18]. In this population, we estimated the effect of birthweight on the incidence of early-onset type 2 diabetes (aged <40 years). We also determined the extent to which the aggregate contribution of established type 2 diabetes susceptibility genetic variants modifies the effect of birthweight on the risk of early-onset type 2 diabetes.

Methods

Study design and participants

Residents of the Gila River Indian Community in Arizona, aged ≥5 years, participated in a longitudinal study of diabetes and related complications from 1965 to 2007. Diabetes in this community is thought to be exclusively type 2 diabetes, with no evidence of the autoimmunity typical of type 1 diabetes, even in those with very early onset [19]. Participants were asked to undertake standardised health examinations biennially. The present analysis comprised participants who met the following criteria: (1) were singleton births and had a record of birthweight; (2) reported at least 50% heritage from the Pima or closely related Tohono O’odham Indian tribe; (3) had at least two research examinations, each of which included an OGTT and measurements of height and weight and (4) had genome-wide genotypic data and complete data on other covariates excluding paternal variables (see the description of covariates below). This study was approved by the Institutional Review Board of the National Institute of Diabetes and Digestive and Kidney Diseases. Participants aged <18 years gave their assent to participate in the study, in addition to written informed consent obtained from parents. Participants aged ≥18 years provided written informed consent.

Data collection

Data on birthweight were obtained from birth certificates and review of medical records. Anthropometric and biochemical data on participants and their parents were obtained from their respective research examinations. BMI was calculated from height and weight measured while participants were dressed in light clothing and without shoes. A modified OGTT was conducted by measuring the plasma glucose at 2 h following a 75 g oral load of glucose. Diabetes diagnoses were ascertained either from the longitudinal survey or from clinical records, as described previously [16]. Briefly, survey diagnoses were ascertained through research examinations using the World Health Organization criteria for epidemiological studies (i.e. a 2 h plasma glucose concentration ≥11.1 mmol/l [200 mg/dl]) [20]. Clinical diagnoses were ascertained through review of medical records of participants for any diabetes diagnosis between research examinations and were typically based on a 2 h glucose concentration ≥11.1 mmol/l, a casual glucose concentration ≥11.1 mmol/l or a fasting glucose concentration meeting the prevailing criteria for diabetes diagnosis. The same methods were used to ascertain maternal and paternal diabetes prior to the birth of the participant (i.e. parental diabetes was diagnosed by OGTT or documented clinical diagnosis rather than by self-report). Data from maternal glucose tolerance tests during pregnancy were available for only 29% of participants; thus we did not analyse gestational diabetes defined on this basis. Moreover, we did not use fasting plasma glucose or HbA1c to diagnose type 2 diabetes as these were not routinely measured in the early years of the study. Genotypic data were available for 211 established type 2 diabetes susceptibility genetic variants from a genome-wide association study, using methods described previously [21, 22].

Statistical analysis

Participant and parental characteristics at the first research examination and the distribution of birthweight and BMI according to these characteristics were summarised using descriptive statistics (including median, 25th and 75th centiles for non-normally distributed variables), and compared using unpaired t test, Wilcoxon rank-sum test, or χ2 test. Birthweight quintiles were generated using the PROC RANK statement in SAS. This procedure symmetrically divided an ordered distribution of the birthweight of all participants into five approximately equal strata. BMI was age- and sex-standardised to the study population to obtain a population-specific z score. This standardisation procedure involved the following: (1) identifying sex-specific age-adjusted regression models that provide the best fit for BMI (i.e. log10BMI = β0 + β1age + β2age2 for male sex and log10BMI = β0 + β1age + β2age2 + β3age3 for female sex); (2) obtaining residuals of BMI at the first examination from sex-specific baseline models; (3) computing residuals of BMI at follow-up examinations from these baseline models and (4) standardising all residual values to obtain z scores with mean = 0 and SD = 1.

We constructed a type 2 diabetes susceptibility genetic risk score (GRS) for 211 established type 2 diabetes susceptibility genetic variants, which had been previously identified in type 2 diabetes genome-wide association studies of trans-ethnic populations [23], using methods described previously [21]. Briefly, the GRS was computed by summing the number of type 2 diabetes risk alleles, weighted by the logarithms of the published allelic odds ratios, across all 211 loci.

The period of risk began at the first research examination and ended with a diagnosis of diabetes or last biennial examination before the age of 40 years, whichever came first. A censoring age of 40 years was selected because there was little follow-up beyond 40 years of age among participants with known birthweights. Age- and sex-specific incidence rates of diabetes were computed per 1000 person-years of follow-up. To determine the effect of birthweight on diabetes incidence, Poisson regression models were constructed. Models were adjusted for sex, BMI z score (fitted as time-varying), the GRS and maternal variables, including race/ethnicity, age at the birth of the participant and having diabetes prior to the birth of the participant. Multivariable models were not adjusted for paternal variables because of limited data.

Effects of birthweight on diabetes, assessed as incidence rate ratios (IRRs), were not uniform over time, and in some regression models the effects were U shaped. Therefore, all analyses were stratified by decade of age during follow-up, while in some analyses, birthweight was analysed as quintiles with the middle quintile as the reference category. When birthweight was analysed as a continuous variable, a quadratic term (the square of birthweight) was included in the model when statistically significant (at p < 0.05) to account for the U shape. Models were tested for potential interactions of birthweight with significant covariates by including product terms. In a model comprising both linear and quadratic terms for birthweight, two product terms were fitted and a chunk test was conducted to determine their statistical significance. Models were also evaluated at different values of significant covariates to illustrate the joint effects of these variables with birthweight on the incidence of diabetes.

We investigated the interaction effects of two specific type 2 diabetes susceptibility genetic variants, maternally derived C allele of KCNQ1 SNP rs2299620 (the most significant type 2 diabetes susceptibility genetic variant in this study population) [24] and a novel R1420H in ABCC8 (associated with both high birthweight and type 2 diabetes in this study population [14]), on the association between birthweight and type 2 diabetes.

Sensitivity analyses were conducted to ascertain the validity of risk estimates. This process included restricting data to participants who were not offspring of a diabetic woman to remove any confounding by exposure to an intrauterine diabetic environment. The effect of using BMI as the measure of adiposity rather than absolute weight and height was investigated by comparing estimates of these variables in regression models. A similar approach was used to compare the effect of using weighted GRS rather than raw (all alleles weighted equally) GRS. Moreover, we compared risk estimates obtained from incidence analyses where the period of risk began at the first research examination when the participant did not have diabetes, with those obtained from incidence analyses where the period of risk began at birth and included participants who had diabetes at their first examinations.

All analyses were performed using SAS (version 9.4; SAS Institute, Cary, NC, USA). A two-sided p < 0.05 was considered statistically significant.

Results

Participant characteristics at the first research examination

This study comprised 3039 participants (56% female sex, median birthweight 3.45 kg, 3% offspring of women with type 2 diabetes; Table 1). Male participants, participants who were younger or participants who were offspring of older fathers, of a diabetic mother or of mothers who were older or not full American Indians, were heavier at birth (p < 0.05; Table 1). At the first research examination (median age 8.6 years, Q1 = 6.8, Q3 = 11.4), participants with greater birthweight, offspring of a diabetic mother, and offspring of fathers with diabetes before the birth of the child, had greater BMI z scores (p < 0.05; Table 1). Similarly, participants who were born more recently had greater BMI z scores (p < 0.001; see ESM Table 1 for secular trends in other participant characteristics). Tertiles of the GRS for 211 established type 2 diabetes susceptibility genetic variants were not significantly related to weight at birth or BMI z score at the first examination.

Table 1 Participant characteristics and distributions of birthweight and BMI at the first research examination

Diabetes incidence

Among 3039 participants who were followed for a median of 14.3 years (quartile 1 = 8.0, quartile 3 = 24.3) from the first research examination, there were 652 incident cases of diabetes (13.4 per 1000 person-years). Of these incident cases, 420 (64.4%) were ascertained through the longitudinal survey and 232 (35.6%) through review of clinical records. Crude diabetes cumulative incidence (Fig. 1) and incidence density (Table 2) increased with age and were greater in participants in the lowest and the highest birthweight quintiles. Comparing diabetes incidence in these two high-risk groups, those in the highest birthweight quintile had a greater cumulative incidence from age 10 years to 25 years, after which their cumulative incidence was surpassed by those in the lowest birthweight quintile (Fig. 1). This time-varying effect of birthweight on diabetes risk persisted when adjusted for sex, year of birth, GRS, maternal race/ethnicity and maternal age at the birth of the participant, with a U-shaped effect of birthweight on diabetes incidence at 10–19 years (p < 0.001) but inverse linear effects at 20–29 years (p = 0.027) and at 30–39 years (p = 0.042) (Fig. 2a–c). After further adjustment for BMI and maternal diabetes, the effect of birthweight on diabetes incidence remained U shaped, with greater risk observed in participants with the lowest (IRR 1.79 [95% CI 1.02, 3.14]) and highest (IRR 1.50 [95% CI 0.87, 2.59]) birthweight quintiles (Fig. 2d). After this adjustment, the inverse linear effects at older ages were more pronounced (p < 0.001 at 20–29 years and p = 0.003 at 30–39 years), with a greater risk observed in participants in lower birthweight quintiles only (20–29 years, quintile 1: IRR 1.50 [95% CI 1.05, 2.15]; 30–39 years, quintile 1: IRR 1.50 [95% CI 1.04, 2.16] and quintile 2: IRR 1.16 [95% CI 0.79, 1.71]) (Fig. 2e, f). The effect of birthweight on diabetes incidence could not be reliably determined at age 5–9 years because there were only seven incident cases of diabetes (Table 2).

Fig. 1
figure 1

Crude cumulative incidence (incidence proportion from the first research examination) of type 2 diabetes by birthweight quintile. Cumulative incidence (unadjusted for covariates), where the period of risk began at the first research examination when the participant did not have diabetes, was computed independently in each group of participants defined by birthweight quintile

Table 2 Crude incidence rates of diabetes by birthweight quintiles and age group during follow-up
Fig. 2
figure 2

Effect of birthweight on the risk of type 2 diabetes. (ac) Risk of type 2 diabetes (adjusted for all participant and maternal characteristics except BMI of participant and maternal diabetes) at age 10–19 years (a; n = 2912, p < 0.001), 20–29 years (b; n = 1989, p = 0.027) and 30–39 years (c; n = 1079, p = 0.042). (df) Risk of type 2 diabetes (adjusted for all participant and maternal characteristics including BMI of participant and maternal diabetes) at age 10–19 years (d; n = 2912, p < 0.001), 20–29 years (e; n = 1989, p < 0.001) and 30–39 years (f; n = 1079, p = 0.003). The height of each bar represents the point estimate of the IRR and the error bars represent its 95% CI

Other factors that increased diabetes risk included greater BMI z score, higher GRS for type 2 diabetes and maternal diabetes (ESM Table 2). These covariates did not confound or interact with the effect of birthweight on diabetes incidence at all age intervals (p ≥ 0.05 for all interactions), although they independently added information about diabetes incidence. This was illustrated by results of a model of diabetes IRRs evaluated at age 10–19 years for variables such as increasing quintiles of birthweight, increasing tertiles of BMI z score, a GRS below or above the median and the presence or absence of maternal diabetes in pregnancy (Fig. 3). While the rate ratios were uniform, the absolute rate differences were, of course, much greater in those at greater risk from these variables, especially maternal diabetes.

Fig. 3
figure 3

Joint effects of BMI, type 2 diabetes GRS and maternal diabetes with birthweight on the incidence rate (per 1000 person-years) of type 2 diabetes at age 10–19 years. (a) Offspring of a non-diabetic woman with a low (below the median) type 2 diabetes GRS. (b) Offspring of a non-diabetic woman with a high (at or above the median) type 2 diabetes GRS. (c) Offspring of a diabetic woman with a low type 2 diabetes GRS. (d) Offspring of a diabetic woman with a high type 2 diabetes GRS. py, person-years

However, there was no difference in the incidence ratios associated with birthweight categories between those with and without exposure to maternal diabetes, as illustrated in Fig. 3 (ratios comparing incidence rates shown in Fig. 3a vs Fig. 3b are the same as those comparing rates in Fig. 3c vs Fig. 3d). Type 2 diabetes GRS predicted diabetes incidence in each age group, with or without adjustment for the other diabetes risk factors (IRRs ranged from 1.30 to 1.40 per SD of the GRS; ESM Table 2).

Since the GRS was based on genetic variants associated with type 2 diabetes in non-American Indian populations, variations in ABCC8 and KCNQ1, which have a large effect on type 2 diabetes in the present study population, were also analysed. Estimates of the effect of birthweight on diabetes incidence in models adjusted for the single variants, a novel R1420H in ABCC8 and rs2299620 in KCNQ1 (ESM Table 3), were similar to those of models adjusted for type 2 diabetes GRS (ESM Table 2). However, for ABCC8, there was a significant interaction effect of the R1420H variant with the association between birthweight and diabetes incidence at age 20–29 years (p = 0.006), with a positive linear effect of birthweight on diabetes incidence at 20–29 years among heterozygous carriers of the H allele (IRR 2.68 per kg [95% CI 1.09, 6.60]; n = 108) and a negative linear effect among non-carriers of the H allele (IRR 0.60 per kg [95% CI 0.46, 0.77]; n = 2931). However, there was no significant interaction of the ABCC8 R1420H variant with the effect of birthweight on type 2 diabetes incidence at age 10–19 years or 30–39 years. The KCNQ1 variant did not interact with the effect of birthweight on type 2 diabetes incidence in any age group.

Sensitivity analyses

Estimates of models restricted to data from participants who were offspring of a non-diabetic woman were similar to those including all participants. Comparing effects of two measures of adiposity, there were no meaningful differences between models comprising BMI and those comprising absolute weight and height. Similarly, findings of regression models on the association of weighted GRS with birthweight and type 2 diabetes incidence, and on the impact of weighted GRS on birthweight-associated type 2 diabetes risk, were similar to those of models in which raw GRS was used. Moreover, in analyses in which the period of risk began at birth (ESM Fig. 1), estimates of the effect of birthweight on diabetes incidence were similar to those observed in analyses where the period of risk began at the first research examination without diabetes (Fig. 1).

Discussion

Statement of principal findings

Birthweight had differential effects on early-onset diabetes, with the lowest and highest birthweight quintiles increasing type 2 diabetes risk during adolescence (10–19 years). In contrast to the effect of low birthweight, the effect of high birthweight on annual incidence rates of type 2 diabetes did not persist into adulthood (20–39 years), although the cumulative incidence remained greater in this quintile than in the middle three quintiles because of the greater incidence of type 2 diabetes in the earlier years. An aggregate of 221 established type 2 diabetes susceptibility genetic variants that were identified in genome-wide studies of non-American Indian populations independently predicted the risk of early-onset type 2 diabetes and added to the effect of birthweight on type 2 diabetes risk in our south-western American Indian population. However, it was not significantly associated with birthweight and did not confound or modify the effect of birthweight on type 2 diabetes risk. Similarly, greater BMI during follow-up and being exposed to a diabetic intrauterine environment independently predicted type 2 diabetes risk and added to the effect of birthweight on type 2 diabetes risk.

Strengths and weaknesses of the study

The strength of this study is its prospective and long-term ascertainment of diabetes using definitive tests of blood glucose control. This allows for a more reliable estimate of the incidence and the time of diabetes onset, thereby helping to clarify inconsistencies in existing evidence. However, our study is also limited by the non-adjustment of birthweight for the length at birth and gestational age, due to the lack of reliable data on these two covariates. As data from maternal glucose tolerance tests during the third trimester were not generally available, the maternal diabetes status included in our analyses largely reflects diabetes occurring prior to pregnancy, rather than gestational diabetes. However, in this study population, offspring of mothers with overt diabetes (largely developing before pregnancy) have a much greater incidence of early-onset type 2 diabetes than offspring whose mothers had impaired glucose tolerance (7.8–11.0 mmol/l) in the third trimester that is typical of gestational diabetes [25]. We did not report data on other indicators of abormal glucose regulation, such as plasma insulin levels, which were not measured throughout the study. However, in a previous analysis of children and young adults (aged 5–29 years) from this study population, birthweight was negatively associated with both fasting and 2 h insulin levels [26].

Strength and weaknesses in relation to other studies

Previous studies of the association between birthweight and type 2 diabetes were largely conducted in older adults (≥40 years) and findings were based on prevalence data which do not account for the time of diabetes onset. In most of these prior studies, low birthweight (but not high birthweight) was associated with greater type 2 diabetes risk [3, 4]. However, in a few studies, high (but not low) birthweight [27], or both low and high birthweight [2, 28, 29], were associated with greater type 2 diabetes risk. In a study of adults in Mysore, India, the prevalence of diabetes was not significantly related to birthweight but was greatest in those who were short at birth and had greater ponderal index (birthweight/length3) [30]. The U-shaped effect of birthweight on adolescence-onset type 2 diabetes risk observed in the present study was also seen in our previous study of the prevalence of type 2 diabetes in Pima Indians [2]; the present study extends these observations in showing that the increased risk associated with high birthweight occurs primarily at younger ages. A U-shaped association was also observed in prior studies conducted in children (aged ≤18 years) from Taiwan and a Canadian indigenous population [28, 29]. In China, where there is a relatively high prevalence and increasing incidence of early-onset type 2 diabetes [31], birthweight had a U-shaped association with fasting plasma insulin and HOMA-IR levels among adolescents (mean age 15 years) [32]. Similarly, the association between high birthweight and greater type 2 diabetes risk observed in the present study was also observed in a study conducted among children and adults (aged 10–44 years) in a Canadian indigenous population with a high prevalence of type 2 diabetes [29]. Findings of the present and those of prior studies suggest that high birthweight (in addition to low birthweight) has an impact on type 2 diabetes risk during childhood, especially in populations with an exceptionally high type 2 diabetes prevalence.

Meaning of the study: possible explanations and implications for clinicians and policy makers

The association between low birthweight and greater type 2 diabetes risk has been attributed to several factors: (1) the effects of nutritional insufficiency in utero on fetal growth and fetal programming of the endocrine system [7, 33]; (2) selective survival of small babies prone to insulin resistance [2] and (3) genetic variants mediating fetal growth and impairment in insulin secretion or sensitivity [9, 34, 35]. The observed U-shaped association between birthweight and the risk of adolescence-onset diabetes may be driven by the high prevalence of maternal diabetes-mediated fetal macrosomia in this study population [36]. This association was also observed in a Canadian aboriginal population with a high prevalence of type 2 diabetes in children and young adults [27, 29]. These findings highlight the role of gene–intrauterine environment interaction on the association between high birthweight and later type 2 diabetes risk.

The additional effect of higher GRS, greater BMI during follow-up and exposure to a diabetic intrauterine environment on the incidence of early-onset diabetes in the present study conforms with our previous discovery of a trans-generational cycle of obesity and type 2 diabetes [37]. In this vicious cycle, offspring of a diabetic woman have a greater risk of early obesity and type 2 diabetes, especially during their reproductive age. When this occurs in female offspring of a diabetic woman, the cycle repeats, thereby causing an increasing prevalence of type 2 diabetes in subsequent generations. This finding emphasises the need for preventing or delaying diabetes in women before or during their childbearing years. Given the increasing incidence of early-onset type 2 diabetes and its preponderance in the female sex in most regions of the world and across all race/ethnicities [6], the current findings are highly relevant to other populations. Specifically, our findings may have direct applications to populations with a high prevalence of type 2 diabetes among youth and young adults, such as the indigenous populations of the USA, Canada, Australia and New Zealand [5, 38, 39], and the Chinese population [31].

Unanswered questions and future research

Genetic variants underlying the association between birthweight and type 2 diabetes are still not well understood. In the present study, aggregate contribution of common type 2 diabetes susceptibility genetic variants identified in trans-ethnic populations, including variants also known to be associated with birthweight (e.g. ADCY5, CDKAL1, HHEX-IDE, ANK1, MTNR1B and ABCC8) [12,13,14], did not modify the effect of birthweight on type 2 diabetes risk. However, a novel R1420H variant in ABCC8, reported to have a large effect on the risk of type 2 diabetes and high birthweight in Pima Indians [14], significantly modified the direction of effect of birthweight on type 2 diabetes incidence at age 20–29 years, with a negative linear effect observed in participants without the ABCC8 risk genotype and a positive linear effect in those with the ABCC8 genotype. However, this finding did not replicate in other age groups and may be limited by the small number of diabetes cases observed (n = 42) in the 108 participants with the ABCC8 mutation. Therefore, further investigation into the genetic mechanisms underlying these associations is needed. For example, this may involve exploring the role of altered gene expression mediated by epigenetic modifications in an adverse intrauterine environment [40].

Conclusion

We report a novel finding of differential effects of birthweight on type 2 diabetes risk in a population with a high prevalence of early-onset (aged <40 years) type 2 diabetes, in which birthweight had a U-shaped effect on type 2 diabetes risk in adolescents (aged 10–19 years) but a negative linear effect in young adults (aged 20–39 years). Higher type 2 diabetes GRS, greater BMI and being an offspring of a diabetic woman added to the risk of early-onset type 2 diabetes but did not modify or explain the effect of birthweight on type 2 diabetes incidence. High birthweight was associated with markedly elevated risk of type 2 diabetes at age 10–19 years, but not at older ages. This suggests that the effect of high birthweight on type 2 diabetes risk, reported only in populations with a high prevalence of type 2 diabetes, occurs largely at young ages and may be driven by the inter-generational vicious cycle of type 2 diabetes and obesity in these populations [37].