The integration of principles and procedures underlying population genetics and epidemiology provides a potential research framework to make causal inferences about the association between a risk factor and disease [1, 2]. In an epidemiological context, numerous biomarkers have been linked to the development of type 2 diabetes. In this issue of Diabetologia, Herder and colleagues assess the prospective association between circulating levels of macrophage migration inhibitory factor and risk of type 2 diabetes [3]. Their results provide suggestive statistical evidence for a sex-specific association between this biomarker and risk of type 2 diabetes. This observation is consistent with the current paradigm that inflammation plays a role in the pathogenesis of insulin resistance and type 2 diabetes [4].
The association might be causal. Alternatively, given the observational nature of epidemiological research, it may also be completely explained by confounding. In this scenario, the relationship is explained by a third factor that is associated with the biomarker and disease risk. That is, the biomarker is not independently or causally linked to disease risk. Epidemiological approaches generally use multivariable statistical analysis to reduce confounding. However, because of residual confounding (as a result of measurement error) and unknown confounders, standard observational epidemiology cannot resolve whether an observed association has a causal basis. The conceptual ambiguity of any disease mechanism (specifically, defining mediators and confounders) also limits statistical modelling to reduce confounding. Furthermore, the observed association may be due to reverse association/causation—that is, the relationship between biomarker and disease could be the result of undiagnosed or early disease rather than the risk factor, and thus a consequence of disease rather than a cause.
‘Mendelian randomisation’—the random assortment of genes from parents to offspring that occurs during gamete formation and conception [1]—provides an epidemiological approach that is much less susceptible to confounding by classical or environmental risk factors and excludes reverse causality as a possible non-causal explanation for the observed association between a biomarker and disease risk [5]. In this research framework, the inter-relationship and consistency of associations among genetic variants that encode or regulate a biomarker, the expression or circulating levels of the biomarker, and disease risk may characterise the true magnitude of the relationship between the biomarker and risk of disease (Fig. 1) [5–7]. Reproducible evidence for a statistical association between a genetic variant and biomarker provides the basis for a Mendelian randomisation study. Because of independent assortment [1], characteristics that may confound any association between a biomarker and disease are equally distributed among the relevant genetic variants. Thus, a comparison of groups of individuals defined by a genetic variant, based on a Mendelian randomisation design, is equivalent to a randomised comparison, with only the relevant biomarker differing across the relevant genetic variant [8, 9]. Examining the relationship between variants in the genome that show unequivocal associations with the relevant biomarker and disease risk is therefore a potential method of assessing whether a biomarker might be causally linked to disease. In this context, Herder et al. [3], based on specificity and directional consistency of associations among genetic variants, biomarker and disease risk, provide some evidence for a causal link between circulating levels of macrophage migration inhibitory factor and risk of type 2 diabetes. However, much more robust statistical evidence is required to make causal inferences using this approach [6, 10–12].
Mendelian randomisation studies, like other genetic epidemiological studies, require reliable identification of statistical associations between genetic variants and biomarkers and disease risk [12]. Importantly, the principal requirement is the ability of Mendelian randomisation studies to detect an association of equivalent magnitude to that predicted by a proportional change in the relevant trait or risk factor. Given that individual genetic variants are likely to explain only a very small proportion of the variation in a biomarker trait or risk factor, these assessments will require very large sample sizes, which are probably beyond the scale of existing studies or collaborations in type 2 diabetes genetics [13]. This statistical limitation is compounded by random measurement error in the assessment of the magnitude of the associations among the genetic variant, biomarker and disease risk, which can lead to attenuated association signals. To help overcome issues of statistical resolution, investigators have opted to use meta-analysis [8]. However, using this approach for Mendelian randomisation triangulation has several limitations, which are generic to this strategy [14]. Heterogeneity among studies (resulting from, for example, potential gene–gene or gene–environment interactions) may distort assessments of the magnitude of the associations [6]. This potential limitation may be exacerbated when different studies are used for data aggregation for each side of the Mendelian randomisation triangle (Fig. 1).
It is therefore likely that international collaborative frameworks for Mendelian randomisation studies of type 2 diabetes will be needed to achieve the required level of statistical precision. For example, reproducible statistical associations between genetic variants and risk of type 2 diabetes show an effect size of around 10–30% increased risk per allele [13]. Under a log-additive model, around 15,000 cases and 15,000 controls are required to detect a per allele risk of 10% (with an α-level of 1 × 10−4 and 90% power) for a relatively common genetic variant (20% allele frequency). The association between genetic variants and type 2 diabetes risk may be even smaller for those variants which also show reproducible associations with relevant biomarkers [15]. Thus, it is likely that Mendelian randomisation studies will need much greater statistical resolution. To facilitate this requirement, improved statistical approaches for meta-analysing data across studies are in development [16]. However, data pooling strategies have distinct advantages in this context, allowing enhanced data harmonisation and statistical testing, including haplotype analysis and imputational methods to increase the comparability of markers across studies.
There are other caveats. Genetic confounding and biological compensation may also limit inferences in Mendelian randomisation studies. Biological compensation, also referred to as canalisation, is defined as a developmental and physiological adaptation to a genetic difference, and could lead to inconsistencies in the magnitude of the triangulated associations [17]. This physiological adaptation, among other important differences, distinguishes Mendelian randomisation studies from randomised controlled trials, which are not susceptible to this limitation [6]. Importantly, the pleiotropic nature (multiple biological effects) of some genetic variants may produce confounded associations between genetic variants and phenotypes. Undetected population stratification (confounding), where genetic differences and trait associations may result from underlying differences in ancestry, may also distort the magnitude of gene–biomarker and gene–disease associations. By contrast, correlation among genetic variants (linkage disequilibrium) can be used to examine associations among these variants, traits and diseases, without directly ascertaining functional variants. However, in the same context, correlation with pleiotropic genetic variants, for example, could distort associations.
Despite these limitations, Mendelian randomisation provides a potential research framework to assess causal links between biological and environmental phenotypes and disease risk. These studies, when correctly performed, will provide insights into aetiological mechanisms and causality, informing potential therapeutic and preventative strategies.
References
Davey Smith G, Ebrahim S (2003) ‘Mendelian randomization’: can genetic epidemiology contribute to understanding environmental determinants of disease? Int J Epidemiol 32:1–22
Clayton D, McKeigue PM (2001) Epidemiological methods for studying genes and environmental factors in complex diseases. Lancet 358:1356–1360
Herder C, Klopp N, Baumert J et al (2008) Effect of macrophage migration inhibitory factor (MIF) gene variants and MIF serum concentrations on the risk of type 2 diabetes: results from the MONICA/KORA Augsburg Case–Cohort Study, 1984–2002. Diabetologia DOI 10.1007/s00125-007-0800-3
Wellen KE, Hotamisligil GS (2005) Inflammation, stress, and diabetes. J Clin Invest 115:1111–1119
Lawlor DA, Harbord RM, Sterne JA, Timpson N, Davey Smith G (2007) Mendelian randomization: using genes as instruments for making causal inferences in epidemiology. Stat Med DOI 10.1002/sim.3034
Nitsch D, Molokhia M, Smeeth L, DeStavola BL, Whittaker JC, Leon DA (2006) Limits to causal inference based on Mendelian randomization: a comparison with randomized controlled trials. Am J Epidemiol 163:397–403
Hingorani A, Humphries S (2005) Nature's randomised trials. Lancet 366:1906–1908
Casas JP, Bautista LE, Smeeth L, Sharma P, Hingorani AD (2005) Homocysteine and stroke: evidence on a causal link from mendelian randomisation. Lancet 365:224–232
Timpson NJ, Lawlor DA, Harbord RM et al (2005) C-reactive protein and its role in metabolic syndrome: mendelian randomisation study. Lancet 366:1954–1959
Minelli C, Thompson JR, Tobin MD, Abrams KR (2004) An integrated approach to the meta-analysis of genetic association studies using Mendelian randomization. Am J Epidemiol 160:445–452
Tobin MD, Minelli C, Burton PR, Thompson JR (2004) Commentary: development of Mendelian randomization: from hypothesis test to ‘Mendelian deconfounding’. Int J Epidemiol 33:26–29
Chanock SJ, Manolio T, Boehnke M et al (2007) Replicating genotype–phenotype associations. Nature 447:655–660
Frayling TM (2007) Genome-wide association studies provide new insights into type 2 diabetes aetiology. Nat Rev Genet 8:657–662
Egger M, Schneider M, Davey Smith G (1998) Spurious precision? Meta-analysis of observational studies. BMJ 316:140–144
Casas JP, Shah T, Cooper J et al (2006) Insight into the nature of the CRP–coronary event association using Mendelian randomization. Int J Epidemiol 35:922–931
Higgins JP, Little J, Ioannidis JP et al (2007) Turning the pump handle: evolving methods for integrating the evidence on gene–disease association. Am J Epidemiol 166:863–866
Davey Smith G, Ebrahim S (2005) What can mendelian randomisation tell us about modifiable behavioural and environmental exposures? BMJ 330:1076–1079
Acknowledgements
The authors would like to thank an anonymous reviewer for insightful comments on this manuscript.
Duality of interest
The authors declare that there is no duality of interest associated with this manuscript.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Sandhu, M.S., Debenham, S.L., Barroso, I. et al. Mendelian randomisation studies of type 2 diabetes: future prospects. Diabetologia 51, 211–213 (2008). https://doi.org/10.1007/s00125-007-0903-x
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00125-007-0903-x