Can increasing years of schooling reduce type 2 diabetes (T2D)?—Evidence from a Mendelian randomization of T2D and 10 of its risk factors

Charleen D. Adams; Brian B. Boutwell

doi:10.1101/2020.02.05.20020701

Abstract

A focus in recent decades has involved examining the potential causal impact of educational attainment (schooling years) on a variety of disease and life-expectancy outcomes. Numerous studies have broadly revealed a link suggesting that as years of formal schooling increase so too does health and wellbeing; however, it is unclear whether the associations are causal. Here we use Mendelian randomization, an instrumental variables technique, to probe whether more years of schooling are causally linked to type 2 diabetes (T2D) and 10 of its risk factors. The results reveal a protective effect of more schooling years against T2D (odds ratio=0.39; 95% confidence interval: 0.26, 0.58; P=3.89 × 10⁻⁰⁶), which might be mediated in part by more years of schooling being protective against the following: having a first-degree relative with diabetes, being overweight, and having high blood pressure, higher levels of circulating triglycerides, and lower levels of HDL cholesterol. More schooling years had no effect on risk for gestational diabetes or polycystic ovarian syndrome and was associated with a decreased likelihood of moderate physical activity. These findings imply that strategies to retain adults in higher education may help reduce the risk for a major source of metabolic morbidity and mortality.

Tacit to most epidemiological research is a desire to infer whether an environmental exposure impacts some outcome in a causal fashion. A particular area of focus in recent decades has involved examining the impact of educational attainment (years of schooling) on a variety of disease and life expectancy outcomes¹. Numerous studies have broadly revealed a strong statistical association suggesting that as the years of formal schooling increase so too does health and wellbeing ². Indeed, educational attainment has been associated with diverse mental and physical health outcomes, including depression, cancer incidence, heart disease, and diabetes¹.

Entangled in this line of inquiry (and all of social science research), however, is a concern about the evidence for causal inference open to scholars³. With regard to the associations between educational attainment and health outcomes, Montez and Friedman (2015) caution: “Studies such as those highlighted above often implicitly assume that educational attainment has a causal influence on adult health; however, this assumption has long been challenged. If the assumption is incorrect then investing in education policies and schooling systems may waste government spending and not manifest in improved population health” (p.1)¹. To be sure, there is emergent evidence utilizing quasi-experimental and natural-experimental designs which suggest some causal effects may exist in some contexts for educational attainment on health outcomes². Yet, there remains an overall dearth of evidence utilizing designs admitting of stronger causal inference capabilities.

More recently, scholars have begun utilizing data gleaned from large genomic consortia and publicly available genome wide association (GWA) studies to construct instrumental variables comprised of trait-relevant single-nucleotide polymorphisms (SNPs). When certain assumptions (discussed below) are satisfied in the data, it is possible to investigate whether some type of modifiable risk or protective factor causally impacts some outcome⁴. Known as Mendelian Randomization (MR), this variant of instrumental variable analysis has been increasingly widely applied to a variety of medical and epidemiological outcomes⁵. In the current study, we apply MR modeling strategies to zoom in on whether educational attainment plays a causal role in the prevention of one of society’s most pressing public-health challenges: type 2 diabetes (T2D) and 10 of its risk factors.

Results

T2D

A strong protective effect against T2D is observed for more Education Years (odds ratio, OR, for T2D per SD increase in Education Years): IVW estimate 0.39; 95% confidence interval (CI): 0.26, 0.58; P=3.89 × 10⁻⁰⁶). The sensitivity estimators aligned in direction and magnitude of effects with the IVW’s estimate, and the MR-Egger intercept test indicated no evidence for directional pleiotropy. (Since this is also the case for all the tests – none showed evidence for direction pleiotropy with the MR-Egger intercept test, this statement will not be repeated for the remaining results).

Sibling, mother, and father with diabetes

Small protective effects against having a sibling, mother, or father with diabetes are observed for more Education Years (ORs for a first-degree relative with diabetes per SD increase in Education Years): sibling IVW estimate 0.97; 95% CI: 0.96, 0.98; P=4.23 × 10⁻¹¹); mother IVW estimate 0.97; 95% CI: 0.96, 0.98; P=6.66 × 10⁻⁷); father IVW estimate 0.98; 95% CI: 0.97, 0.99; P=0.0008. The sensitivity estimators aligned in direction and magnitude of effects with the IVW’s estimate.

Overweight status

A strong protective effect against being overweight is observed for more Education Years (OR for being overweight per SD increase in Education Years): IVW estimate 0.60; 95% CI: 0.51, 0.72; P=1.01 × 10⁻⁰⁸). The sensitivity estimators mostly aligned in direction and magnitude of effects with the IVW’s estimate, with a slightly more protective effect observed for the weighted mode estimator.

Physical activity

A strong protective effect against performing the most amount of moderate physical activity is observed for more Education Years (OR for the highest level of moderate physical activity compared to all other amounts of moderate physical activity per SD increase in Education Years): IVW estimate 0.77; 95% CI: 0.71, 0.84; P=1.08 × 10⁻⁰⁸). The sensitivity estimators varied in the magnitude of their effects, which might indicate unwanted pleiotropy.

High blood pressure

A modest protective effect against having high blood pressure is observed for more Education Years (OR for high blood pressure per SD increase in Education Years): IVW estimate 0.94; 95% CI: 0.92, 0.96; P=2.49 × 10⁻¹⁰). The sensitivity estimators aligned in direction and magnitude of effects with the IVW’s estimate.

Gestational diabetes and polycystic ovarian syndrome

There were null effects for the influence of more Education Years on gestational diabetes and polycystic ovarian syndrome (OR for each per SD increase in Education Years): IVW estimate 1.00; 95% CI: 1.00, 1.00; gestational diabetes, P=0.1705; polycystic ovarian syndrome, P=0.2844. The sensitivity estimators aligned in direction and magnitude of effects with the IVW’s estimate: all = 1.

HDL levels

An increase in HDL levels were observed for more Education Years (beta estimate per SD increase in Education Years): IVW estimate 0.14; 95% CI: 0.06, 0.22; P=0.0009). The sensitivity estimators varied in the magnitude of effects, indicating the potential for some unwanted pleiotropy.

Triglyceride levels

A decrease in triglyceride levels were observed for more Education Years (beta estimate per SD increase in Education Years): IVW estimate −0.19; 95% CI: −0.27, −0.11; P=3.34 × 10⁻⁰⁶). The sensitivity estimators aligned in direction and magnitude of effects with the IVW’s estimate.

Discussion

We observed a protective effect of Education Years against T2D, which might be mediated in part by more years of schooling being protective against the following: having a first-degree relative with diabetes, being overweight, and having high blood pressure, higher levels of circulating triglycerides, and lower levels of HDL cholesterol. These findings comport with another MR study that examined education and diabetes with UK Biobank data. Davies et al. (2018) observed that leaving secondary school at an older age was causally protective against diabetes⁷. Their study differed from the present one in that ours examined education inclusive of college—Davies et al. (2018) focused on education up to college. Here, we document that the protective effect of education extends beyond schooling in adolescence. Years of schooling after high school decrease the chance of T2D.

In the present study, more years of schooling had no effect on risk for gestational diabetes or polycystic ovarian syndrome and was associated with a decreased likelihood of moderate physical activity. Regarding the later, another recent MR study found little evidence that more education increased vigorous physical activity⁸. Thus, it seems unlikely that the protective effect of Education Years against T2D occurs through an influence on physical activity.

The protective effect against having a first-degree relative with diabetes is intriguing. Several recent studies have documented that there is a bidirectional causal relationship between fluid intelligence and years of schooling^9,10. While having higher fluid intelligence may causally impact more years of schooling, the magnitude of the effect for more years of schooling increasing fluid intelligence is comparatively larger: that is, the impact of Education Years on intelligence is more than two-fold greater than the impact of intelligence on Education Years^9,10. Like educational attainment, which is sometimes treated as a proxy for cognitive ability, being brighter is protective against an array of negative health-outcomes¹¹. This means that it is possible that intelligence is confounding the present findings, especially those pertaining to a protective effect of more years of schooling against having a first-degree relative with diabetes. However, due to the durable influence of educational attainment on intelligence, it is also conceivable that those with more education positively influence their family members in ways that reduce risk for T2D.

One limitation for the analyses of Education Years on a first-degree relative with diabetes is that it is possible that some cases of type 1 diabetes were included, since the UK Biobank questions that captured the measure for illnesses of relatives asked about “diabetes” – not specifically about T2D. However, the influence for this is expected to be minimal, since more than 90% of adults with diabetes have T2D¹².

The primary limitation of the present study is one that all MR studies are liable to: unwanted horizontal pleiotropy. However, the most logical pleiotropic confounder—intelligence—is one that is influenced by Education Years. Moreover, most of the sensitivity screens for possible violations to the MR assumptions revealed little evidence for distortions due to pleiotropy. The exceptions are for HDL levels and physical activity, for which there was enough variability across the sensitivity estimators to view their results with more caution. A strength of our study worth mentioning is that it leveraged the power of 11 large GWA studies to examine these complexly woven traits.

The public-health relevance of the bidirectional causal relationship between intelligence and Education Years cannot be overstated, however. If the present findings primarily reflect the benefits of higher cognitive ability—which they could—then whether Education Years influences cognitive ability determines interventional strategies. Because Education Years increases cognitive ability, public-health efforts to retain people in higher education may be warranted as part of a developing arsenal to help limit and even prevent the staggeringly deleterious effects of T2D. The message is the same, importantly, even if intelligence is not the driving force in the current study. Whatever it is about the landscape of higher education, more years of schooling appears to help reduce the risk for a major source of metabolic morbidity and mortality.

Methods

Conceptual approach

MR is an analytic, instrumental variables technique that capitalizes on Mendel’s Laws of Inheritance, genotype assignment at conception, and pleiotropy (genes influencing more than one trait) for causal inference^13–15.

MR uses genetic variants strongly associated with traits of interest as opposed to the observed traits themselves in models. By relying on the random assortment of alleles (Mendel’s Laws) and the temporal assignment of genotype at conception, MR avoids most sources of confounding and reverse causation that distort causal estimates in observational studies. In two-sample MR, summary statistics are pulled from two genome-wide association (GWA) studies. These summary statistics are the data sources for two-sample MR^4,6,16–19 (Figure 1).

Figure 1.

Two-sample MR testing the causal effect of Education Years on T2D. Estimates of the SNP-Education Years associations (β^ZX) are calculated in sample 1 (from a genome-wide association, GWA, study of Education Years). The association between these same SNPs and T2D is then estimated in sample 2 (β^ZY) (from a T2D GWA study). These estimates are combined into Wald ratios (β^XY=β^ZY/β^ZX). The β^XY estimates are meta-analyzed using the inverse-variance weighted analysis (β^IVW) method and various sensitivity analyses. The IVW method produces an overall causal estimate Education Years on T2D.

MR also exploits vertical pleiotropy. For example, the assumption that the genetic variants for Education Years have an influence on T2D through their influence on the Education Years is an exploitation of vertical pleiotropy. But vertical pleiotropy is not the only type of pleiotropy. Horizontal pleiotropy also occurs. An example of horizontal pleiotropy would be if the SNPs associated with Education Years were also associated with some other trait (such as socioeconomic status), which then affects risk for T2D. This scenario would constitute a violation to the MR assumptions.

MR assumptions

MR has the following assumptions: (i) genetic instruments are strongly associated with the exposure; (ii) genetic instruments are independent of confounders of the exposure and the outcome; and (iii) genetic instruments are associated with the outcome only through the exposure^18,20. For example, the following must be true in order for the present analysis to be valid: (i) genetic variants robustly associated with Education Years must be chosen as instruments to test the causal relationship between Education Years and T2D; (ii) the genetic variants chosen to instrument Education Years must not be associated with confounders of the relationship between Education Years and T2D; and (iii) the genetic variants chosen to instrument Education Years must only impact T2D through their impact on Education Years. When violated, assumption (iii) describes horizontal pleiotropy, which can invalidate causal inference from vertical pleiotropy. Statistically based sensitivity estimators have been developed to evaluate potential violations to assumption (iii) (for more on this, see the subsection, Sensitivity analyses.)

Design

This study explores the impact of Education Years on T2D and 10 risk factors for T2D. For the later, a list of established risk factors for T2D was obtained from the website for the American Diabetes Association (ADA) (https://www.diabetes.org/diabetes-risk)²¹:

Being 45 or older
Being Black, Hispanic/Latino, American Indian, Asian American, or Pacific Islander
Having a parent with diabetes
Having a sibling with diabetes
Being overweight
Being physically inactive
Having high blood pressure
Having low high-density lipoprotein (HDL) cholesterol
Having high triglycerides
Having had diabetes during pregnancy (gestational diabetes)
Having been diagnosed with Polycystic Ovary Syndrome

Of these risk factors, all but “being 45 and older” and “being Black, Hispanic/Latino, American Indian, Asian American, or Pacific Islander” were suitable for investigation with two-sample MR.

Exposure data source: Education Years

The instrument for Education Years was obtained from a GWA study of Education Years performed by Okbay et al. (2016), which included 293,723 participants of European ancestry and adjusted for 10 principal components, age, sex, and study-specific controls²². Education Years, inclusive of college, was measured for those who were at least 30 years of age. International Standard Classification of Education (ISCED) categories were used to impute a years-of-education equivalent (SNP coefficients per standard deviation, SD, units of years of schooling; an SD-unit of schooling=3.6 years).

Outcome data source: T2D

The outcome data for T2D was extracted from Morris et al. (2012), which performed a GWA study of T2D in 149,821 participants of European decent, of which 34,840 had T2D²³. Their GWA adjusted for study-specific covariates and population structure.

Outcome data source: sibling with diabetes

The outcome data for having a sibling with diabetes was extracted from a GWA study performed by the Medical Research Council-Integrative Epidemiology Unit (MRC-IEU) staff, using PHESANT-derived²⁴ UK Biobank data^25,26 (UK Biobank data field 20111). Briefly, the UK Biobank is an open-access cohort that enrolled about 500,000 participants, largely of European descent²⁷. Genetic, health, and demographic data were collected on many of the participants and were made publicly available for researchers. The MRC-IEU staff ran numerous GWA studies with UK Biobank variables, adjusted for age at recruitment and sex, and made their results available through MR-Base, a public repository of summary statistics from GWA studies for use in MR studies. The GWA study of having a sibling with diabetes contained 362,826 participants, of which 31,073 were classified as having a sibling with diabetes.

Outcome data source: mother with diabetes

The outcome data for having a mother with diabetes was extracted from a GWA study performed by the MRC-IEU staff, which used PHESANT-derived UK Biobank data (UK Biobank data field 20110). The GWA study contained 423,892 participants, of which 40,091 were classified as having a mother with diabetes.

Outcome data source: father with diabetes

The outcome data for having a father with diabetes comes from a GWA study performed by the MRC-IEU staff, which used PHESANT-derived UK Biobank data (UK Biobank data field 20107). The GWA study contained 400,687 participants, of which 38,850 were classified as having a father with diabetes.

Outcome data source: overweight status

The outcome data for overweight status come from Berndt et al. (2013), which performed a GWA study of clinically defined overweight status in 158,855 participants of European ancestry, of which 93,015 were classified as overweight²⁸. Overweight case status was defined as BMI ≥25 kg/m².

Outcome data source: physical activity

The outcome data for physical activity come from a GWA study by the MRC-IEU staff, which used PHESANT-derived UK Biobank data for moderate physical activity, defined as the number of days of moderate physical activity per week performed for more than 10 minutes at a time. The GWA study included 440,266 participants.

Outcome data source: high blood pressure

A GWA study of high blood pressure (a binary measure) was performed by the MRC-IEU staff using PHESANT-derived variables²⁴ constructed from the UK Biobank data^25,26 (data field 6150: “Vascular/heart problems diagnosed by doctor: high blood pressure”). There were 461,880 participants, of which 124,227 had high blood pressure as determined by a physician.

Outcome data source: gestational diabetes

The GWA study of gestational diabetes (a binary measure) was performed by MRC-IEU staff using PHESANT-derived variables ²⁴ constructed from UK Biobank data^25,26 (data field 4041). Participants were asked if they only had diabetes during pregnancy. There were 462,933 participants, 240 of which self-reported having had gestational diabetes.

Outcome data source: polycystic ovarian syndrome

The outcome data for polycystic ovarian syndrome (a binary measure) was performed by MRC-IEU staff using PHESANT-derived variables²⁴ constructed from the UK Biobank data^25,26 (data field 20002). There were 462,933 participants, of which 571 self-reported having polycystic ovarian syndrome.

Outcome data source: HDL levels

The outcome data for circulating HDL levels (a continuous measure) come from Willer et al. (2013), which performed an age- and sex-adjusted GWA study of circulating HDL levels in up to 187,167 individuals, largely of European ancestry²⁹.

Outcome data source: triglyceride levels

The outcome data for triglyceride levels (a continuous measure) come from Willer et al. (2013), which performed an age- and sex-GWA study of circulating triglyceride levels in up to 177,861 individuals, largely of European ancestry²⁹.

To ease interpretability, all MR results for the effects of Education Years on T2D and T2D risk factors were exponentiated from log odds to odds ratios, except for outcomes of continuous variables (i.e., HDL and triglyceride levels), which are presented as beta estimates (Table 1).

View this table:

Table 1.

Causal estimates for Education Years on T2D and 10 risk factors for T2D.

The summary statistics used for the MR analyses are available in Supplementary Tables 1-11.

Instrument construction

As introduced in Figure 1, independent (those not in linkage disequilibrium, LD; R² < 0.01) SNPs associated at genome-wide significance (P < 5 × 10⁻⁸) with Education Years were extracted from the Okbay et al. (2016) GWA study. The summary statistics for the Education Years-associated SNPs were then extracted from each of the outcome GWA studies. SNP-Education Years and SNP-outcome associations were harmonized and combined with the IVW method (Figure 1).

Sensitivity analyses

A weakness of the IVW estimator is that its estimate can be biased if the meta-analyzed SNPs are directionally pleiotropic³⁰. This can cause a violation to MR assumption (iii) and invalidate the findings. To address this, MR-Egger regression, weighted median, and weighted mode MR methods can be run as complements to the IVW. The directions and magnitudes of their effect estimates can be compared to those of the IVW. Doing so is a type triangulation: comparing approaches that have different assumptions to weigh evidence³¹. The reason for this is that the various MR sensitivity estimators make different assumptions about possible underlying pleiotropy. Due to their different assumptions, it is unlikely that the IVW and sensitivity estimators would be homogeneous in the directions and magnitudes of their effect estimates if there were substantial violations to MR assumption (iii). Therefore, triangulating their directions and magnitudes of effects provides a screen against pleiotropy. (Nuanced descriptions of how the various MR estimators deal with pleiotropy are described elsewhere^30,32,33). MR-Egger regression, weighted median, and weighted mode MR sensitivity methods were run for all analyses.

A formal test for directional pleiotropy was also done with the MR-Egger intecept. If the MR-Egger intercept is not different than 1 on the exponentiated scale or 0 when non-exponentiated (P>0.05), this indicates a lack of evidence for bias due to pleiotropy in the IVW estimate.

In addition, potential outlier SNPs were removed using RadialMR regression³⁴ for the MR tests of Education Years on T2D risk factors. (The differing number of SNPs for the Education Years instruments is due to this and that the various outcome GWA studies not having a uniform set SNPs in their association studies). All instrumental variables included in this analysis have Cochrane’s Q-statistic P-values indicating no evidence for heterogeneity between SNPs³⁵. Heterogeneity in the effect estimates for SNPs can indicate pleiotropy. Thus, ensuring a lack of heterogeneity between SNPs is an additional method to boost the chance that MR assumption (iii) is not violated. Heterogeneity statistics are provided in Supplementary Tables 12-22.

The IVW and sensitivity estimations were performed in R version 3.5.2 with the “TwoSampleMR” package^16,36. Overall, 11 tests were performed. The Bonferroni correction was used to penalize for multiple testing: P=0.05/11 (0.005).

Power

The study was powered for the test of Education Years on T2D, using mRnd MR power calculator (available at http://cnsgenomics.com/shiny/mRnd/)³⁷. There was ≥80% power to detect odds ratios in the range of 0.3-0.7 (Figure 2). In addition to the overall power to detect an association, MR studies also rely on F-statistics. F-statistics provide an indication of instrument strength³⁸. F-statistics <10 are conventionally considered to be weak³⁹. F-statistics for each test are available in Table 1.

Figure 2. Power calculations for a range of plausible effects estimates for the MR test of Education Years on T2D.

Data Availability

Data availability. All data sources are publicly available and are accessible within MR-Base: http://www.mrbase.org/.

http://www.mrbase.org/

Data availability

All data sources are publicly available and are accessible within MR-Base: http://www.mrbase.org/¹⁶.

References

1.↵
Montez, J. K. & Friedman, E. M. Educational attainment and adult health: Under what conditions is the association causal? Soc Sci Med 127, 1–7 (2015).
OpenUrl CrossRef PubMed
2.↵
Gathmann, C., Jürges, H. & Reinhold, S. Compulsory schooling reforms, education and mortality in twentieth century Europe. Soc Sci Med 127, 74–82 (2015).
OpenUrl
3.↵
Barnes, J. C., Boutwell, B. B., Beaver, K. M., Gibson, C. L. & Wright, J. P. On the consequences of ignoring genetic influences in criminological research. J Crim Justice 42, 471–482 (2014).
OpenUrl CrossRef Web of Science
4.↵
Davey Smith, G. & Hemani, G. Mendelian randomization: genetic anchors for causal inference in epidemiological studies. Hum Mol Genet 23, R89–98 (2014).
OpenUrl CrossRef PubMed Web of Science
5.↵
Adam, D. The causation detector. Nature 576, 196–199 (2019).
OpenUrl
6.
Burgess, S. & Thompson, S. G. Interpreting findings from Mendelian randomization using the MR-Egger method. Eur J Epidemiol 32, 377–389 (2017).
OpenUrl CrossRef PubMed
7.↵
Davies, N. M., Dickson, M., Smith, G. D., Van Den Berg, G. J. & Windmeijer, F. The causal effects of education on health outcomes in the UK Biobank. Nat Hum Behav 2, 117–125 (2018).
OpenUrl
8.↵
Davies, N. M. et al. Multivariable two-sample Mendelian randomization estimates of the effects of intelligence and education on health. Elife 8, 1–22 (2019).
OpenUrl CrossRef PubMed
9.↵
Adams, C. D. Appraisal of the pleiotropic effects of intelligence and education on schizophrenia: a univariable and multivariable Mendelian randomization study. medRxiv (2019). doi:10.1101/19012401
OpenUrl Abstract/FREE Full Text
10.↵
Anderson, E. L. et al. Education, intelligence and Alzheimer’s disease: evidence from a multivariable two-sample Mendelian randomization study. bioRxiv 401042 (2018). doi:10.1101/401042
OpenUrl Abstract/FREE Full Text
11.↵
Deary, I. J. Intelligence. Annu. Rev. Psychol 63, 453–482 (2012).
OpenUrl CrossRef PubMed Web of Science
12.↵
Diabetes UK. Diabetes UK Facts and Figures 2019. 1–48 (2019).
13.↵
Davey Smith, G. & Ebrahim, S. ‘Mendelian randomization’: Can genetic epidemiology contribute to understanding environmental determinants of disease? Int J Epidemiol 32, 1–22 (2003).
OpenUrl CrossRef PubMed Web of Science
14.
Schooling, C. M., Freeman, G. & Cowling, B. J. Mendelian randomization and estimation of treatment efficacy for chronic diseases. Am J Epidemiol 177, 1128–1133 (2013).
OpenUrl CrossRef PubMed Web of Science
15.↵
Hemani, G., Bowden, J. & Smith, G. D. Evaluating the potential role of pleiotropy in Mendelian randomization studies. Hum Mol Genet 27, 195–208 (2018).
OpenUrl
16.↵
Hemani, G. et al. The MR-Base platform supports systematic causal inference across the human phenome. Elife 7, 1–29 (2018).
OpenUrl CrossRef PubMed
17.
Burgess, S., Butterworth, A. & Thompson, S. G. Mendelian randomization analysis with multiple genetic variants using summarized data. Genet Epidemiol 37, 658–665 (2013).
OpenUrl CrossRef PubMed
18.↵
Bowden, J., Smith, G. D. & Burgess, S. Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression. Int J Epidemiol 44, 512–525 (2015).
OpenUrl CrossRef PubMed
19.
Johnson, T. Efficient calculation for multi-SNP genetic risk scores. in American Society of Human Genetics Annual Meeting (2012). doi:10.1038/ng.784.
OpenUrl CrossRef PubMed
20.↵
Didelez, V. & Sheehan, N. Mendelian randomization as an instrumental variable approach to causal inference. Stat Methods Med Res. 16, 309–330 (2007).
OpenUrl CrossRef PubMed Web of Science
21.↵
American Diabetes Association. What causes diabetes? Find out and take control. Available at: https://www.diabetes.org/diabetes-risk. (Accessed: 28th January 2020)
22.↵
Okbay, A. et al. Genome-wide association study identifies 74 loci associated with educational attainment. Nature 533, 539–542 (2016).
OpenUrl CrossRef PubMed
23.↵
Morris, A., Voight, B. & Teslovich, T. Large-scale association analysis provides insights into the genetic architecture and pathophysiology of type 2 diabetes. Nat Genet 44, 981–990 (2012).
OpenUrl CrossRef PubMed
24.↵
Millard, L. A. C., Davies, N. M., Gaunt, T. R., Smith, G. D. & Tilling, K. Software application profile: PHESANT: A tool for performing automated phenome scans in UK Biobank. Int J Epidemiol 47, 29–35 (2018).
OpenUrl
25.↵
Collins, R. What makes UK Biobank special? Lancet 379, 1173–1174 (2012).
OpenUrl CrossRef PubMed Web of Science
26.↵
Sudlow, C. et al. UK Biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. Plos Med 12, 1–10 (2015).
OpenUrl CrossRef PubMed
27.↵
Fry, A. et al. Comparison of sociodemographic and health-related characteristics of UK Biobank participants with those of the general population. Am J Epidemiol 186, 1026–1034 (2017).
OpenUrl CrossRef PubMed
28.↵
Berndt, S. I. et al. Genome-wide meta-analysis identifies 11 new loci for anthropometric traits and provides insights into genetic architecture. Nat Genet 45, 501–512 (2013).
OpenUrl CrossRef PubMed
29.↵
Willer, C. J. et al. Discovery and refinement of loci associated with lipid levels. Nat Genet 45, 1274–1283 (2013).
OpenUrl CrossRef PubMed
30.↵
Spiller, W., Davies, N. M. & Palmer, T. M. Software application profile: mrrobust — a tool for performing two-sample summary Mendelian randomization analyses. Int J Epidemiol 48, 684–690 (2019).
OpenUrl
31.↵
Lawlor, D. A., Tilling, K. & Davey Smith, G. Triangulation in aetiological epidemiology. Int J Epidemiol 45, 1866–1886 (2016).
OpenUrl CrossRef PubMed
32.↵
Yarmolinsky, J. et al. Appraising the role of previously reported risk factors in epithelial ovarian cancer risk: a Mendelian randomization analysis. PLOS Med 16, e1002893 (2019).
OpenUrl CrossRef
33.↵
Hwang, L., Lawlor, D. A., Freathy, R. M., Evans, D. M. & Warrington, N. M. Using a two-sample Mendelian randomization design to investigate a possible causal effect of maternal lipid concentrations on offspring birth weight. Int J Epidemiol 005, 1–11 (2019).
OpenUrl CrossRef
34.↵
Bowden, J. et al. Improving the visualization, interpretation and analysis of two-sample summary data Mendelian randomization via the Radial plot and Radial regression. Int J Epidemiol 1–15 (2018). doi:10.1093/ije/dyy101
OpenUrl CrossRef PubMed
35.↵
Del Greco M F., Minelli, C., Sheehan, N. A. & Thompson, J. R. Detecting pleiotropy in Mendelian randomisation studies with summary data and a continuous outcome. Stat. Med. 34, 2926–2940 (2015).
OpenUrl CrossRef PubMed
36.↵
R Core Team. R: A language and environment for statistical computing. (2013).
37.↵
Burgess, S., Small, D. S. & Thompson, S. G. A review of instrumental variable estimators for Mendelian randomization. Stat Methods Med Res 1–26 (2015). doi:10.1177/0962280215597579
OpenUrl CrossRef PubMed
38.↵
Burgess, S. & Thompson, S. G. Avoiding bias from weak instruments in mendelian randomization studies. Int J Epidemiol 40, 755–764 (2011).
OpenUrl CrossRef PubMed Web of Science
39.↵
Pierce, B. L. & Burgess, S. Efficient design for mendelian randomization studies: subsample and 2-sample instrumental variable estimators. Am J Epidemiol 178, 1177–1184 (2013).
OpenUrl CrossRef PubMed Web of Science

View the discussion thread.

Posted February 07, 2020.

Download PDF

Supplementary Material

Data/Code

Citation Tools

Subject Area

Epidemiology

Subject Areas

All Articles

Addiction Medicine (399)
Allergy and Immunology (708)
Anesthesia (200)
Cardiovascular Medicine (2918)
Dentistry and Oral Medicine (333)
Dermatology (249)
Emergency Medicine (438)
Endocrinology (including Diabetes Mellitus and Metabolic Disease) (1032)
Epidemiology (12711)
Forensic Medicine (12)
Gastroenterology (827)
Genetic and Genomic Medicine (4567)
Geriatric Medicine (415)
Health Economics (726)
Health Informatics (2913)
Health Policy (1068)
Health Systems and Quality Improvement (1074)
Hematology (386)
HIV/AIDS (922)
Infectious Diseases (except HIV/AIDS) (14081)
Intensive Care and Critical Care Medicine (842)
Medical Education (422)
Medical Ethics (115)
Nephrology (467)
Neurology (4335)
Nursing (234)
Nutrition (636)
Obstetrics and Gynecology (801)
Occupational and Environmental Health (734)
Oncology (2261)
Ophthalmology (643)
Orthopedics (258)
Otolaryngology (324)
Pain Medicine (278)
Palliative Medicine (83)
Pathology (499)
Pediatrics (1196)
Pharmacology and Therapeutics (502)
Primary Care Research (494)
Psychiatry and Clinical Psychology (3734)
Public and Global Health (6916)
Radiology and Imaging (1524)
Rehabilitation Medicine and Physical Therapy (895)
Respiratory Medicine (915)
Rheumatology (436)
Sexual and Reproductive Health (443)
Sports Medicine (383)
Surgery (486)
Toxicology (60)
Transplantation (210)
Urology (178)

[1] 1.↵
Montez, J. K. & Friedman, E. M. Educational attainment and adult health: Under what conditions is the association causal? Soc Sci Med 127, 1–7 (2015).
OpenUrl CrossRef PubMed

[2] 2.↵
Gathmann, C., Jürges, H. & Reinhold, S. Compulsory schooling reforms, education and mortality in twentieth century Europe. Soc Sci Med 127, 74–82 (2015).
OpenUrl

[3] 3.↵
Barnes, J. C., Boutwell, B. B., Beaver, K. M., Gibson, C. L. & Wright, J. P. On the consequences of ignoring genetic influences in criminological research. J Crim Justice 42, 471–482 (2014).
OpenUrl CrossRef Web of Science

[4] 4.↵
Davey Smith, G. & Hemani, G. Mendelian randomization: genetic anchors for causal inference in epidemiological studies. Hum Mol Genet 23, R89–98 (2014).
OpenUrl CrossRef PubMed Web of Science

[5] 5.↵
Adam, D. The causation detector. Nature 576, 196–199 (2019).
OpenUrl

[6] 6.
Burgess, S. & Thompson, S. G. Interpreting findings from Mendelian randomization using the MR-Egger method. Eur J Epidemiol 32, 377–389 (2017).
OpenUrl CrossRef PubMed

[7] 7.↵
Davies, N. M., Dickson, M., Smith, G. D., Van Den Berg, G. J. & Windmeijer, F. The causal effects of education on health outcomes in the UK Biobank. Nat Hum Behav 2, 117–125 (2018).
OpenUrl

[8] 8.↵
Davies, N. M. et al. Multivariable two-sample Mendelian randomization estimates of the effects of intelligence and education on health. Elife 8, 1–22 (2019).
OpenUrl CrossRef PubMed

[9] 9.↵
Adams, C. D. Appraisal of the pleiotropic effects of intelligence and education on schizophrenia: a univariable and multivariable Mendelian randomization study. medRxiv (2019). doi:10.1101/19012401
OpenUrl Abstract/FREE Full Text

[10] 10.↵
Anderson, E. L. et al. Education, intelligence and Alzheimer’s disease: evidence from a multivariable two-sample Mendelian randomization study. bioRxiv 401042 (2018). doi:10.1101/401042
OpenUrl Abstract/FREE Full Text

[11] 11.↵
Deary, I. J. Intelligence. Annu. Rev. Psychol 63, 453–482 (2012).
OpenUrl CrossRef PubMed Web of Science

[12] 12.↵
Diabetes UK. Diabetes UK Facts and Figures 2019. 1–48 (2019).

[13] 13.↵
Davey Smith, G. & Ebrahim, S. ‘Mendelian randomization’: Can genetic epidemiology contribute to understanding environmental determinants of disease? Int J Epidemiol 32, 1–22 (2003).
OpenUrl CrossRef PubMed Web of Science

[14] 14.
Schooling, C. M., Freeman, G. & Cowling, B. J. Mendelian randomization and estimation of treatment efficacy for chronic diseases. Am J Epidemiol 177, 1128–1133 (2013).
OpenUrl CrossRef PubMed Web of Science

[15] 15.↵
Hemani, G., Bowden, J. & Smith, G. D. Evaluating the potential role of pleiotropy in Mendelian randomization studies. Hum Mol Genet 27, 195–208 (2018).
OpenUrl

[16] 16.↵
Hemani, G. et al. The MR-Base platform supports systematic causal inference across the human phenome. Elife 7, 1–29 (2018).
OpenUrl CrossRef PubMed

[17] 17.
Burgess, S., Butterworth, A. & Thompson, S. G. Mendelian randomization analysis with multiple genetic variants using summarized data. Genet Epidemiol 37, 658–665 (2013).
OpenUrl CrossRef PubMed

[18] 18.↵
Bowden, J., Smith, G. D. & Burgess, S. Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression. Int J Epidemiol 44, 512–525 (2015).
OpenUrl CrossRef PubMed

[19] 19.
Johnson, T. Efficient calculation for multi-SNP genetic risk scores. in American Society of Human Genetics Annual Meeting (2012). doi:10.1038/ng.784.
OpenUrl CrossRef PubMed

[20] 20.↵
Didelez, V. & Sheehan, N. Mendelian randomization as an instrumental variable approach to causal inference. Stat Methods Med Res. 16, 309–330 (2007).
OpenUrl CrossRef PubMed Web of Science

[21] 21.↵
American Diabetes Association. What causes diabetes? Find out and take control. Available at: https://www.diabetes.org/diabetes-risk. (Accessed: 28th January 2020)

[22] 22.↵
Okbay, A. et al. Genome-wide association study identifies 74 loci associated with educational attainment. Nature 533, 539–542 (2016).
OpenUrl CrossRef PubMed

[23] 23.↵
Morris, A., Voight, B. & Teslovich, T. Large-scale association analysis provides insights into the genetic architecture and pathophysiology of type 2 diabetes. Nat Genet 44, 981–990 (2012).
OpenUrl CrossRef PubMed

[24] 24.↵
Millard, L. A. C., Davies, N. M., Gaunt, T. R., Smith, G. D. & Tilling, K. Software application profile: PHESANT: A tool for performing automated phenome scans in UK Biobank. Int J Epidemiol 47, 29–35 (2018).
OpenUrl

[25] 25.↵
Collins, R. What makes UK Biobank special? Lancet 379, 1173–1174 (2012).
OpenUrl CrossRef PubMed Web of Science

[26] 26.↵
Sudlow, C. et al. UK Biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. Plos Med 12, 1–10 (2015).
OpenUrl CrossRef PubMed

[27] 27.↵
Fry, A. et al. Comparison of sociodemographic and health-related characteristics of UK Biobank participants with those of the general population. Am J Epidemiol 186, 1026–1034 (2017).
OpenUrl CrossRef PubMed

[28] 28.↵
Berndt, S. I. et al. Genome-wide meta-analysis identifies 11 new loci for anthropometric traits and provides insights into genetic architecture. Nat Genet 45, 501–512 (2013).
OpenUrl CrossRef PubMed

[29] 29.↵
Willer, C. J. et al. Discovery and refinement of loci associated with lipid levels. Nat Genet 45, 1274–1283 (2013).
OpenUrl CrossRef PubMed

[30] 30.↵
Spiller, W., Davies, N. M. & Palmer, T. M. Software application profile: mrrobust — a tool for performing two-sample summary Mendelian randomization analyses. Int J Epidemiol 48, 684–690 (2019).
OpenUrl

[31] 31.↵
Lawlor, D. A., Tilling, K. & Davey Smith, G. Triangulation in aetiological epidemiology. Int J Epidemiol 45, 1866–1886 (2016).
OpenUrl CrossRef PubMed

[32] 32.↵
Yarmolinsky, J. et al. Appraising the role of previously reported risk factors in epithelial ovarian cancer risk: a Mendelian randomization analysis. PLOS Med 16, e1002893 (2019).
OpenUrl CrossRef

[33] 33.↵
Hwang, L., Lawlor, D. A., Freathy, R. M., Evans, D. M. & Warrington, N. M. Using a two-sample Mendelian randomization design to investigate a possible causal effect of maternal lipid concentrations on offspring birth weight. Int J Epidemiol 005, 1–11 (2019).
OpenUrl CrossRef

[34] 34.↵
Bowden, J. et al. Improving the visualization, interpretation and analysis of two-sample summary data Mendelian randomization via the Radial plot and Radial regression. Int J Epidemiol 1–15 (2018). doi:10.1093/ije/dyy101
OpenUrl CrossRef PubMed

[35] 35.↵
Del Greco M F., Minelli, C., Sheehan, N. A. & Thompson, J. R. Detecting pleiotropy in Mendelian randomisation studies with summary data and a continuous outcome. Stat. Med. 34, 2926–2940 (2015).
OpenUrl CrossRef PubMed

[36] 36.↵
R Core Team. R: A language and environment for statistical computing. (2013).

[37] 37.↵
Burgess, S., Small, D. S. & Thompson, S. G. A review of instrumental variable estimators for Mendelian randomization. Stat Methods Med Res 1–26 (2015). doi:10.1177/0962280215597579
OpenUrl CrossRef PubMed

[38] 38.↵
Burgess, S. & Thompson, S. G. Avoiding bias from weak instruments in mendelian randomization studies. Int J Epidemiol 40, 755–764 (2011).
OpenUrl CrossRef PubMed Web of Science

[39] 39.↵
Pierce, B. L. & Burgess, S. Efficient design for mendelian randomization studies: subsample and 2-sample instrumental variable estimators. Am J Epidemiol 178, 1177–1184 (2013).
OpenUrl CrossRef PubMed Web of Science