Abstract
Unlike conventional epidemiological studies that use observational data to estimate “associations” between risk factors and disease, the science of causal inference has identified situations where causal estimates can be made from observational data, using results such as the “backdoor criteria”. These results are combined here with established epidemiological methods, to calculate simple population attribution fractions that estimate the causal influence of risk factors on disease incidence, and can be estimated using conventional proportional hazards methods. A counterfactual argument gives an attribution fraction for individuals. Causally meaningful attribution fractions cannot be constructed for all risk factors or confounders, but they can for the important established risk factors of smoking and body mass index (BMI). Using the new results, the causal attribution of smoking and BMI to the incidence of 226 diseases in the UK Biobank are estimated, and summarised in terms of disease chapters from the International Classification of Diseases (ICD-10). The diseases most strongly attributed to smoking and BMI are identified, finding 11 with attribution fractions greater than 0.5, and a small number with protective associations. The results provide new tools to quantify the causal influence of risk factors such as smoking and BMI on disease, and survey the causal influence of smoking and BMI on the landscape of disease incidence in the UK Biobank population.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This research was funded by a fellowship from the Nuffield Department of Population Health, University of Oxford.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The study uses UK Biobank data, that is available by application from: www.ukbiobank.ac.uk
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Footnotes
Following feedback, the title, abstract, and introductions have been rewritten, and the Methods moved and simplified. A new subsection has been added to discuss limitations of the work.
Data Availability
The study uses UK Biobank data, that is available by application from: www.ukbiobank.ac.uk