ABSTRACT
Early evidence that patients with (multiple) pre-existing diseases are at highest risk for severe COVID-19 has been instrumental in the pandemic to allocate critical care resources and later vaccination schemes. However, systematic studies exploring the breadth of medical diagnoses, including common, but non-fatal diseases are scarce, but may help to understand severe COVID-19 among patients at supposedly low risk. Here, we systematically harmonized >12 million primary care and hospitalisation health records from ∼500,000 UK Biobank participants into 1448 collated disease terms to systematically identify diseases predisposing to severe COVID-19 (requiring hospitalisation or death) and its post-acute sequalae, Long COVID. We identified a total of 679 diseases associated with an increased risk for severe COVID-19 (n=672) and/or Long COVID (n=72) that spanned almost all clinical specialties and were strongly enriched in clusters of cardio-respiratory and endocrine-renal diseases. For 57 diseases, we established consistent evidence to predispose to severe COVID-19 based on survival and genetic susceptibility analyses. This included a possible role of symptoms of malaise and fatigue as a so far largely overlooked risk factor for severe COVID-19. We finally observed partially opposing risk estimates at known risk loci for severe COVID-19 for etiologically related diseases, such as post-inflammatory pulmonary fibrosis (e.g., MUC5B, NPNT, and PSMD3) or rheumatoid arthritis (e.g., TYK2), possibly indicating a segregation of disease mechanisms. Our results provide a unique reference that demonstrates how 1) complex co-occurrence of multiple – including non-fatal – conditions predispose to increased COVID-19 severity and 2) how incorporating the whole breadth of medical diagnosis can guide the interpretation of genetic risk loci.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
The authors acknowledge the Scientific Computing of the IT Division at the Charite - Universitaetsmedizin Berlin for providing computational resources that have contributed to the research results reported in this paper (https:// www.charite.de/en/research/research_support_services/research_infrastructure/science_it/#c30646061). This work was supported by funding of the German Centre for Cardiovascular Research (DZHK) and the German Ministry of Education and Research (BMBF), and the UKRI/NIHR Strategic Priorities Award in Multimorbidity Research for the Multimorbidity Mechanism and Therapeutics Research Collaborative (MR/V033867/1) to C.L.. H.H. and S.D. are supported by Health Data Research UK and the National Institute for Health Research (NIHR) Biomedical Research Centre at University College London NHS Hospitals Trust. M.A. and G.K. are supported by National Institutes of Health/National Institute on Aging grants RF1AG059093, U01AG061359, U19AG063744, R01AG069901, U19AG074879, and R01AG081322. G.K. also received funding from the German Federal Ministry of Education and Research (BMBF) (BiomarKid, 01EA2203B) under the umbrella of the European Joint Programming Initiative 'A Healthy Diet for a Healthy Life' (JPI HDHL) and of the ERA-NET Cofund ERA-HDHL (GA No 696295 of the EU Horizon 2020 Research and Innovation Programme) and of the German Network for Mitochondrial Disorders (mitoNET, 01GM1906C). This work was supported by the de.NBI Cloud within the German Network for Bioinformatics Infrastructure (de.NBI) funded by the German Federal Ministry of Education and Research (BMBF) (031A532B, 031A533A, 031A533B, 031A534A, 031A535A, 031A537A, 031A537B, 031A537C, 031A537D, 031A538A).
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The UKB was approved by the National Research Ethics Service Committee Northwest Multi-Centre Haydock and all study procedures were performed in accordance with the World Medical Association Declaration of Helsinki ethical principles for medical research.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Footnotes
↵* These authors jointly supervised the work
The manuscript has been substantially revised focussing entirely on COVID-19 in adherence to COPI regulations that made the linkage to most of the primary data possible.
Data availability
Genome-wide summary statistics for diseases (‘phecodes’) in UK Biobank were generated, in part, from primary care data released to UK Biobank specifically for the use of COVID-19 research only, according to COPI regulations, and can therefore not be made publicly available. All Access to individual level data can be requested by bona fide researchers from the UK Biobank (https://www.ukbiobank.ac.uk/). This research has been conducted under the application 44448. Mapping of Read codes to phecodes can be downloaded from https://github.com/spiros/ukbiobank-read-to-phecode.