Abstract
SARS-CoV-2 is responsible for the coronavirus disease 2019 (COVID-19) and the current health crisis. Despite intensive research efforts, the genes and pathways that contribute to COVID-19 remain poorly understood. We therefore used an integrative genomics (IG) approach to identify candidate genes responsible for COVID-19 and its severity. We used Bayesian colocalization (COLOC) and summary-based Mendelian randomization to combine gene expression quantitative trait loci (eQTLs) from the Lung eQTL (n=1,038) and eQTLGen (n=31,784) studies with published COVID-19 genome-wide association study (GWAS) data from the COVID-19 Host Genetics Initiative. Additionally, we used COLOC to integrate plasma protein quantitative trait loci (pQTL) from the INTERVAL study (n=3,301) with COVID-19-associated loci. Finally, we determined any causal associations between plasma proteins and COVID-19 using multi-variable two-sample Mendelian randomization (MR). We found that the expression of 20 genes in lung and 31 genes in blood was associated with COVID-19. Of these genes, only three (LZTFL1, SLC6A20 and ABO) had been previously linked with COVID-19 in GWAS. The novel loci included genes involved in interferon pathways (IL10RB, IFNAR2 and OAS1). Plasma ABO protein, which is associated with blood type in humans, demonstrated a significant causal relationship with COVID-19 in MR analysis; increased plasma levels were associated with an increased risk of having COVID-19 and risk of severe COVID-19. In summary, our study identified genes associated with COVID-19 that may be prioritized for future investigation. Importantly, this is the first study to demonstrate a causal association between plasma ABO protein and COVID-19.
Competing Interest Statement
S.M. reports personal fees from Novartis and Boehringer-Ingelheim, outside the submitted work. W.T. reports fees to Institution from Roche-Ventana, AbbVie, Merck-Sharp-Dohme and Bristol-Myers-Squibb, outside the submitted work. M.v.d.B. reports research grants paid to Institution from Astra Zeneca, Novartis, outside the submitted work. D.D.S. reports research funding from AstraZeneca and received honoraria for speaking engagements from Boehringer Ingelheim and AstraZeneca over the past 36 months, outside of the submitted work.
Funding Statement
A.I.H.C. and S.M. are funded by MITACS Accelerate grant. D.D.S. holds the De Lazzari Family Chair at HLI and a Tier 1 Canada Research Chair in COPD. Y.B. holds a Canada Research Chair in Genomics of Heart and Lung Diseases.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
Not applicable. Publicly available third-party data sets were used for this.
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
The results associated to this manuscript are available on the additional file. Specific datasets used to conduct this research are available as follow: The COVID-19 summary statistic can be accessed through the COVID-19 HG website (https://www.covid19hg.org/). Lung eQTL study expression and genotype data can be obtained through GEO platform accession number GSE23546 and the dbGaP Study Accession phs001745.v1.p1, respectively. Blood cis-eQTLs summary statistics can be accessed through the eQTLGen website (https://www.eqtlgen.org/cis-eqtls.html). The INTERVAL study pQTLs summary statistic are available through the European Genotype Archive (accession number EGAS00001002555).