Abstract
Background Understanding the host genetic architecture and viral immunity contributes to the development of effective vaccines and therapeutics for controlling the COVID-19 pandemic. Alterations of immune responses in peripheral blood mononuclear cells play a crucial role in the detrimental progression of COVID-19. However, the effects of host genetic factors on immune responses for severe COVID-19 remain largely unknown.
Methods We constructed a powerful computational framework to characterize the host genetics-influenced immune cell subpopulations for severe COVID-19 by integrating GWAS summary statistics (N = 969,689 samples) with four independent scRNA-seq datasets (N = 606,534 cells).
Results We found that 34 risk genes were significantly associated with severe COVID-19, and the number of highly-expressed genetics-risk genes increased with the severity of COVID-19. Three cell-subtypes that are CD16+monocytes, megakaryocytes, and memory CD8+T cells were significantly enriched by COVID-19-related genetic association signals. Notably, three causal risk genes of CCR1, CXCR6, and ABO were specifically expressed in these three cell types, respectively. CCR1+CD16+monocytes and ABO+ megakaryocytes with significant up-regulated genes including S100A12, S100A8, S100A9, and IFITM1 confer higher risk to the cytokine storms among severe patients. CXCR6+ memory CD8+ T cells exhibit a notable polyfunctionality of multiple immunologic features, including elevation of proliferation, migration, and chemotaxis. Moreover, we observed a prominent increase in cell-cell interactions of both CCR1+ CD16+monocytes and CXCR6+ memory CD8+T cells in severe patients compared to normal controls among both PBMCs and lung tissues, and elevated interactions with epithelial cells could contribute to enhance the resident to lung airway for against COVID-19 infection.
Conclusions We uncover a major genetics-modulated immunological shift between mild and severe infection, including an increase in up-regulated genetic-risk genes, excessive secreted inflammatory cytokines, and functional immune cell subsets contributing high risk to severity, which provides novel insights in parsing the host genetics-influenced immune cells for severe COVID-19.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This study was funded by the National Natural Science Foundation of China (61871294 to J.S.), the Scientific Research Foundation for Talents of Wenzhou Medical University (KYQD20201001 to Y.M.), and Science Foundation of Zhejiang Province (LR19C060001 to J.S).
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
https://www.ncbi.nlm.nih.gov/gds/?term=GSE149689,https://www.ncbi.nlm.nih.gov/gds/?term=GSE150861,and the ArrayExpress database (https://www.ebi.ac.uk/arrayexpress/experiments/E-MTAB-9357).
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
All the GWAS summary statistics used in this study can be accessed in the official websites (www.covid19hg.org/results). The GTEx eQTL data (version 8) were downloaded from Zenodo repository (https://zenodo.org/record/3518299#.Xv6Z6igzbgl). Three scRNA-seq datasets were downloaded from the GEO database (https://www.ncbi.nlm.nih.gov/gds/?term=GSE149689 and https://www.ncbi.nlm.nih.gov/gds/?term=GSE150861) and the ArrayExpress database (https://www.ebi.ac.uk/arrayexpress/experiments/E-MTAB-9357). All analysis code in the Methods is available in an online GitHub repository at https://github.com/mayunlong89/COVID19_scRNA.
Abbreviations
- COVID-19
- coronavirus disease 2019
- SARS-CoV-2
- severe acute respiratory syndrome coronavirus 2
- GWAS
- genome-wide association study
- scRNA-seq
- single cell RNA sequencing
- PBMCs
- peripheral blood mononuclear cells
- BALF
- bronchoalveolar lavage fluid
- eQTL
- expression quantitative trait loci
- GEO
- the Gene Expression Omnibus database
- WHO
- the World Health Organization
- SNP
- single nucleotide polymorphism
- OR
- odds ratio
- MAF
- minor allele frequency
- quantile-quantile
- MAGMA
- Multi-marker Analysis of GenoMic Annotation
- LD
- linkage disequilibrium
- FDR
- false discovery rate
- KEGG
- the Kyoto Encyclopedia of Genes and Genomes
- PPC
- the Pearson correlation coefficient
- MDS
- multidimensional scaling
- OTG
- the Open Target Genetics
- PPI
- protein-protein interaction
- up-DEG
- significantly up-regulated expression gene associated with COVID-19