Abstract
Coronavirus disease 2019 (COVID-19) has a highly variable disease severity. Possible associations between peripheral blood signatures and disease severity have been investigated since the emergence of the pandemic. Although several signatures were identified based on exploratory analyses of single-cell omics data, there are no state-of-the-art validated models to predict COVID-19 severity from comprehensive transcriptome profiling of Peripheral Blood Mononuclear Cells (PBMCs). In this paper, we present a computational workflow based on a Multilayer perceptron network that predicts the necessity of mechanical ventilation from PBMCs single-cell RNA-seq data. The study includes patient cohorts from Bonn, Berlin, Stanford, and three Korean medical centers. Training and model validation are performed using Berlin and Bonn samples, while testing is performed on completely unseen samples from the Stanford and Korean datasets. Our model shows a high area under the receiver operating characteristic (AUROC) curve (Korea: 1 (CI:1-1), Stanford: 0.86 (CI:0.81-0.9)), proving our model’s robustness. Moreover, we explain our model’s performance by identifying gene loci and cell types, which are most critical for the classification task. In summary, we could show that the expression of 15 genes and the cell type proportion of 29 PBMC classes distinguish between COVID-19 disease states.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This study was supported by the Netzwerk Universitaetsmedizin Germany through the project CODEX+
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
We used publicly available scRNA-seq data derived from PBMCs of COVID-19 samples. Count gene expression data of Bonn and Berlin cohorts were obtained from Schulte-Schrepping, J., Reusch, N., Paclik, D., Bassler, K., Schlickeiser, S., Zhang, B., Kraemer, B., Krammer, T., Brumhard, S., Bonaguro, L., et al. (2020). Severe COVID-19 Is Marked by a Dysregulated Myeloid Cell Compartment. Cell 182, 1419-1440.e23. https://doi.org/10.1016/j.cell.2020.08.001. Samples were retrieved from patients recruited at the University Hospital Bonn and the Charite Universitaetsmedizin of Berlin, respectively. Single-cell RNA seq raw dataset of patients enrolled in the Stanford University COVID-19 Biobanking studies and the three Korean medical centers (Asan Medical Center, Severance Hospital, and Chungbuk National University Hospital) were retrieved from the gene expression omnibus (GEO) under the accession numbers GSE174072 and GSE149689, respectively.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
We used publicly available scRNA-seq data derived from PBMCs of COVID-19 samples. Count gene expression data of Bonn and Berlin cohorts were obtained from Schulte-Schrepping, J., Reusch, N., Paclik, D., Bassler, K., Schlickeiser, S., Zhang, B., Kraemer, B., Krammer, T., Brumhard, S., Bonaguro, L., et al. (2020). Severe COVID-19 Is Marked by a Dysregulated Myeloid Cell Compartment. Cell 182, 1419-1440.e23. https://doi.org/10.1016/j.cell.2020.08.001. Samples were retrieved from patients recruited at the University Hospital Bonn and the Charite Universitaetsmedizin of Berlin, respectively. Single-cell RNA seq raw dataset of patients enrolled in the Stanford University COVID-19 Biobanking studies and the three Korean medical centers (Asan Medical Center, Severance Hospital, and Chungbuk National University Hospital) were retrieved from the gene expression omnibus (GEO) under the accession numbers GSE174072 and GSE149689, respectively.