PT - JOURNAL ARTICLE AU - Zhang, Sai AU - Cooper-Knock, Johnathan AU - Weimer, Annika K. AU - Harvey, Calum AU - Julian, Thomas H. AU - Wang, Cheng AU - Li, Jingjing AU - Furini, Simone AU - Frullanti, Elisa AU - Fava, Francesca AU - Renieri, Alessandra AU - Pan, Cuiping AU - Song, Jina AU - Billing-Ross, Paul AU - Gao, Peng AU - Shen, Xiaotao AU - Timpanaro, Ilia Sarah AU - Kenna, Kevin P. AU - , AU - , AU - Davis, Mark M. AU - Tsao, Philip S. AU - Snyder, Michael P. TI - Common and rare variant analyses combined with single-cell multiomics reveal cell-type-specific molecular mechanisms of COVID-19 severity AID - 10.1101/2021.06.15.21258703 DP - 2021 Jan 01 TA - medRxiv PG - 2021.06.15.21258703 4099 - http://medrxiv.org/content/early/2021/06/21/2021.06.15.21258703.short 4100 - http://medrxiv.org/content/early/2021/06/21/2021.06.15.21258703.full AB - The determinants of severe COVID-19 in non-elderly adults are poorly understood, which limits opportunities for early intervention and treatment. Here we present novel machine learning frameworks for identifying common and rare disease-associated genetic variation, which outperform conventional approaches. By integrating single-cell multiomics profiling of human lungs to link genetic signals to cell-type-specific functions, we have discovered and validated over 1,000 risk genes underlying severe COVID-19 across 19 cell types. Identified risk genes are overexpressed in healthy lungs but relatively downregulated in severely diseased lungs. Genetic risk for severe COVID-19, within both common and rare variants, is particularly enriched in natural killer (NK) cells, which places these immune cells upstream in the pathogenesis of severe disease. Mendelian randomization indicates that failed NKG2D-mediated activation of NK cells leads to critical illness. Network analysis further links multiple pathways associated with NK cell activation, including type-I-interferon-mediated signalling, to severe COVID-19. Our rare variant model, PULSE, enables sensitive prediction of severe disease in non-elderly patients based on whole-exome sequencing; individualized predictions are accurate independent of age and sex, and are consistent across multiple populations and cohorts. Risk stratification based on exome sequencing has the potential to facilitate post-exposure prophylaxis in at-risk individuals, potentially based around augmentation of NK cell function. Overall, our study characterizes a comprehensive genetic landscape of COVID-19 severity and provides novel insights into the molecular mechanisms of severe disease, leading to new therapeutic targets and sensitive detection of at-risk individuals.Competing Interest StatementM.P.S. is a cofounder of Personalis, Qbio, Sensomics, Filtricine, Mirvie, and January. He is on the scientific advisory of these companies and Genapsys. J.L. is a cofounder of Sensomics. No other authors have competing interests.Funding StatementWe acknowledge the Stanford Genetics Bioinformatics Service Center (GBSC) for providing computational infrastructure for this study. This study was also supported by the National Institutes of Health (1S10OD023452-01 to GBSC; CEGS 5P50HG00773504, 1P50HL083800, 1R01HL101388, 1R01-HL122939, S10OD025212, P30DK116074, and UM1HG009442 to M.P.S.), the Wellcome Trust (216596/Z/19/Z to J.C.K.), and Million Veteran Program, Office of Research and Development, Veterans Health Administration (MVP001). This publication does not represent the views of the Department of Veteran Affairs or the United States Government. Figures 1a and 4b were created with BioRender.com. This study is part of the GEN-COVID Multicenter Study (https://sites.google.com/dbm.unisi.it/gen-covid), the Italian multicenter study aimed at identifying the COVID-19 host genetic bases. Specimens were provided by the COVID-19 Biobank of Siena, which is part of the Genetic Biobank of Siena, member of BBMRI-IT, of Telethon Network of Genetic Biobanks (project no. GTB18001), of EuroBioBank, and of RDConnect. We thank the CINECA consortium for providing computational resources and the Network for Italian Genomes (NIG) (http://www.nig.cineca.it) for its support. We thank private donors for the support provided to A.R. (Department of Medical Biotechnologies, University of Siena) for the COVID-19 host genetics research project (D.L n.18 of March 17, 2020). We also thank the COVID-19 Host Genetics Initiative (https://www.covid19hg.org/), MIUR project 'Dipartimenti di Eccellenza 2018-2020' to the Department of Medical Biotechnologies University of Siena, Italy, and 'Bando Ricerca COVID-19 Toscana' project to Azienda Ospedaliero Universitaria Senese. We also thank Intesa San Paolo for the 2020 charity fund dedicated to the project 'N. B/2020/0119 Identificazione delle basi genetiche determinanti la variabilita clinica della risposta a COVID-19 nella popolazione italiana' and 'Bando FISR 2020' in COVID-19 from Italian Ministry of University e Research.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:The GEN-COVID study was approved by the University Hospital of Siena Ethical Review Board (Protocol n. 16929, dated March 16, 2020). The Million Veteran Program (MVP) received approval its study protocol by the Veterans Affairs Central IRB, and by the IRBs at each field center where all participants gave written informed consent. All local R&D IRB committees are: Corporal Michael J. Crescenz VA Medical Center; VA Palo Alto Health Care System; VA Salt Lake City Health Care System; Edith Nourse Rogers Memorial Veterans Hospital; Phoenix VA Health Care System; Atlanta VA Medical Center; VA Boston Healthcare System; and the North Florida / South Georgia Veterans Health System.All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesThe GEN-COVID WES and clinical data are available by consultation (A.R.). The VA WGS and clinical data are available upon request from the corresponding authors (P.S.T. and M.P.S.); these data are not publicly available due to US Government and Department of Veteran's Affairs restrictions relating to participant privacy and consent. All other data used in this study are available from the original studies.