PT - JOURNAL ARTICLE AU - Kalantar, Katrina AU - Neyton, Lucile AU - Abdelghany, Mazin AU - Mick, Eran AU - Jauregui, Alejandra AU - Caldera, Saharai AU - Serpa, Paula Hayakawa AU - Ghale, Rajani AU - Albright, Jack AU - Sarma, Aartik AU - Tsitsiklis, Alexandra AU - Leligdowicz, Aleksandra AU - Christenson, Stephanie AU - Liu, Kathleen AU - Kangelaris, Kirsten AU - Hendrickson, Carolyn AU - Sinha, Pratik AU - Gomez, Antonio AU - Neff, Norma AU - Pisco, Angela AU - Doernberg, Sarah AU - Derisi, Joseph L. AU - Matthay, Michael A. AU - Calfee, Carolyn S. AU - Langelier, Charles R. TI - Integrated host-microbe metagenomics for sepsis diagnosis in critically ill adults AID - 10.1101/2022.07.16.22277700 DP - 2022 Jan 01 TA - medRxiv PG - 2022.07.16.22277700 4099 - http://medrxiv.org/content/early/2022/07/17/2022.07.16.22277700.short 4100 - http://medrxiv.org/content/early/2022/07/17/2022.07.16.22277700.full AB - Sepsis is a leading cause of death, and improved approaches for disease diagnosis and detection of etiologic pathogens are urgently needed. Here, we carried out integrated host and pathogen metagenomic next generation sequencing (mNGS) of whole blood (n=221) and plasma RNA and DNA (n=138) from critically ill patients following hospital admission. We assigned patients into sepsis groups based on clinical and microbiological criteria: 1) sepsis with bloodstream infection (SepsisBSI), 2) sepsis with peripheral site infection but not bloodstream infection (Sepsisnon-BSI), 3) suspected sepsis with negative clinical microbiological testing; 4) no evidence of infection (No-Sepsis), and 5) indeterminant sepsis status. From whole blood gene expression data, we first trained a bagged support vector machine (bSVM) classifier to distinguish SepsisBSI and Sepsisnon-BSI patients from No-Sepsis patients, using 75% of the cohort. This classifier performed with an area under the receiver operating characteristic curve (AUC) of 0.81 in the training set (75% of cohort) and an AUC of 0.82 in a held-out validation set (25% of cohort). Surprisingly, we found that plasma RNA also yielded a biologically relevant transcriptional signature of sepsis which included several genes previously reported as sepsis biomarkers (e.g., HLA-DRA, CD-177). A bSVM classifier for sepsis diagnosis trained on RNA gene expression data performed with an AUC of 0.97 in the training set and an AUC of 0.77 in a held-out validation set. We subsequently assessed the pathogen-detection performance of DNA and RNA mNGS by comparing against a practical reference standard of clinical bacterial culture and respiratory viral PCR. We found that sensitivity varied based on site of infection and pathogen, with an overall sensitivity of 83%, and a per-pathogen sensitivity of 100% for several key sepsis pathogens including S. aureus, E. coli, K. pneumoniae and P. aeruginosa. Pathogenic bacteria were also identified in 10/37 (27%) of patients in the No-Sepsis group. To improve detection of sepsis due to viral infections, we developed a secondary RNA host transcriptomic classifier which performed with an AUC of 0.94 in the training set and an AUC of 0.96 in the validation set. Finally, we combined host and microbial features to develop a proof-of-concept integrated sepsis diagnostic model that identified 72/73 (99%) of microbiologically confirmed sepsis cases, and predicted sepsis in 14/19 (74%) of suspected, and 8/9 (89%) of indeterminate sepsis cases. In summary, our findings suggest that integrating host transcriptional profiling and broad-range metagenomic pathogen detection from nucleic acid may hold promise as a tool for sepsis diagnosis.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis work was also supported by the National Heart, Lung and Blood Institute [R35 HL140026, K23HL138461-01A1, K24HL137013 (PGW), F32 HL151117, and the Chan Zuckerberg Biohub.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:The study was approved by the UCSF Institutional Review Board (IRB) under protocol 10-02852, which granted a waiver of initial consent for blood sampling. Informed consent was subsequently obtained from patients or their surrogates for continued study participation, as previously described. Auriemma, C. L. et al. Acute respiratory distress syndrome-attributable mortality in critically ill patients with sepsis. Intensive Care Med (2020). Agrawal, A. et al. Plasma angiopoietin-2 predicts the onset of acute lung injury in critically ill patients. Am J Respir Crit Care Med 187, (2013). I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesThe processed genecount data are available from the National Center for Biotechnology Information Gene Expression Omnibus database under accession code GSE189403. The raw fastq files with microbial sequencing reads are available from the Sequence Read Archive under BioProject ID: PRJNA783060.