Abstract
Tuberculosis (TB) remains a leading cause of death from an infectious disease worldwide. This is partly due to a lack of tools to effectively screen and triage individuals with potential TB. Whole blood RNA signatures have been extensively studied as potential biomarkers for TB, but they have failed to meet the World Health Organization’s (WHOs) target product profiles (TPPs) for a non-sputum triage or diagnostic test. In this study, we investigated the utility of plasma cell-free RNA (cfRNA) as a host response biomarker for TB. We used RNA profiling by sequencing to analyze plasma samples from 182 individuals with a cough lasting at least two weeks, who were seen at outpatient clinics in Uganda, Vietnam, and the Philippines. Of these individuals, 100 were diagnosed with microbiologically-confirmed TB. Our analysis of the plasma cfRNA transcriptome revealed 541 differentially abundant genes, the top 150 of which were used to train 15 machine learning models. The highest performing model led to a 9-gene signature that had a diagnostic accuracy of 89.1% (95% CI: 83.6-93.4%) and an area under the curve of 0.934 (95% CI: 0.8674-1) for microbiologically-confirmed TB. This 9-gene signature exceeds the optimal WHO TPPs for a TB triage test (sensitivity: 96.2% [95% CI: 80.9-100%], specificity: 89.7% [95% CI: 72.4-100%]) and was robust to differences in sample collection, geographic location, and HIV status. Overall, our results demonstrate the utility of plasma cfRNA for the detection of TB and suggest the potential for a point-of-care, gene expression-based assay to aid in early detection of TB.
One Sentence Summary This study is the first to investigate the utility of circulating RNA in plasma as a new class of host response signature for tuberculosis and provides evidence that plasma RNA signatures are highly specific for TB and robust against differences in patient cohorts and sample processing.
Competing Interest Statement
A. C. is listed as an inventor on submitted patents pertaining to cell-free nucleic acids (US patent applications 63/237,367 and 63/429,733). I.D.V. is a member of the Scientific Advisory Board of Karius Inc., Kanvas Biosciences and GenDX. I.D.V. is listed as an inventor on submitted patents pertaining to cell-free nucleic acids (US patent applications 63/237,367, 63/056,249, 63/015,095, 16/500,929, 41614P-10551-01-US) and receives consulting fees from Eurofins Viracor. All other authors declare that they have no competing interests.
Funding Statement
This work was supported by the National Institutes of Health (NIH) grants R01AI146165, R21AI133331, R21AI124237, R01AI151059, and a grant from the Bill and Melinda Gates Foundation INV-003145 (to I.D.V.). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
Institutional Review Boards at Cornell University (protocols IRB0145569, 1902008555); UCSF (protocol 20-32670); Heidelberg University (S-539/2020); the Makerere University School of Medicine (protocol 2017-020); Vietnam National Lung Hospital (protocol 566/2020/NCKH), and De La Salle Medical and Health Sciences Institute (protocol 2020-33-02-A) gave ethical approval for this work.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
All code will be made available on Github. Processed sequencing data will be deposited in the National Institutes of Health (NIH) / National Center for Biotechnology Information (NCBI) Sequence Read Archive (SRA) and Gene Expression Omnibus (GEO) repositories under restricted access via Database for Genotypes and Phenotypes (dbGAP).