Abstract
Rapid genotype-based drug susceptibility testing for the Mycobacterium tuberculosis complex (MTBC) relies on a comprehensive knowledgebase of the genetic determinants of resistance. We built a catalog of resistance-associated mutations in MTBC using a novel regression-based approach and benchmarked it against the 2nd edition of the World Health Organization mutation catalog. We trained multivariate logistic regression models on over 50,000 MTBC isolates to associate binary resistance phenotypes for 15 antitubercular drugs with variants extracted from candidate resistance genes. Regression detects 452/457 (99%) of the resistance-associated variants identified by the existing method (the SOLO method) and grades 218 (29%) more variants in total than SOLO. On average, the regression-based catalog achieves higher sensitivity than SOLO (+3.2 percentage points, pp), with smaller average decreases in specificity (−1.0 pp) and positive predictive value (−1.8 pp). The regression pipeline also detects compensatory mutations for isoniazid resistance in ahpC and variants linked to bedaquiline and aminoglycoside hypersusceptibility. These results inform the continued development of targeted next-generation sequencing, whole-genome sequencing, and other commercial molecular assays for diagnosing resistance in MTBC. In addition to grading genetic variants by their associations with phenotype, regression models could potentially provide an accurate and scalable method of predicting antibiotic resistance from bacterial genetic profiles.
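To illustrate the general idea of associating a binary resistance phenotype with variant presence through multivariate logistic regression, the following is a minimal sketch on toy data. It is not the authors' pipeline: the variant names, the eight toy isolates, the L2 penalty, and the plain gradient-descent fit are all assumptions made for illustration only.

```python
import math

def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def fit_logistic(X, y, l2=0.1, lr=0.5, steps=2000):
    """Fit an L2-penalized logistic regression by batch gradient descent.

    X: list of rows, each a list of 0/1 variant indicators.
    y: list of 0/1 phenotypes (1 = resistant, 0 = susceptible).
    The penalty strength and optimizer settings are illustrative assumptions.
    """
    n, p = len(X), len(X[0])
    w = [0.0] * p
    b = 0.0
    for _ in range(steps):
        grad_w = [0.0] * p
        grad_b = 0.0
        for xi, yi in zip(X, y):
            pred = sigmoid(b + sum(wj * xj for wj, xj in zip(w, xi)))
            err = pred - yi
            grad_b += err
            for j in range(p):
                grad_w[j] += err * xi[j]
        b -= lr * grad_b / n  # intercept is left unpenalized
        w = [wj - lr * (gj / n + l2 * wj) for wj, gj in zip(w, grad_w)]
    return w, b

# Hypothetical variants: variant_A co-occurs with resistance,
# variant_B is spread evenly across resistant and susceptible isolates.
variant_names = ["variant_A", "variant_B"]
X = [
    [1, 0], [1, 1], [1, 0], [1, 1],  # resistant isolates
    [0, 1], [0, 0], [0, 1], [0, 0],  # susceptible isolates
]
y = [1, 1, 1, 1, 0, 0, 0, 0]

coefs, intercept = fit_logistic(X, y)
weights = dict(zip(variant_names, coefs))
odds_ratios = {name: math.exp(c) for name, c in weights.items()}

for name in variant_names:
    print(f"{name}: coefficient={weights[name]:.3f}, odds ratio={odds_ratios[name]:.2f}")
```

In this toy setting, the variant that tracks resistance receives a clearly positive coefficient (odds ratio above 1), while the evenly distributed variant stays near zero. A real catalog would additionally need significance testing, population-structure handling, and grading thresholds, none of which are shown here.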
Competing Interest Statement
TCR received salary support from FIND. Support for this project was provided by Unitaid through the Foundation for Innovative New Diagnostics (FIND). The views expressed by the authors do not necessarily reflect the views of the funding agency.
Funding Statement
This study was funded by the National Institutes of Health, National Institute of Allergy and Infectious Diseases, National Science Foundation, Wellcome Trust, UK Medical Research Council (MRC) Centre for Global Infectious Disease Analysis, FIND, and Unitaid.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Data Availability
All data produced in the present study are available upon reasonable request to the authors.