Keeping up with the pathogens: Improved antimicrobial resistance detection and prediction from Pseudomonas aeruginosa genomes

Danielle E. Madden; Timothy Baird; Scott C. Bell; Kate L. McCarthy; Erin P. Price; Derek S. Sarovich

doi:10.1101/2022.08.11.22278689

Abstract

Background Antimicrobial resistance (AMR) is an intensifying threat that requires urgent mitigation to avoid a post-antibiotic era. The ESKAPE pathogen, Pseudomonas aeruginosa, represents one of the greatest AMR concerns due to increasing multi- and pan-drug resistance rates. Shotgun sequencing is quickly gaining traction for in silico AMR profiling due to its unambiguity and transferability; however, accurate and comprehensive AMR prediction from P. aeruginosa genomes remains an unsolved problem.

Methods We first curated the most comprehensive database yet of known P. aeruginosa AMR variants. Next, we performed comparative genomics and microbial genome-wide association study analysis across a Global isolate Dataset (n=1877) with paired antimicrobial phenotype and genomic data to identify novel AMR variants. Finally, the performance of our P. aeruginosa AMR database, implemented in our ARDaP software, was compared with three previously published in silico AMR gene detection or phenotype prediction tools – abritAMR, AMRFinderPlus, ResFinder – across both the Global Dataset and an analysis-naïve Validation Dataset (n=102).

Results Our AMR database comprises 3639 mobile AMR genes and 733 AMR-conferring chromosomal variants, including 75 chromosomal variants not previously reported, and 284 chromosomal variants that we show are unlikely to confer AMR. Our pipeline achieved a genotype-phenotype balanced accuracy (bACC) of 85% and 81% across 10 clinically relevant antibiotics when tested against the Global and Validation Datasets, respectively, vs. just 56% and 54% with abritAMR, 58% and 54% with AMRFinderPlus, and 60% and 53% with ResFinder.

Conclusions Our ARDaP software and associated AMR variant database provides the most accurate tool yet for predicting AMR phenotypes in P. aeruginosa, far surpassing the performance of current tools. Implementation of our ARDaP-compatible database for routine AMR prediction from P. aeruginosa genomes and metagenomes will improve AMR identification, addressing a critical facet in combatting this treatment-refractory pathogen. However, knowledge gaps remain in our understanding of the P. aeruginosa resistome, particularly the basis of colistin AMR.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

This work was supported by Advance Queensland (AQIRF0362018 and AQRF13016-17RD2), the Wishlist Sunshine Coast Health Foundation (2019-14), and the National Health and Medical Research Council (GNT455919). The funders had no role in study design, data acquisition, analysis, interpretation, writing, or manuscript submission.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

Clinical specimen collection was approved by The Prince Charles Hospital (HREC/13/QPCH/127 and HREC/2019/QPCH/48013) and Royal Brisbane and Womens Hospital (DA: jl) Human Research Ethics Committees. Participants with cystic fibrosis or chronic obstructive pulmonary disease provided written consent. A waiver of informed consent was approved for other specimens due to a low-to-negligible risk assessment.

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Footnotes

Based on peer review feedback, we have made the following updates to our manuscript: 1. We have now added the new AMR prediction tool, abritAMR, to our analysis. Its inclusion means we have comprehensively tested the performance of our ARDaP-compatible AMR database against three published AMR detection/prediction tools; 2. We have added 486 genomes to our Global Dataset - Sun et al., 2023 (n=197 Chinese genomes) and Cortes-Lara et al., 2021 (n=289 Spanish genomes), along with 13 new strains from our own collection to our Validation Dataset. We then repeated our genome-wide association study, comparative genomic, and machine learning analyses across these larger datasets; 3. Numerous text updates and changes; and 4. Updates to all figures and supplementary data files.

Data Availability

The datasets supporting the conclusions of this article are available in the NCBI GenBank (references NSXK00000000.1 and NSZK00000000.1) and Sequence Read Archive database via BioProject accession PRJNA761496 (Validation Dataset), and BioProject accessions PRJEB15036, PRJEB21341, PRJEB29539, PRJEB14771, PRJNA297679, PRJNA317143, PRJNA526797, PRJNA264310, PRJNA388357, PRJNA793523, PRJEB40140 and PRJNA532924 (Global Dataset). The P. aeruginosa AMR database constructed for this study is freely available at: https://github.com/dsarov/ARDaP/tree/master/Databases/Pseudomonas_aeruginosa_pao1. The ARDaP software is freely available at: https://github.com/dsarov/ARDaP. All code used in this manuscript is available at https://github.com/dsarov/P_aeruginosa_ARDaP_manuscript.

Data Availability

The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license.