Abstract
The major opportunistic pathogen Escherichia coli is the largest cause of antimicrobial resistance (AMR) associated infections and deaths globally. Considerable antigenic diversity has been documented in Extra-intestinal pathogenic E. coli (ExPEC). Still, the need for systematic genomic surveys of asymptomatic colonisation and invasive disease has precluded the quantification of K-type invasive potential across different ExPEC lineages. We assembled and curated an in-silico capsular typing database for group 2 and group 3 K-loci from >20,000 genomes and applied it to paired carriage and disease cohorts to investigate K-type epidemiology. The most virulent circulating capsules have estimated odds ratios of >10 for being found in bloodstream infections versus carriage. The invasive potential differed markedly between lineages, and subclades of the global multi-drug resistant ST131, which displayed limited O and H antigens but substantial K-type diversity. We also discovered that insertion sequence elements contribute to the evolutionary dynamics of group 2 and group 3 K-loci by importing new capsular genes. Furthermore, the level of capsule diversity was positively correlated with more recombinogenic lineages that could adapt their antigenic repertoire faster. Our investigation highlights several K-types and lineages that contribute disproportionately to invasive ExPEC disease, which are associated with high levels of AMR. These results have significant translational potential, including improved ExPEC diagnostics, personalised therapy options, and the ability to build predictive regional risk maps by combining genomic surveys with demographic and patient frailty data.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
The study was was funded by the Trond Mohn Foundation (grant identifier TMS2019TMT04 to A.K.P., R.A.G., O.S., P.J.J., and J.C.). The presented work has received funding from the European Union's Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie Actions (grant No. 801,133 to S.A.-A. and A.K.P.), from Wellcome Trust (grant no. 220540/Z/20/A to YS, TL) and has also been supported by the European Research Council (grant No. 742158 to J.C.).
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The study used ONLY openly available human data that were originally published here: 1.Gladstone, R. A. et al. Emergence and dissemination of antimicrobial resistance in Escherichia coli causing bloodstream infections in Norway in 2002 17: a nationwide, longitudinal, microbial population genomic study. The Lancet Microbe 2, e331 e341 (2021). 2.Arredondo Alonso, S. et al. Plasmid driven strategies for clone success in Escherichia coli. bioRxiv 2023.10.14.562336 (2023) 3.Pontinen, A. K. et al. Modulation of multi drug resistant clone success in Escherichia coli populations: a longitudinal multi country genomic and antibiotic usage cohort study. Lancet Microbe (2024) 4.Kallonen, T. et al. Systematic longitudinal survey of invasive Escherichia coli in England demonstrates a stable population structure only transiently disturbed by the emergence of ST131. Genome Res. (2017) 5.Maklin, T. et al. Strong pathogen competition in neonatal gut colonisation. Nat. Commun. 13, 7417 (2022). 6.Shao, Y. et al. Primary succession of Bifidobacteria drives pathogen resistance in neonatal microbiota assembly. Nat. Microbiol. 9, 2570 2582 (2024). 7.Liu, C. M. et al. Using source associated mobile genetic elements to identify zoonotic extraintestinal E. coli infections. One Health 16, 100518 (2023). 8.Ludden, C. et al. One Health Genomic Surveillance of Escherichia coli Demonstrates Distinct Lineages and Mobile Genetic Elements in Isolates from Humans versus Livestock. MBio 10, (2019). 9.Maklin, T. et al. Geographical variation in colorectal and urinary tract linked cancer incidence is associated with population exposure to colibactin producing Escherichia coli. Lancet Microbe (2024) 10.Blackwell, G. A. et al. Exploring bacterial diversity via a curated and searchable snapshot of archived DNA sequences. PLoS Biol. 19, e3001421 (2021). 11.Horesh, G. et al. A comprehensive and high quality collection of Escherichia coli genomes and their genes. Microb Genom 7, (2021). 12.Dicks, J. et al. NCTC3000: a century of bacterial strain collecting leads to a rich genomic data resource. Microb. Genom. 9, mgen000976 (2023). 13.Zhou, Z., Charlesworth, J. & Achtman, M. HierCC: a multi level clustering scheme for population assignments based on core genome MLST. Bioinformatics 37, 3645 3646 (2021).
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Footnotes
Funding: The project was funded by the Trond Mohn Foundation (grant identifier TMS2019TMT04 to A.K.P., R.A.G., Ø.S., P.J.J., and J.C.). The presented work has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie Actions (grant No. 801,133 to S.A.-A. and A.K.P.), from Wellcome Trust (grant no. 220540/Z/20/A to YS, TL) and has also been supported by the European Research Council (grant No. 742158 to J.C.).
Data Availability
All data produced are available online at https://github.com/rgladstone/EC-K-typing and https://zenodo.org/records/14000489