Abstract
Syphilis is an important sexually transmitted infection caused by the bacterium Treponema pallidum subspecies pallidum. The last two decades have seen syphilis incidence rise in many high-income countries, yet the evolutionary and epidemiological relationships that underpin this are poorly understood, as is the global T. pallidum population structure. We assembled a geographically and temporally diverse collection of clinical and laboratory samples comprising 726 T. pallidum genomes. We used detailed phylogenetic analysis and clustering to show that syphilis globally can be described by only two deeply branching lineages, Nichols and SS14. We show that both of these lineages can be found circulating concurrently in 12 of the 23 countries sampled. To provide further phylodynamic resolution we subdivided Treponema pallidum subspecies pallidum into 17 distinct sublineages. Importantly, like SS14, we provide evidence that two Nichols sublineages have expanded clonally across 9 countries contemporaneously with SS14. Moreover, pairwise genome analysis showed that recent isolates circulating in 14 different countries were genetically identical in their core genome to those from other countries, suggesting frequent exchange through international transmission pathways. This contrasts with the majority of samples collected prior to 1983, which are phylogenetically distinct from these more recently isolated sublineages. Bayesian temporal analysis provided evidence of a population bottleneck and decline occurring during the late 1990s, followed by a rapid population expansion a decade later. This was driven by the dominant T. pallidum sublineages circulating today, many of which are resistant to macrolides. Combined we show that the population of contemporary syphilis in high-income countries has undergone a recent and rapid global expansion.
Competing Interest Statement
MK declares institutional funding from Roche, Hologic and Siemens unrelated to this work. The remaining authors have no competing interests to declare. The funders had no input into the study design, interpretation or decision to submit for publication.
Funding Statement
MAB and NRT are supported by Wellcome funding to the Sanger Institute (#206194). MM is funded by the UKRI and NIHR [COV0335; MR/V027956/1, NIHR200125] and the EDCTP [RIA2018D-249]. DMW is funded by a Queensland Advancing Clinical Research Fellowship from the Queensland Government. DAW is supported by an Investigator Grant (1174555) from the National Health and Medical Research Council of Australia. SAL is funded by the National Institutes of Health (R01 AI42143). The funders had no input into the study design, interpretation or decision to submit for publication. This research was funded in whole, or in part, by the Wellcome Trust (#206194). For the purpose of Open Access, the author has applied a CC BY public copyright licence to any Author Accepted Manuscript version arising from this submission.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
Ethical approval for all clinical samples was granted by the London School of Hygiene and Tropical Medicine Observational Research Ethics Committee (REF#16014) and the National Health Service (UK) Health Research Authority and Health and Care Research Wales (UK; 19/HRA/0112).
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
Sequencing reads for all novel genomes have been deposited at the European Nucleotide Archive in BioProjects PRJEB28546, PRJEB33181 and PRJNA701499. All accessions, corresponding pseudonymised sample identifiers and related metadata are available in Supplementary Data 1.