Data Availability
All data was obtained from the public repositories GISAID, COG-UK, and GenBank. Full sample credit is included in Supplementary Data 1. Code to replicate our analysis is available at https://github.com/jmcbroome/cluster-heuristic. Code for complete simulation of covid-like phylogenetic trees is available at https://github.com/jmcbroome/pandemic-simulator Our implementation of our heuristic is implemented as part of matUtils https://github.com/yatisht/usher with additional documentation at https://usher-wiki.readthedocs.io/en/latest/ Our website source code is available at https://github.com/jmcbroome/introduction-website.