Data availability
The dataset used for biomarker development is publicly available, open access with no restrictions, as described in detail in Wallen et al 2022 9. The dataset is available on two repositories to enable the investigators to download the data in either the raw or processed form. The raw metagenomic sequences and accompanying metadata are on NCBI SRA under BioProject ID PRJNA834801 [https://www.ncbi.nlm.nih.gov/bioproject/834801]. The post-QC and post taxonomic profiling data are on Zenodo [https://zenodo.org/record/7246185]. Here we used species-level data (presented in Supplement) that was extracted from the taxonomic profiling data downloaded from Zenodo in January 2024.