1. Summary
Background HIV-1 infections initiated by multiple founder variants are characterised by a higher viral load and a worse clinical prognosis, yet little is known about the routes of exposure through which multiple variant transmission is most likely, and whether methods of quantifying the number of founder variants differ in their accuracy.
Methods We conducted a systematic review of studies that estimated founder variant multiplicity in HIV-1 infection, searching MEDLINE, EMBASE and Global Health databases for papers published between 1st January 1990 and 14th September 2020 (PROSPERO study CRD42020202672). Leveraging individual patient estimates from these studies, we performed a logistic meta-regression to estimate the probability that an HIV infection is initiated by multiple founder variants. We calculated a pooled estimate using a random effects model, subsequently stratifying this estimate across nine transmission routes in a univariable analysis. We then extended our model to adjust for different study methods in a multivariable analysis, recalculating estimates across the nine transmission routes.
Findings We included 71 publications in our analysis, comprising 1664 individual patients. Our pooled estimate of the probability that an infection is initiated by multiple founder variants was 0·25 (95% CI: 0·21-0·30), with moderate heterogeneity (Q = 137·1, p < ·001, I2 = 65·3%). Our multivariable analysis uncovered differences in the probability of multiple variant infection by transmission route. Relative to a baseline of male-to-female transmission, the probability for female-to-male multiple variant transmission was significantly lower at 0·10 (95% CI: 0·05-0·21), while the probability for people-who-inject-drugs (PWID) transmission was significantly higher at 0·29 (0·13-0·52). There was no significant difference in the probability of multiple variant transmission between male-to-female transmission (0·16 (0·08-0·29)), post-partum mother-to-child (0·12 (0·02-0·51)), pre-partum mother-to-child (0·13 (0·05-0·32)), intrapartum mother-to-child (0.21 (0·08-0·44)) and men-who-have-sex-with-men (MSM) transmission (0·23 (0·03-0·7)).
Interpretation We identified PWID transmissions are significantly more likely to result in an infection initiated by multiple founder variants, whilst female-to-male infections are significantly less likely. Quantifying how the routes of HIV infection impact the transmission of multiple variants allows us to better understand how the evolution and epidemiology of HIV-1 determine the clinical picture.
Funding This study was supported by the MRC Precision Medicine Doctoral Training Programme (ref: 2259239) and a ERC Starting Grant awarded to KEA (award number 757688).
Evidence before this study The majority of HIV-1 infections are initiated by a single, genetically homogeneous founder variant. Infections initiated by multiple founders, however, are associated with a significantly faster decline of CD4+ T Cells in untreated individuals, ultimately leading to an earlier onset of AIDS. Through our systematic search of MEDLINE, EMBASE and Global Health databases, we identified 82 studies that classify the founder variant multiplicity of acute HIV infections. As these studies vary in the methodology used to calculate the number of founder variants, it is difficult to evaluate the multiplicity of founder variants across routes of exposure.
Added value of this study Using meta-regression, we estimated the probability of multiple founder infections across exposure routes by accounting for variability in methodology between studies. Our multivariable meta-regression adjusted for heterogeneity across study methodology and uncovered differences in the probability that an infection is initiated by multiple founder variants by transmission route, with the probability for female-to-male transmission significantly lower than for male-to-female transmission. By contrast, the probability for transmission among people-who-inject-drugs (PWID) was significantly higher. There was no difference in the probability of multiple founder variant transmission for mother-to-child transmission or men-who-have-sex-with-men (MSM) when compared with male-to-female.
Implications of all the available evidence Because HIV-1 infections initiated by multiple founders are associated with a poorer prognosis, determining whether the route of infection affects the probability of transmission of multiple variants will facilitate an improved understanding of how the evolution and epidemiology of HIV-1 determine clinical progression. Our results identify that PWID transmissions are significantly more likely to result in an infection initiated by multiple founder variants compared to male-to-female. This reiterates the need for focussed public health programmes that reduce the burden of HIV-1 in this vulnerable risk group.
3. Introduction
Transmission of HIV-1 results in a dramatic reduction in genetic diversity, with a large proportion of infections initiated by a single founder variant.1,2 An appreciable minority of infections, however, appear to be the result of multiple founder variants simultaneously transmitted in a single exposure.3 Importantly, these multiple founder infections are associated with both significant increases in set point viral load and the rate of CD4+ T lymphocyte decline.4–7 HIV-1 infections initiated via different routes of exposure are subject to different virological, cellular and physiological environments, which likely influence the probability of acquiring infection.8–10 For example, the probability of transmission upon exposure increases six-fold between heterosexual transmission and transmission between people who inject drugs (PWID), and up to eighteen-fold for men who have sex with men (MSM).11
Despite these differences in the probability of HIV-1 acquisition by route of exposure, there is currently no consensus about the effect of route of exposure on the transmission of multiple founder variants. Differences in selection pressure during transmission have been observed between sexual exposure routes, with reduced selection occurring during transmission from males to females than vice-versa, and less selection occurring between men who have sex with men (MSM) relative to those heterosexual exposure overall.12,13 However, studies quantifying the number of founder variants are inconsistent with these findings, which may be due to differences in methodology and study population.3,12,14,15 In sexual transmission, the probability of both transmission and founder variant multiplicity may also be influenced by inflammation, genital ulcerative disease and hormonal contraception, perhaps suggesting that the integrity of mucosal barrier underpins this process.14,16 But, a significantly higher proportion of multiple founder infections in PWID transmissions, which bypass mucosal barriers altogether, has also not been consistently observed and so the role of exposure on the risk of acquiring a multiple founder infection remains unclear.17,18 To estimate the role of exposure route on the acquisition of multiple HIV-1 founder variants, we conducted a meta-regression leveraging all available individual patient data, accounting for heterogeneity across methodology and study population.
4. Methods
4.1. Search Strategy and Eligibility Criteria
We searched MEDLINE, EMBASE and Global Health databases for papers published between 1 January 1990 to 14 September 2020 (S2: Supplementary Methods). To be included, studies must have reported original estimates of founder variant multiplicity in people acutely infected with HIV-1, be written in English and document ethical approval. Studies were excluded if they did not distinguish between single and multiple founder variants, if they did not detail the methods used, or if the study was conditional on having identified multiple founder variants. Additionally, studies were excluded if they solely reported data concerning people living with HIV-1 who had known or suspected superinfection, who were documented as having received pre-exposure prophylaxis, or if the transmitting partner was receiving antiretroviral treatment. No restrictions were placed on study design, geographic location, or age of participants. Publications were screened independently by SL and JB. Reviewers were blinded to the publication authors during the title and abstract screens and full text reviews were conducted independently, before a consensus was reached, with consultation with other co-authors when necessary.
4.2. Data Extraction
Individual patient data (IPD) were collated from all studies, with authors contacted if these data were not available. Studies were excluded from further analysis if no IPD were obtained. Only individuals for whom a route of exposure was known were included. Additionally, we removed any entries for individuals with known or suspected superinfection, who were receiving pre-exposure prophylaxis or for whom the transmitting partner was receiving antiretroviral therapy. For this final individual patient dataset for analysis, we recorded whether an infection was initiated by one or multiple variants and nine predetermined covariates:
Route of exposure. Female-to-male (HSX-FTM), male-to-female (HSX-MTF), men-who-have-sex-with-men (MSM), pre-partum, intrapartum and post-partum mother to child (MTC), or people who inject drugs (PWID)).
Method of quantification. Methodological groupings were defined by the properties of each approach, resulting in six levels: phylogenetic, haplotype, distance, model, or molecular (Table 1). Molecular methods interpret the formation of heteroduplexes during gel electrophoresis of viral RNA; haplotype methods identify linkage patterns of individual polymorphisms; distance and model-based methods assume a threshold or distribution of diversity that is reasonably expected to occur under a hypothesis of neutral exponential growth from a single founder and determine whether the observed diversity is consistent with the modelled values; and phylogenetic methods either use recipient sequences only, in which case a star-like topology is expected to be observed for single founder infections, or use source and recipient sequences from known transmission pairs, such that the number of distinct clades of recipient sequences nested within the source sequences corresponds to the number of founder variants.
HIV subtype. Canonical geographically delimited subtypes (A-D, F-H, J and K) and circulating recombinant forms (e.g. CRF01_AE).19,20 IPD where subtyping was unclear or not conducted were assigned ‘unknown,’ while putative recombinants not recognised as circulating recombinant forms were assigned ‘recombinant.’
Delay between infection and sampling. For sexual or injection drug use exposure, the delay was classified as either less than or equal to 21 days if the patient was seronegative at time of sampling (Feibig stages I-II) or more than 21 days if the patient was seropositive (Fiebig stages III-VI). For mother-to-child infections, if infection was confirmed at birth, or within 21 days of birth, the delay was classified as either less than or equal to 21 days. A positive mRNA or antibody test definitively reported after this period was classified as a delay of greater than 21 days.
Number of genomes analysed per participant.
Genomic region analysed. Classified as envelope (Env), pol, gag or near full length genome (NFLG).
Alignment length analysed. Measured in base pairs, discretised at 250, 500, 1000, 2000, 4000, 8000, near full length genome (NFLG) intervals.
Use of single genome amplification (SGA) to generate viral sequences. A binary classification (yes or no) as to whether the viral genomic data were generated using SGA. Regular bulk or near endpoint polymerase chain reaction (PCR) amplification can generate significant errors such as Taq-polymerase mediated template switching, nucleotide misincorporation or unequal amplicons resampling.21,22 In SGA, serial dilutions of viral nucleic acids are made, which, assuming the proportion of positive PCR reaction at each dilution follows a null Poisson distribution, reduces the final reactions to contain a single variant that can be cloned, sequenced and then analysed.22,23
Study cohort. The epidemiological cohort from which the patient was sampled.
If information from any of these nine covariates was missing or could not be inferred from the study, we classified its value as unknown. We excluded covariate levels for which there were fewer than 6 data points. For our base case analysis, we removed repeat measurements for the same individual, and used only those from the earliest study or, where the results of different methods were reported by the same study, the conclusive method used for each individual.
4.3. Pooled Meta-Analysis
We calculated a pooled estimates of the probability of multiple founder variant infection from our base case model: a ‘one-step’ generalised linear mixed model (GLMM) assuming an exact binomial distribution, with a normally distributed random effect on the intercept for within-study clustering and fitted by approximate maximum likelihood.25 Heterogeneity was measured in terms of τ2, the between-study variance; I2, the percentage of variance attributable to study heterogeneity; and Cochran’s Q, an indicator of larger variation between studies than of subjects within studies.26 Publication bias was assessed using funnel plots and Egger’s regression test.27
Whilst pooled estimates obtained through a ‘one-step’ approach are usually congruent with the canonical ‘two-step’ meta-analysis model, discrepancies may arise due to differences in likelihood specification, weighting schemes, and specification of the intercept or estimation of residual variances.28 We compared the results from our base case model with a two-step binomial-normal model to confirm our estimates were consistent. We performed additional sensitivity analyses to test the robustness of our pooled estimate to our exclusion criteria: iteratively excluding single studies, excluding studies that contained fewer than 10 participants, excluding studies that consisted solely of single founder infections, excluding IPD that did not use single genome amplification, and including only those data that matched our reference methodology of haplotype-based methods and whole genome analysis. In each of these sensitivity analyses, the base case model was refitted as previously described. To investigate the impact of our treatment of repeated measurements, we created 1000 datasets in which the included datapoint for each individual was sampled at random from a pool of their possible measurements. Each of these 1000 datasets thus contained a single datapoint per individual and we refitted the base case model to calculate a distribution of pooled estimates.
4.4. Meta-regression
We extended our base case model by conducting a univariable meta-regression with each covariate contributing a fixed effect and, assuming normally distributed random effects of publication. Pooled heterogeneity measures were calculated for each covariate level. We extended the base case model in a multivariable analysis, where we defined publication and cohort as crossed random effects before sequentially adding fixed effects covariates and evaluating interactions; assessing convergence, singularity and multicollinearity between fixed effects. The fixed effects were selected according to a ‘keep it maximal’ principle, in which covariates were only removed to facilitate a non-singular fit.29 We defined our reference case as heterosexual male-to-female transmission, evaluated through haplotype-based methods, analysis of the whole genome sequences and a sampling delay of less than 21 days. Stratified predictions of the proportion of infections initiated by multiple founders and bootstrapped 95% confidence intervals, conditioned on the reference case, were calculated. We performed sensitivity analyses to test the robustness of the selected multivariable meta-regression model: iteratively excluding single studies, excluding studies that contained fewer than 10 participants, excluding studies that consisted solely of single founder infections and excluding IPD that did not use single genome amplification. The re-sampling sensitivity analysis was repeated on our selected multivariable model as described above.
5. Results
5.1. Study and Patient Selection
Our search found 7416 unique papers, of which 7334 were excluded. Of the remaining 380 results, 207 were further excluded after abstract screening, leaving a total of 82 eligible studies for individual patient data (IPD) collation. We successfully extracted IPD from 80 of these studies, comprising 3251 data points. The 80 selected studies from which IPD were collated, were published between 1992 and 2020. Of the 3251 data points extracted, 1477 were excluded from our base case analysis to avoid repeated measurements; arising either between different studies that analysed the same individuals (resulting in the exclusion of five studies), or from repeat analysis of individuals within the same study. After excluding participants for whom the route of exposure was unknown or for whom one or more of their covariate values did not meet the minimum number of observations across the whole participants range of values, our final dataset for our base case analysis comprises estimates from 1664 unique patients across 71 studies.
5.2. Study and Patient Characteristics
Our base case dataset includes a median of 13 participants per study (range 2-124) and represents infections associated with heterosexual transmission (42·2%, (n = 703), MSM transmission (37·3%, n = 621), MTC mother-to-child transmission (14·1%, n = 234), and PWID transmission (6·4%, n = 106) (Fig. S3). Among heterosexual transmissions, 67·6% (n = 475) were male-to-female transmissions, 30% (n = 211) were female-to-male transmissions, with the remainder undisclosed (n = 17). Similarly, we subdivided MTC transmission according to the timing of infection with 44·4% (n = 104) pre-partum, 24·4% (n = 57) intrapartum, 4·7% (n = 11) post-partum, with the remainder undisclosed (n = 62). Our dataset spanned geographical regions and dominant subtypes, capturing the diversity of the HIV epidemic (Figs 2, S3). Across the base case dataset, phylogenetic methods constituted 37·1% (n = 618) of estimates, 26·7% (n = 445) were estimated using haplotype methods, 20·9% (n = 347) using molecular methods, and 12·9% (n = 215) and 2·34% (n = 39) of estimates were inferred using distance and model-based methods respectively (Table 2, Fig 2).
5.3. Meta-analyses
5.3.1. Pooled Estimate
Our base case analysis using a GLMM estimated the probability that an infection is initiated by multiple founder variants to be 0.25 (95% CI: 0.21-0.29), identifying significant heterogeneity (Q = 137.1, p <. 001, I2 = 65·3%). Our sensitivity analyses revealed the pooled estimate is robust to the choice of model, the inclusion of estimates from repeat participants, and to the exclusion of studies that contained fewer than 10 participants (Fig. S4, S5). While analysing only data that matched our reference case study methodology did not change our estimate, it widened the confidence intervals of our estimate (0.25 (95% CI: 0.05-0.67)). We did not identify any studies that significantly influenced the pooled estimate (Fig. S6). Visual inspection of a funnel plot and a non-significant Egger’s Test (t = -0·2663, df = 56, p = 0·7910), were consistent with an absence of publication bias in our dataset (Fig. S7).
5.3.2. Meta-Regression
We extended our base case binomial GLMM using uni- and multivariable fixed effects. Relative to a reference exposure route of male-to-female transmission, our univariable analysis found significantly lower odds of female-to-male transmission being initiated by multiple founder variants (Odds Ratio (OR): 0·56 (95% CI 0·33-0·87)), while other exposure routes were not significantly different. The univariable analyses also indicated significantly greater odds of multiple founder variants if the envelope genomic region was analysed (OR: 2·06 (95% CI:1·16-3·98)), relative to the whole genome. Other methodological covariates, however, such as method of quantification and sampling delay were not significantly associated with the odds that HIV-1 infection is initiated by multiple founder variants.
Our base case multivariable model calculated the probability of multiple founder variants across the seven routes of transmission controlling for method, genomic region and sampling delay (Fig. 3). Compared to a male-to-female transmission probability of 0·16 (95% CI: 0·08-0·29), there was no evidence that the probability of multiple founder variants differed across MSM (0·23 (0·03-0·7)) or MTC transmission. Stratifying MTC transmissions by the putative timing of infection, we calculated pre-partum were initiated by multiple founders with probability 0·13 (0·05-0·32), post-partum with probability 0·12 (0·02-0·51), and intrapartum transmissions with probability 0·21 (0·08-0·44).
By contrast, we found that female-to-male transmissions were less likely to be initiated by multiple founders than male-to-female transmissions, with probability 0·10 (95% CI: 0·05-0·21) (OR: 0·61 (95% CI 0·36-0·94)). Conversely, PWID transmission was more likely to be initiated by multiple founders (0·29 (0·13-0·52)), compared to male-to-female (OR: 2.19 (1·10-4·42)).
We calculated the accuracy of estimating the probability of multiple founder variants compared to a gold-standard methodological reference scenario of using haplotype-based methods on whole genome sequences with individuals with less than 21 delays between infection and sampling. Our base case analysis indicates using model-based methods underestimates the chance of multiple founder variants (OR: 0·32 (95% CI: 0·05-0·82)), while using the gag or envelope genomic regions overestimates the chance of detecting multiple founder variants by (OR of 4·32; 95% CI:1·03-20·47 and 1·78 (0·99-3·86) respectively). Our sensitivity analyses revealed the odds ratios calculated using the uni- and multivariable models are robust to inclusion of data from repeated participants, and to the exclusion of studies that contained fewer than 10 participants, of studies that consisted solely of single founder infections, and of individual data that did not use single genome amplification (Fig. S7).
6. Discussion
Using data from 71 previous studies, we estimated that a quarter of HIV-1 infections are initiated by multiple founder variants. When controlling for different methodologies across studies, the probability that an infection is initiated by multiple founders decreased relatively by 37.5% for female-to-male infections with respect to a baseline of male-to-female infections, but increased by 81.25% for infections transmitted between people who inject drugs. Further, we found that model-based methods, representing a group of approaches that determine founder multiplicity by comparing the observed distribution of diversity with that expected under neutral exponential outgrowth from single variant transmission, were less likely to identify multiple founder infections. Together these results suggest that while the exposure route probably influences the number of founder variants, previous comparison has been difficult due to different study methodologies.
Our pooled estimate is consistent with the seminal study of Keele et al., who found 23·5% (24/102) of their participants had infections initiated by multiple founders.3 Our stratified predicted probabilities are also in line with those of previous smaller studies. A nine-study meta-analysis of 354 subjects found 0·34 of PWID infections were initiated by multiple founders compared with 0·29 (95% CI: 0·13-0·52) in our study; 0·2 for heterosexual infections compared to 0·23 (0·06-0·56) and 0·25 for MSM infections for which we calculated 0·23 (0·03-0·7).12 Likewise, an earlier meta-analysis of five studies and 235 subjects found PWID infections were at significantly greater odds than heterosexual infections of being initiated by a single founder, with the frequency of founder variant multiplicity increasing 3-fold, while a smaller, non-signficant 1·5-fold increase was observed with respect to MSM transmissions.17 In both instances, these studies restricted the number of participants so that the methodology in estimating founder variant multiplicity was consistent across all subjects. In this study, in contrast, we were able to extend our meta-analysis by leveraging individual level data to control for methodological sources of heterogeneity.
We did not identify any significant effect of sampling delay on the probability that an infection is identified to be initiated by multiple founders. While previous work has shown a negative association between detection of multiple founder variants and the delay from infection to sampling, this discrepancy is likely due to the range of the delay analysed. Specifically, Leitner and Romero-Severson found a reduced chance of multiple founder variants over a period of 8 years, while our study analyses over a shorter time span of less or greater than a 3 week delay89
Certain routes of transmission that our analysis found to be associated with a higher or lower probability of multiple founder variants, have previously been identified as having higher or lower probabilities of transmission, respectively. For example, we estimated that female-to-male multiple variant transmission is 39% less likely than that of male-to-female, while the per exposure transmission probability has been estimated at half as likely.11 Similarly, MSM infections are 46% more likely to be initiated by multiple founders, but here the probability of infection following a given exposure can be up to 33-fold greater than that in male-to-female infections. By contrast, although PWID infections were found to be the most likely to be initiated by multiple founders, PWID are less likely to be infected upon exposure than MSM, and 14-fold greater than male-to-female infections. Further, mother-to-child infections are not significantly more likely than MSM or heterosexual infections to be initiated by multiple founders, but the probability of infection for mother-to-child exposures is 16-times and 565-times greater, respectively. Our results suggest a complicated relationship between the probability of transmission and the probability of multiple founder transmission.
Our analysis has some limitations. First, our definition of single and multiple founder variants is determined by the individual studies, however questions remain concerning the definition of a founder variant. Recent studies have suggested a continuum of genotypic diversity exists, rather than discrete variants that give rise to distinct phylogenetic diversification trajectories and may not be reflected by this binary classification.90,98 Indeed, although a threshold is specified for distance-based methods, above which the observed diversity is defined to be to great to be explained by neutral exponential growth, this threshold often varies between publications.100,101 For example, both Keele et al and Li et al analysed the diversity of the envelope protein, but whilst the former classifies populations with less than 0.47% diversity as homogenous, Li et al included samples up to 0.75%.3,15 The distinction between single and multiple founder variants may further be blurred by non-coalescent sources of variation such as recombination and APOBEC mediated hypermutation, which would erroneously inflate diversity measures unless accounted for.102,103 Ultimately, the classification of multiple/single founders is subjective and may also be informed by cognitive biases of the authors. This is pertinent to studies which recruit participants from specific, often marginalised risk groups (e.g. MSM, PWID), where authors may have been more likely to classify multiple founder infections based on their prior assumptions.Second, we acknowledge that under the hypothesis that the proportion of infections initiated by multiple founders varies by transmission route, our point estimate will be influenced by the relative proportion of transmission routes in our dataset. Globally, it is estimated that 70% of infections are transmitted heterosexually, compared to 42.2% in our dataset, which reflects the longstanding geographical bias of research towards patients in the global north.104 Therefore, our point estimate should be considered a summary of the published data over the course of the HIV-1 epidemic, and not a global estimate at any fixed point in time. Third, we were unable to account for the stage of infection in the transmitter, despite recent findings that transmitters with acute infections are more likely to initiate multiple variant infections, because we had insufficient data regarding the transmitting partner within our dataset.99 Finally, we acknowledge the bootstrapped confidence intervals are wide and may lead to uncertainty in our estimates. These arise as a product of small sample sizes for certain observations, and the crossed random effects of publication and cohort used in the meta-regression. In particular, our finding that infections analysed using gag are significantly more likely to be initiated by multiple founders demonstrates substantial uncertainty, and is arguably unlikely considering the mutation rate of envelope is significantly higher than gag during primary infection.105 We note that in this case, the results of our univariable analysis of genomic region analysed are more consistent with our prior expectations.
This systematic review and meta analysis has demonstrated that infections initiated by multiple founders account for a quarter of HIV-1 infections across all known routes of transmission. We find that transmissions involving people who inject drugs are significantly more likely to be initiated by multiple founder variants, whilst female-to-male infections are significantly less likely, relative to male-to-female infections. Quantifying how the routes of HIV infection impact the transmission of multiple variants allows us to better understand the evolution, epidemiology and clinical picture of HIV transmission.
7. Contributors
KEA conceived the study. JB, SL, DT, KEA designed the study. JB and SL extracted the data. JB performed the experiments and analysed the data. All authors interpreted the data. JB and KEA drafted the manuscript, with critical revisions from all authors. All authors approved the final version of the manuscript
Data Availability
Previously published data will be available alongside the code used for this study in a GitHub repository
8. Declaration of Interests
The authors declare no competing interests
9. Data Sharing
Code use in this study is available at https://github.com/J-Baxter/foundervariantsHIV_sysreview
12. Supplementary
12.1. S1: PRISMA Checklist
12.2. S2: Supplementary Methods
12.2.1. Full search query submitted to MEDLINE, EMBASE and Global Health databases
(((((transmi*.af. or found*.af. or bottleneck.af. or single.af. or multiple.af. or multiplicity.af. or breakthrough.ti. or TF.af.) and (virus*.af. or variant*.af. or strain.af. or lineage.af. or phenotyp*.af.)) and (HIV.ti. or HIV-1.ti. or human immunodeficiency virus.ti. or env.ti. or envelope.ti or gag.ti. or pol.ti.)) and ((single genome amplification.af. or sga.af. or sgs.af. or ((sequencing.af. or characterized.af.) and (single genome.af. or deep.af. or whole genome.af. or full length.af. or full-length.af.))) or divers*.af. or distance.af. or poisson-fitter.af. or fitness.af. or (monophyletic.af. or paraphyletic.af. or polyphyletic.af.) or (phylogenetic*.af. and (clade.af. or topology.af. or tree.af. or linked.af. or diver*.af. or distance.af. or sieve.af. or molecular dating.af.)))) not ((SIV.ti,ab. or simian immunodeficiency.ti,ab. or fiv.ti,ab. or feline immunodeficiency virus.ti,ab. or exp Hepacivirus/ or Hepatitis.ti,ab. or exp Flaviviridae/ or Tuberculosis.ti,ab. or Enterovirus.ti,ab. or exp Spumavirus/ or diarrhoea.ti,ab. or diarrhea.ti,ab. or superinfection.ti. or exp Malaria/ or CMV.ti,ab. or HPV.ti,ab. or SHIV.ti,ab. OR exp HIV-2/ or phylogeo*.af. or network.ti. or exp HIV Protease Inhibitors/ or exp HIV Integrase Inhibitors/)))
Set to these databases:
Ovid MEDLINE(R) and Epub Ahead of Print, In-Process & Other Non-Indexed Citations, Daily and Versions(R)
Global Health 1910 to 2020 Week 36
EMBASE & EMBASE Classic 1947 – Sep 11
12.2.2. Software and Computational Methods
All code associated with this study is available under GNU General Public License v3.0 at the following GitHub repository: foundervariantsHIV_sysreview.
The analyses were conducted in R 3.6.1, using the following packages: lme4, 1.1-23, (Bates et al. 2007); metafor, 2.4-0, (Viechtbauer 2010); performance, 0.6.1, ; cowplot, 1.0.0, ; ggplot2, 3.3.2,; dplyr, 1.0.3, (Wickham et al. 2015); forcats, 0.5.0, ; mltools, 0.3.5, ; parallel, 3.6.1,; reshape2,1.4.3, ; stringr, 1.4.0, ; tidyr, 1.0.
12.3. S3: Time Structure of Route of Exposure and Method
12.4. S4: Sensitivity Analyses for Pooling
12.5. S5: Leave-One-Out Cross Validation
12.6. S6: Evaluation of Publication Bias
12.7. S7: Sensitivity Analyses for Meta-regression (i)
10. Acknowledgements
JB was supported by the MRC Precision Medicine Doctoral Training Programme (ref: 2259239); CJV-A and KEA were funded by an ERC Starting Grant (award number 757688) awarded to KEA. We are grateful to Kamini Gounder, Mary Kearney, Vladimir Novitsky, Morgane Rolland and Sodsai Tovanabutra for agreeing to share additional individual patient data with the authors in order to complete this study.