RT Journal Article SR Electronic T1 Machine learning to predict retention and viral suppression in South African HIV treatment cohorts JF medRxiv FD Cold Spring Harbor Laboratory Press SP 2021.02.03.21251100 DO 10.1101/2021.02.03.21251100 A1 Maskew, M A1 Sharpey-Schafer, K A1 Voux, L De A1 Bor, J A1 Rennick, M A1 Crompton, T A1 Majuba, P A1 Sanne, I A1 Pisa, PT A1 Miot, J YR 2021 UL http://medrxiv.org/content/early/2021/02/05/2021.02.03.21251100.abstract AB Background To optimize South Africa’s HIV response and reach PEPFAR’s 95:95:95 targets require same day initiations and patients successfully remaining on antiretroviral therapy, and remaining virally suppressed. Much effort and resources with HIV programmes have centred on tracking and tracing for those lost to follow-up (LTFU), through various back to care strategies to ensure retention of patients. However, programmes have noted the need for targeted, data driven and predictive approaches for identifying most at risk of patients for LTFU and viral unsuppression outcomes using available phenotypic data.Methods We applied predictive machine learning algorithms to anonymised, patient-level HIV programmatic data from two districts in South Africa, January 2016 - July 2019. We developed patient risk scores for two outcomes: 1) visit attendance within 28 days of next scheduled clinic visit and 2) suppression of next HIV viral load (VL). Demographic, clinical, behavioural (e.g. visit patterns) and laboratory data were investigated in multiple models as potential predictor variables of attending next scheduled visit and VL result at next visit.Findings Longitudinal data was analysed using combinations of predictor features and classification algorithms. 445,636 Patients were included in the retention model and 363,977 in the VL model. Predictive accuracy, summarized by Area Under the Curve (AUC), ranged from 0.69 for attendance at next scheduled visit to 0.76 for VL suppression, suggesting the model correctly classified patients to whether a scheduled visit would be attended in 2 out of 3 patients and whether the VL result at next test would be suppressed in approximately 3 out of 4 patients. Key predictors for both retention & VL suppression include metrics on prior late or LTFU visits, the number of prior VL tests, the time since their last visit, the number of visits on their current regimen, the patients’ age, and their time on treatment. For retention, the number of visits at the current facility and the details of the next appointment date were also predictors. For VL suppression, other predictors included the range of thee previous VL value.Conclusion Machine learning can be used to identify HIV patients at risk for defaulting on treatment and not being virally suppressed. Predictive modelling can improve targeting of interventions through differentiated models of care, increasing cost-effectiveness and improving patient outcomes.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis study has been made possible by the generous support of the American People and the President's Emergency Plan for AIDS Relief (PEPFAR) through USAID under the terms of Cooperative Agreement 72067419CA00004 to HE2RO. The contents are the responsibility of the authors and do not necessarily reflect the views of PEPFAR, USAID or the United States Government. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. The corresponding author had full access to all the data in the study and had final responsibility for the decision to submit for publication.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:Analysis was approved by the Human Research Ethics Committee of the University of the Witwatersrand (M140201).All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesAccess to primary data is subject to restrictions owing to privacy and ethics policies set by the South African Government.