PT - JOURNAL ARTICLE AU - Dantas, Leila F. AU - Peres, Igor T. AU - Bastos, Leonardo S. L. AU - Marchesi, Janaina F. AU - de Souza, Guilherme F. G. AU - Gelli, João Gabriel M. AU - Baião, Fernanda A. AU - Maçaira, Paula AU - Hamacher, Silvio AU - Bozza, Fernando A. TI - App-based symptom tracking to optimize SARS-CoV-2 testing strategy using machine learning AID - 10.1101/2020.09.01.20186049 DP - 2020 Jan 01 TA - medRxiv PG - 2020.09.01.20186049 4099 - http://medrxiv.org/content/early/2020/09/03/2020.09.01.20186049.short 4100 - http://medrxiv.org/content/early/2020/09/03/2020.09.01.20186049.full AB - Background Tests are scarce resources, especially in low and middle-income countries, and the optimization of testing programs during a pandemic is critical for the effectiveness of the disease control. Hence, we aim to use the combination of symptoms to build a regression model as a screening tool to identify people and areas with a higher risk of SARS-CoV-2 infection to be prioritized for testing.Materials and Methods We applied machine learning techniques and provided a visualization of potential regions with high densities of COVID-19 as a risk map. We performed a retrospective analysis of individuals registered in “Dados do Bem”, an app-based symptom tracker in use in Brazil.Results From April 28 to July 16, 2020, 337,435 individuals registered their symptoms through the app. Of these, 49,721 participants were tested for SARS-CoV-2 infection, being 5,888 (11.8%) positive. Among self-reported symptoms, loss of smell (OR[95%CI]: 4.6 [4.4 - 4.9]), fever (2.6 [2.5 - 2.8]), and shortness of breath (2.1 [1.6-2.7]) were associated with SARS-CoV-2 infection. Our final model obtained a competitive performance, with only 7% of false-negative users among the predicted as negatives (NPV = 0.93). From the 287,714 users still not tested, our model estimated that only 34.5% are potentially infected, thus reducing the need for extensive testing of all registered users. The model was incorporated by the “Dados do Bem” app aiming to prioritize users for testing. We developed an external validation in the state of Goias and found that of the 465 users selected, 52% tested positive.Conclusions Our results showed that the combination of symptoms might predict SARS-Cov-2 infection and, therefore, can be used as a tool by decision-makers to refine testing and disease control strategies.Competing Interest StatementThe authors have declared no competing interest.Funding StatementWe would like to thank all collaborators from Dados do Bem, which specified and developed the application, and supported data collection. This work was supported by Conselho Nacional de Desenvolvimento Cientifico e Tecnologico (CNPq) (Grant numbers 169049/20175 to L.F.D., 306802/20155 and 403863/20163 to S.H., 310523/20168 to F.A.B., 144636/20190 to I.T.P.); Coordenacao de Aperfeicoamento de Pessoal de Nivel Superior (CAPES) Finance Code 001 (Grant number 88887.178844/201800 to L.S.L.B); Fundacao de Amparo a Pesquisa do Estado do Rio de Janeiro (FAPERJ); and Pontificia Universidade Catolica do Rio de Janeiro (PUC-Rio). Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:The study is retrospective and had no human interference. All data acquired were anonymized, and the "Dados do Bem" app follows the Brazilian General Data Protection Regulation (Lei Geral de Protecao de Dados - LGPD). All users provided informed consent of de-identified data-use to non-commercial research upon registration in the app. All answers were optional.All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).Yes I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesThe data that support the findings of this study are available from "Dados do Bem" but restrictions apply to the availability of these data, which were used under license for the current study, and so are not publicly available. Data are however available from the authors upon reasonable request and with permission of "Dados do Bem".