Abstract
Predicting an individual’s risk of infectious disease is a critical technology in infectious disease response. During the COVID-19 pandemic, identifying and isolating individuals at high risk of infection was an essential task for epidemic control. We introduce a new machine learning model that predicts the risk of COVID-19 infection using only individuals’ mobile cell tower location information. This model distinguishes the cell tower location information of an individual into residential and non-residential areas and calculates whether the cell tower locations overlapped with other individuals. It then generates various variables from the information of overlapping and predicts the possibility of COVID-19 infection using a machine learning algorithm. The predictive model we developed showed performance comparable to models using individual’s clinical information. This predictive model, which can be used to predict infections of diseases with asymptomatic infections such as COVID-19, has the advantage of supplementing the limitations of existing infectious disease prediction models that use symptoms and other information.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This study was funded by Bill & Melinda Gates Foundation and RIGHT Foundation.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
Ethics commiette/IRB of Sungkyunkwan University Bioethics Committee Ethics exempted ethical approval for this work. The IRB information * IRB name: Sungkyunkwan University Bioethics Committee * IRB File No: SKKU2022-11-014 * Evaluation Date: November 11, 2022 * Review Result: IRB review exemption
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Footnotes
* Conceptualization, Project administration, Resources, Supervision, Data curation, Formal analysis, Investigation, Methodology, Software, Validation, Visualization, Writing – original draft, Writing – review & editing
↵¶ Conceptualization, Data curation, Investigation, Writing – original draft, Writing – review & editing
↵& Conceptualization, Data curation, Formal analysis, Methodology, Software, Validation, Visualization, Writing – original draft, Writing – review & editing
↵% Conceptualization, Data curation, Writing – review & editing
Data Availability
All data produced are not available. Raw data cannot be shared publicly because of the sensitive nature of the personal information collected, which includes location data and COVID-19 test results. According to South Korean law, unless individual consent is specifically obtained from the data subject, this information cannot be transferred to third parties.