Abstract
The COVID-19 pandemic has presented a series of new challenges to governments and health care systems. Testing is one important method for monitoring and therefore controlling the spread of COVID-19. Yet with a serious discrepancy in the resources available between rich and poor countries not every country is able to employ widespread testing. Here we developed machine learning models for predicting the number of COVID-19 cases in a country based on multilinear regression and neural networks models. The models are trained on data from US states and tested against the reported infections in the European countries. The model is based on four features: Number of tests Population Percentage Urban Population and Gini index. The population and number of tests have the strongest correlation with the number of infections. The model was then tested on data from European countries for which the correlation coefficient between the actual and predicted cases R2 was found to be 0.88 in the multi linear regression and 0.91 for the neural network model. The model predicts that the actual number of infections in countries where the number of tests is less than 10% of their populations is at least 26 times greater than the reported numbers.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This research is funded by the Enhanced Undergraduate Research fund from the university of Groningen
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The article is about the use of machine learning to predict the covid infections
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Paper in collection COVID-19 SARS-CoV-2 preprints from medRxiv and bioRxiv
The Chan Zuckerberg Initiative, Cold Spring Harbor Laboratory, the Sergey Brin Family Foundation, California Institute of Technology, Centre National de la Recherche Scientifique, Fred Hutchinson Cancer Center, Imperial College London, Massachusetts Institute of Technology, Stanford University, University of Washington, and Vrije Universiteit Amsterdam.