ABSTRACT
The cases of COVID-19 have been reported in the United States since January 2020. There were over 103 million confirmed cases and over one million deaths as of March 23, 2023. We propose a COVINet by combining the architecture of both Long Short-Term Memory and Gated Recurrent Unit and incorporating actionable covariates to offer high-accuracy prediction and explainable response. First, we train COVINet models for confirmed cases and total deaths with five input features, compare their Mean Absolute Errors (MAEs) and Mean Relative Errors (MREs) and benchmark COVINet against ten competing models from the United States CDC in the last four weeks before April 26, 2021. The results show that COVINet outperforms all competing models for MAEs and MREs when predicting total deaths. Then, we focus on the prediction for the most severe county in each of the top 10 hot-spot states using COVINet. The MREs are small for all predictions made in the last 7 or 30 days before March 23, 2023. Beyond predictive accuracy, COVINet offers high interpretability, enhancing the understanding of pandemic dynamics. This dual capability positions COVINet as a powerful tool for informing effective strategies in pandemic prevention and governmental decision-making.
Competing Interest Statement
The authors have declared no competing interest.
Clinical Trial
Not Applicable
Funding Statement
No funding was received for this article
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
Not Applicable
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Footnotes
We updated the data preprocessing method, refreshed the data to 2023, and compared the results of our approach with several other methods.
Data Availability
We collected the number of cumulative confirmed cases and total deaths fromWe collect the daily numbers of cumulative confirmed cases and deaths from January 21, 2020, to March 23, 2023, for infected counties in the US from the New York Times. The daily cumulative confirmed cases and deaths are collected from health departments and the US CDC, where patients are identified as "confirmed" based on positive laboratory tests and clinical symptoms and exposure. All risk factors are compiled from 2020 annual data on the County Health Rankings and Roadmaps program's official website. In addition, the longitude and latitude of each infected county are collected from Census TIGER 2000. January 21 to May 19, 2020, for counties in the United States from the New York Times, based on reports from state and local health agencies. The county health rankings reports from the year 2020 were compiled from the County Health Rankings and Roadmaps program official website.