PT - JOURNAL ARTICLE AU - Abdulhameed, Yunus A. AU - Roberts, Samuel AU - Aguilar, Jacob B. AU - Kercheville, James AU - Gutierrez, Juan B. TI - Data rectification to account for delays in reporting disease incidence with an application to forecasting COVID-19 cases AID - 10.1101/2024.04.08.24305398 DP - 2024 Jan 01 TA - medRxiv PG - 2024.04.08.24305398 4099 - http://medrxiv.org/content/early/2024/04/12/2024.04.08.24305398.short 4100 - http://medrxiv.org/content/early/2024/04/12/2024.04.08.24305398.full AB - Effective monitoring of infectious disease incidence remains a major challenge to public health. Difficulties in estimating the trends in disease incidence arise mainly from the time delay between case diagnosis and the reporting of cases to public health databases. However, predictive models usually assume that public data sets faithfully reflect the state of disease transmission. In this paper, we study the effect of delayed case reporting by comparing data reported by the Johns Hopkins Coronavirus Resource Center (CRC) with that of the raw clinical data collected from the San Antonio Metro Health District (SAMHD), San Antonio, Texas. An insight on the subtle effect that such reporting errors potentially have on predictive modeling is presented. We use an exponential distribution model for the regression analysis of the reporting delay. The proposed model for correcting reporting delays was applied to our recently developed SEYAR (Susceptible, Exposed, Symptomatic, Asymptomatic, Recovered) dynamical model for COVID-19 transmission dynamics. Employing data from SAMHD, we demonstrate that the forecasting ability of the SEYAR model is substantially improved when the rectified reporting obtained from our proposed model is utilized. The methods and findings demonstrated in this work have ample applicability in the forecasting of infectious disease outbreaks. Our findings suggest that failure to consider reporting delays in surveillance data can significantly alter forecasts.Competing Interest StatementThe authors have declared no competing interest.Clinical Protocols https://mathresearch.utsa.edu/wp/?p=58 Funding StatementThe work was done during my postdoc at UTSAAuthor DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:The City of San Antonio entered a partnership with Bexar County to create the San Antonio Metro Health District (SAMHD); this entity oversees collecting case data for infectious diseases. San Antonio is part of the South Texas Regional Advisory Council (STRAC) designated by the Texas Department of State Health Services (DSHS) to maintain and develop emergency healthcare system. As a result of the interaction between the PI and government officials, a data use agreement between University of Texas at San Antonio, SAMHD and STRAC was enacted on April 5th, 2020. The two prominent sources of source data used for analysis used to obtain the datasets was the Coronavirus Resource Center at Johns Hopkins University. See link below. https://www.nytimes.com/interactive/2021/us/covid-cases.html. An interactive web-based dashboard to track COVID-19 in real time. The Lancet infectious diseases https://www.thelancet.com/journals/laninf/article/PIIS1473-3099(20)30120-1/fulltext https://www.sa.gov/Directory/Departments/SAMHD https://mathresearch.utsa.edu/wp/?p=58 I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.YesAll data produced in the present study are available upon reasonable request to the authors