PT - JOURNAL ARTICLE AU - De Salazar, PM AU - Lu, F AU - Hay, JA AU - Gómez-Barroso, D AU - Fernández-Navarro, P AU - Martínez, E AU - Astray-Mochales, J AU - Amillategui, R AU - García-Fulgueiras, A AU - Chirlaque, MD AU - Sánchez-Migallón, A AU - Larrauri, A AU - Sierra, MJ AU - Lipsitch, M AU - Simón, F AU - Santillana, M AU - Hernán, MA TI - Near real-time surveillance of the SARS-CoV-2 epidemic with incomplete data AID - 10.1101/2021.01.25.20230094 DP - 2021 Jan 01 TA - medRxiv PG - 2021.01.25.20230094 4099 - http://medrxiv.org/content/early/2021/01/26/2021.01.25.20230094.short 4100 - http://medrxiv.org/content/early/2021/01/26/2021.01.25.20230094.full AB - Designing public health responses to outbreaks requires close monitoring of population-level health indicators in real-time. Thus, an accurate estimation of the epidemic curve is critical. We propose an approach to reconstruct epidemic curves in near real time. We apply this approach to characterize the early SARS-CoV-2 outbreak in two Spanish regions between March and April 2020.We address two data collection problems that affected the reliability of the available real-time epidemiological data, namely, the frequent missing information documenting when a patient first experienced symptoms, and the frequent retrospective revision of historical information (including right censoring). This is done by using a novel back-calculating procedure based on imputing patients’ dates of symptom onset from reported cases, according to a dynamically-estimated “backward” reporting delay conditional distribution, and adjusting for right censoring using an existing package, NobBS, to estimate in real time (nowcast) cases by date of symptom onset. This process allows us to obtain an approximation of the time-varying reproduction number (Rt) in real-time.At each step, we evaluate how different assumptions affect the recovered epidemiological events and compare the proposed approach to the alternative procedure of merely using curves of case counts, by report day, to characterize the time-evolution of the outbreak. Finally, we assess how these real-time estimates compare with subsequently documented epidemiological information that is considered more reliable and complete that became available later in time. Our approach may help improve accuracy, quantify uncertainty, and evaluate frequently unstated assumptions when recovering the epidemic curves from limited data obtained from public health surveillance systems in other locations.Competing Interest StatementML discloses honoraria/consulting from Merck, Affinivax, Sanofi-Pasteur, Bristol Myers-Squibb, and Antigen Discovery; research funding (institutional) from Pfizer, and an unpaid scientific advice to Janssen, Astra-Zeneca, One Day Sooner, and Covaxx (United Biomedical). The rest of co-authors declare no competing interest. Funding StatementP.M.D was supported by the fellowship Fundacion Ramon Areces. M.L. was supported by the Morris-Singer Fund and by Models of Infectious Disease Agent Study Award Number U54GM088558 from the National Institute Of General Medical Sciences (US National Institutes of Health). M.S was supported by the National Institute Of General Medical Sciences, award number R01GM130668-02. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:Access to surveillance data for this study was granted by the Ministry of Health, Spain. The surveillance protocol was approved by the Spanish Inter-territorial Council of the National Health System. Our study was part of the public health response to the outbreak, and therefore no explicit ethical evaluation was necessary. The Institutional Review Board (IRB) of the Harvard T.H. Chan School of Public Health (Boston, Massachusetts, US) determined that this study was "Not human subjects research" as defined by DHHS regulations or FDA regulations.All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesThe data that support the findings of the study was obtained from the Spanish System for Surveillance at the National Center of Epidemiology (RENAVE) through the Web platform SiViEs (System for Surveillance in Spain). Data is publicly available as aggregated daily numbers at https://cnecovid.isciii.es/covid19/.