Abstract
Since late 2019, the novel coronavirus SARS-CoV-2 has introduced a wide array of health challenges globally. In addition to a complex acute presentation that can affect multiple organ systems, increasing evidence points to long-term sequelae being common and impactful. The worldwide scientific community is forging ahead to characterize a wide range of outcomes associated with SARS-CoV-2 infection; however, the underlying assumptions in these studies have varied so widely that the resulting data are difficult to compare. Formal definitions are needed in order to design robust and consistent studies of Long COVID that capture variation in long-term outcomes. Even the condition itself goes by three terms, most widely “Long COVID”, but also “post-acute COVID-19 syndrome (PACS)” or “post-acute sequelae of SARS-CoV-2 infection (PASC)”. In the present study, we investigate the definitions used in the literature published to date and compare them against data available from electronic health records and patient-reported information collected via surveys. Long COVID holds the potential to produce a second public health crisis on the heels of the pandemic itself. Proactive efforts to identify the characteristics of this heterogeneous condition are imperative for a rigorous scientific effort to investigate and mitigate this threat.
Competing Interest Statement
Julie A. McMurry: co-founder, Pryzm Health; Melissa A. Haendel: co-founder, Pryzm Health
Funding Statement
The analyses described in this publication were conducted with data or tools accessed through the NCATS N3C Data Enclave (covid.cd2h.org/enclave) and supported by NCATS U24 TR002306. Halie M. Rando was supported by The Gordon and Betty Moore Foundation (GBMF 4552) and the National Human Genome Research Institute (R01 HG010067); Tellen D. Bennett was supported by NIH UL1TR002535 03S2 and NIH UL1TR002535; NIH grant K23HL128909 protected James Brian Byrd's time to participate; Christopher G. Chute was supported by U24 TR002306; Rachel Deer was supported by UTMB CTSA, 2P30AG024832-16 (PI: Volpi). This research was possible because of the patients whose information is included within the data from participating organizations (covid.cd2h.org/dtas) and the scientists who have contributed to the ongoing development of this community resource. The project described was supported by the National Institute of General Medical Sciences, 5U54GM104942-04. The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The N3C data transfer to NCATS is performed under a Johns Hopkins University Reliance Protocol # IRB00249128 or individual site agreements with NIH. Use of the N3C data for this study is authorized under the following IRB protocols: University of North Carolina, University of North Carolina Chapel Hill Institutional Review Board: exempted, 21-0309; Stony Brook University, Office of Research Compliance, Division of Human Subject Protections, Stony Brook University: exempted, IRB2021-00098. The N3C Data Enclave is approved under the authority of the NIH Institutional Review Board for Protocol 000082 associated with NIH iRIS reference number: 546652 entitled: "NCATS National COVID-19 Cohort Collaborative (N3C) Data Enclave Repository." Further information can be found at ncats.nih.gov/n3c/resources.
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Footnotes
** Contact author: Melissa A. Haendel, Center for Health AI, University of Colorado Anschutz Medical Campus, Aurora, CO, USA (melissa{at}tislab.org)
Data Availability
The N3C Data Enclave (covid.cd2h.org/enclave) houses fully reproducible, transparent, and broadly available limited and de-identified datasets (HIPAA definitions: https://www.hhs.gov/hipaa/for-professionals/privacy/specialtopics/de-identification/index.html). Data are accessible to investigators at institutions that have signed a Data Use Agreement with NIH, provided the investigators have taken human subjects and security training and attest to the N3C User Code of Conduct. Investigators wishing to access the limited dataset must also supply an institutional IRB protocol. All requests for data access are reviewed by the NIH Data Access Committee. A full description of the N3C Enclave governance has been published [193]; information about how to apply for access is available on the NCATS website: https://ncats.nih.gov/n3c/about/applying-for-access. Reviewers and health authorities will be given access permission and guidance to aid reproducibility and outcomes assessment. A Frequently Asked Questions page about the data and access is available at https://ncats.nih.gov/n3c/about/program-faq. The data model is OMOP 5.3.1; specifications are posted at https://ncats.nih.gov/files/OMOP_CDM_COVID.pdf