RT Journal Article SR Electronic T1 EHR-based Case Identification of Pediatric Long COVID: A Report from the RECOVER EHR Cohort JF medRxiv FD Cold Spring Harbor Laboratory Press SP 2024.05.23.24307492 DO 10.1101/2024.05.23.24307492 A1 Botdorf, Morgan A1 Dickinson, Kimberley A1 Lorman, Vitaly A1 Razzaghi, Hanieh A1 Marchesani, Nicole A1 Rao, Suchitra A1 Rogerson, Colin A1 Higginbotham, Miranda A1 Mejias, Asuncion A1 Salyakina, Daria A1 Thacker, Deepika A1 Dandachi, Dima A1 Christakis, Dimitri A A1 Taylor, Emily A1 Schwenk, Hayden A1 Morizono, Hiroki A1 Cogen, Jonathan A1 Pajor, Nate M A1 Jhaveri, Ravi A1 Forrest, Christopher B. A1 Bailey, L. Charles A1 YR 2024 UL http://medrxiv.org/content/early/2024/05/23/2024.05.23.24307492.abstract AB Objective Long COVID, marked by persistent, recurring, or new symptoms post-COVID-19 infection, impacts children’s well-being yet lacks a unified clinical definition. This study evaluates the performance of an empirically derived Long COVID case identification algorithm, or computable phenotype, with manual chart review in a pediatric sample. This approach aims to facilitate large-scale research efforts to understand this condition better.Methods The algorithm, composed of diagnostic codes empirically associated with Long COVID, was applied to a cohort of pediatric patients with SARS-CoV-2 infection in the RECOVER PCORnet EHR database. The algorithm classified 31,781 patients with conclusive, probable, or possible Long COVID and 307,686 patients without evidence of Long COVID. A chart review was performed on a subset of patients (n=651) to determine the overlap between the two methods. Instances of discordance were reviewed to understand the reasons for differences.Results The sample comprised 651 pediatric patients (339 females, Mage = 10.10 years) across 16 hospital systems. Results showed moderate overlap between phenotype and chart review Long COVID identification (accuracy = 0.62, PPV = 0.49, NPV = 0.75); however, there were also numerous cases of disagreement. No notable differences were found when the analyses were stratified by age at infection or era of infection. Further examination of the discordant cases revealed that the most common cause of disagreement was the clinician reviewers’ tendency to attribute Long COVID-like symptoms to prior medical conditions. The performance of the phenotype improved when prior medical conditions were considered (accuracy = 0.71, PPV = 0.65, NPV = 0.74).Conclusions Although there was moderate overlap between the two methods, the discrepancies between the two sources are likely attributed to the lack of consensus on a Long COVID clinical definition. It is essential to consider the strengths and limitations of each method when developing Long COVID classification algorithms.Competing Interest StatementDr. Mejias reports funding from Janssen, Merck for research support, and Janssen, Merck and Sanofi-Pasteur for Advisory Board participation; Dr. Rao reports prior grant support from GSK and Biofire and is a consultant for Sequiris. Dr. Jhaveri is a consultant for AstraZeneca, Seqirus and Dynavax, and receives an editorial stipend from Elsevier. All other authors have no conflicts of interest to disclose.Funding StatementThis research was funded by the National Institutes of Health (NIH) Agreement OT2HL161847-01 as part of the Researching COVID to Enhance Recovery (RECOVER) program of research.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:Institutional Review Board (IRB) approval was obtained under Biomedical Research Alliance of New York (BRANY) protocol #21-08-508. BRANY IRB waived the need for consent and HIPAA authorization.I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.YesAll data produced in the present study are available upon reasonable request to the authors. PASCpost-acute sequelae of SARS-CoV-2 infectionCOVID-19coronavirus disease 2019SARS-CoV-2severe acute respiratory syndrome coronavirus 2PCRpolymerase chain reactionEHRelectronic health recordMIS-Cmultisystem inflammatory syndrome in childrenICD-10International Classification of Diseases, version 10