RT Journal Article
SR Electronic
T1 Clinical coding of long COVID in primary care 2020-2023 in a cohort of 19 million adults: an OpenSAFELY analysis
JF medRxiv
FD Cold Spring Harbor Laboratory Press
SP 2023.12.04.23299364
DO 10.1101/2023.12.04.23299364
A1 Henderson, Alasdair D
A1 Butler-Cole, Ben FC
A1 Tazare, John
A1 Tomlinson, Laurie A
A1 Marks, Michael
A1 Jit, Mark
A1 Briggs, Andrew
A1 Lin, Liang-Yu
A1 Carlile, Oliver
A1 Bates, Chris
A1 Parry, John
A1 Bacon, Sebastian CJ
A1 Dillingham, Iain
A1 Dennison, William A
A1 Costello, Ruth E
A1 Wei, Yinghui
A1 Walker, Alex J
A1 Hulme, William
A1 Goldacre, Ben
A1 Mehrkar, Amir
A1 MacKenna, Brian
A1 Herrett, Emily
A1 Eggo, Rosalind M
YR 2023
UL http://medrxiv.org/content/early/2023/12/04/2023.12.04.23299364.abstract
AB Background: Long COVID is the patient-coined term for the persistent symptoms of COVID-19 illness for weeks, months or years following the acute infection. There is a large burden of long COVID globally from self-reported data, but the epidemiology, causes and treatments remain poorly understood. Primary care is used to help identify and treat patients with long COVID and therefore Electronic Health Records (EHRs) of past COVID-19 patients could be used to help fill these knowledge gaps. We aimed to describe those with long COVID in primary care records in England. Methods: With the approval of NHS England we used routine clinical data from over 19 million adults in England linked to SARS-COV-2 test result, hospitalisation and vaccination data to describe trends in the recording of 16 clinical codes related to long COVID between November 2020 and January 2023. We calculated rates per 100,000 person-years and plotted how these changed over time.
We compared crude and minimally adjusted rates of recorded long COVID in patient records between different key demographic and vaccination characteristics using negative binomial models. Findings: We identified a total of 55,465 people recorded to have long COVID over the study period, with incidence of new long COVID records increasing steadily over 2021, and declining over 2022. The overall rate per 100,000 person-years was 177.5 cases in women (95% CI: 175.5-179) and 100.5 in men (95% CI: 99.5-102). In terms of vaccination against COVID-19, the lowest rates were observed in those with 3+ vaccine doses (103.5 [95% CI: 101.5-105]). Finally, the majority of those with a long COVID record did not have a recorded positive SARS-COV-2 test 12 weeks before the long COVID record. Interpretation: EHR-recorded long COVID remains very low, and incident records of long COVID declined over 2022. We found the lowest rates of recorded long COVID in people with 3 or more vaccine doses. We summarised several sources of possible bias for researchers using EHRs to study long COVID. Competing Interest Statement: BG is a Non-Executive Director at NHS Digital; he also receives personal income from speaking and writing for lay audiences on the misuse of science. All other authors declare no competing interests. Funding Statement: This research was supported by the National Institute for Health and Care Research (NIHR) (OpenPROMPT: COV-LT2-0073). In addition, this research used data assets made available as part of the Data and Connectivity National Core Study, led by Health Data Research UK in partnership with the Office for National Statistics and funded by UK Research and Innovation (grant ref MC_PC_20058). In addition, the OpenSAFELY Platform is supported by grants from the Wellcome Trust (222097/Z/20/Z); MRC (MR/V015737/1, MC_PC-20059, MR/W016729/1); NIHR (NIHR135559, COV-LT2-0073), and Health Data Research UK (HDRUK2021.000, 2021.0157).
The views expressed are those of the authors and not necessarily those of the NIHR, NHS England, UK Health Security Agency (UKHSA) or the Department of Health and Social Care. Funders had no role in the study design, collection, analysis, and interpretation of data; in the writing of the report; and in the decision to submit the article for publication. Author Declarations: I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained. Yes. The details of the IRB/oversight body that provided approval or exemption for the research described are given below: This research is part of the OpenPROMPT study 'Quality-of-life in patients with long COVID: harnessing the scale of big data to quantify the health and economic costs'. Health Research Authority (HRA) and Health and Care Research Wales (HCRW) gave ethical approval for this work (IRAS project ID 304354). The Research Ethics Committee of the London School of Hygiene & Tropical Medicine gave ethical approval for this work (ref 28030). Research Ethics Committee of South Central-Berkshire B gave favourable opinion (ref 22/SC/0198). I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals. Yes. I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov.
I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance). Yes. I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable. Yes. Access to the underlying identifiable and potentially re-identifiable pseudonymised electronic health record data is tightly governed by various legislative and regulatory frameworks, and restricted by best practice. The data in the NHS England OpenSAFELY COVID-19 service is drawn from General Practice data across England where TPP is the data processor. TPP developers initiate an automated process to create pseudonymised records in the core OpenSAFELY database, which are copies of key structured data tables in the identifiable records. These pseudonymised records are linked onto key external data resources that have also been pseudonymised via SHA-512 one-way hashing of NHS numbers using a shared salt. University of Oxford, Bennett Institute for Applied Data Science developers and PIs, who hold contracts with NHS England, have access to the OpenSAFELY pseudonymised data tables to develop the OpenSAFELY tools. These tools in turn enable researchers with OpenSAFELY data access agreements to write and execute code for data management and data analysis without direct access to the underlying raw pseudonymised patient data, and to review the outputs of this code. All code for the full data management pipeline - from raw data to completed results for this analysis - and for the OpenSAFELY platform as a whole is available for review at github.com/OpenSAFELY. Abbreviations: EHR, Electronic Health Records; SARS-COV-2, Severe acute respiratory syndrome coronavirus 2.
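The data access statement above says that records are linked across sources after SHA-512 one-way hashing of NHS numbers with a shared salt. The sketch below illustrates only the general idea and is not the OpenSAFELY or TPP implementation. The salt value, the way the salt is combined with the identifier (simple prefixing), the function names `pseudonymise` and `link`, and the example records are all assumptions made for illustration.

```python
import hashlib

# Hypothetical salt for illustration only; a real shared salt would be kept secret.
SHARED_SALT = b"example-shared-salt"


def pseudonymise(nhs_number: str, salt: bytes = SHARED_SALT) -> str:
    """Return the hex SHA-512 digest of salt + the 10-digit NHS number.

    Prefixing the salt is an assumption; the source does not say how
    the salt and identifier are combined.
    """
    # Normalise formatting so "943 476 5919" and "9434765919" hash identically.
    digits = "".join(ch for ch in nhs_number if ch in "0123456789")
    if len(digits) != 10:
        raise ValueError("expected a 10-digit NHS number")
    return hashlib.sha512(salt + digits.encode("ascii")).hexdigest()


def link(primary_care: dict, external: dict) -> dict:
    """Join two datasets, each keyed by pseudonymised ID, on matching keys."""
    return {
        pid: (primary_care[pid], external[pid])
        for pid in primary_care.keys() & external.keys()
    }


# Made-up records; each source hashes its own identifiers independently.
gp = {pseudonymise("943 476 5919"): "long COVID code recorded"}
tests = {
    pseudonymise("9434765919"): "positive SARS-CoV-2 test",
    pseudonymise("999 000 0018"): "negative SARS-CoV-2 test",
}

# Only the person present in both sources is linked; no raw NHS number is compared.
linked = link(gp, tests)
```

Because both sources apply the same salt and hash, matching digests identify the same person without either side exchanging the raw identifier. Changing the salt yields unrelated digests, which is why the salt has to be shared between the data sources being linked.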