Abstract
Background There has been extensive speculation about the relationship between COVID-19 and various cardiometabolic and pulmonary conditions. This a complex question: COVID-19 may cause a cardiometabolic or respiratory event; admission for a clinical event may result in hospital-acquired SARS-CoV-2 infection; both may contribute to a patient surpassing the threshold for presenting to services; and the presence of a pandemic may change whether patients present to services at all. To inform analysis of these questions, we set out to describe the overall rate of various key clinical events over time, and their relationship with COVID-19.
Methods Working on behalf of NHS England, we used data from the OpenSAFELY platform containing data from approximately 40% of the population of England. We selected the whole adult population of 17m patients and within this identified two further mutually exclusive groups: patients who tested positive for SARS-CoV-2 in the community; and patients hospitalised with COVID-19. We report counts of death, DVT, PE, ischaemic stroke, MI, heart failure, AKI and diabetic ketoacidosis in each month between February 2019 and October 2020 within each of: the general population, community SARS-CoV-2 cases, and hospitalised patients with COVID-19. Outcome events were defined using hospitalisations, GP records and cause of death data.
Results For all outcomes except death there was a lower count of events in April 2020 compared to April 2019. For most outcomes the minimum count of events was in April 2020, where the decrease compared to April 2019 in events ranged from 5.9% (PE) to 40.0% (heart failure). Despite hospitalised COVID-19 patients making up just 0.14% of the population in April 2020, these patients accounted for an extremely high proportion of cardiometabolic and respiratory events in that month (range of proportions 10.3% (DVT) to 33.5% (AKI)).
Interpretation We observed a substantial drop in the incidence of cardiometabolic and pulmonary events in the non-COVID-19 general population, but high occurrence of COVID-19 among patients with these events. Shortcomings in routine NHS secondary care data, especially around the timing and order of events, make causal interpretations challenging. We caution that the intermediate findings reported here should be used to inform the design and interpretation of any studies using a general population comparator to evaluate the relationship between COVID-19 and other clinical events.
Introduction
High prevalence of cardiometabolic and pulmonary complications has been reported in people with COVID-191–3. The relationship between COVID-19 and cardiometabolic or respiratory events is complex: COVID-19 may cause a cardiometabolic or respiratory event; admission for a clinical event may result in hospital-acquired SARS-CoV-2 infection; both may contribute to admission and diagnosis; and the presence of a pandemic may change whether patients present to services at all, as evidenced by a substantial drop in the total number of admission for some conditions, such as acute coronary syndromes4.
We set out to describe the overall rate of various key clinical events over time, and their relationship with COVID-19, using the linked data within OpenSAFELY including detailed primary care records, hospital episodes and spells from SUS, and ONS death certificates. Our aim was to provide a broad overview of how the rate of recorded outcome events has changed during the pandemic across the whole population; and to describe the co-occurrence of COVID-19 and each clinical event, in order to inform the design and interpretation of future studies seeking to understand causal relationships with COVID-19.
Methods
Study design and data sources
We conducted a time-series study, calculating a series of period prevalences, for each month. We used electronic health record (EHR) data from primary care practices using TPP software linked to Office for National Statistics (ONS) death registrations and Secondary Uses Service (SUS) data (containing hospital records) through OpenSAFELY. This is a data analysis platform developed during the COVID-19 pandemic, on behalf of NHS England, to allow near real-time analysis of pseudonymised primary care records at scale, covering approximately 40% of the population in England, operating within the EHR vendor’s highly secure data centre.5,6 Details on Information Governance for the OpenSAFELY platform can be found in the Appendix.
Population
We extracted monthly cohorts containing the whole adult population (age ≥18 years) who were alive on the first day of each month (February 2019 to October 2020 inclusive, the months for which there was complete data). The population was stratified by presence or absence of a positive SARS-CoV-2 test result, as well as being hospitalised with COVID-19. All people were in the general population group unless they had a positive test for SARS-CoV-2 or were hospitalised with COVID-19. People were categorised in the SARS-CoV-2 positive group if they had a positive test in that month, or in any previous month. Similarly, people were categorised into the hospitalised with COVID-19 group if they were hospitalised with COVID-19 during that month, or in any previous month. If a person both tests positive and is hospitalised with COVID-19, then they are categorised into the hospitalised with COVID-19 group from the month that the COVID-19 hospitalisation occurs.
Outcomes
We measured eight outcomes: deep vein thrombosis (DVT), pulmonary embolism (PE), ischaemic stroke, myocardial infarction (MI), heart failure, acute kidney injury (AKI) and ketoacidosis. Outcomes were defined as the presence of a diagnostic code for each of the respective outcomes, either in the general practice record, in hospital, or as a cause of death on a death certificate. For AKI and ketoacidosis, the outcomes were restricted just to events recorded in hospital or on the death certificate.
Statistical methods
For each month, we counted the number of patients with each of the relevant outcome events. Each person was counted only once each month, but people could appear in multiple months if they have repeated records of the outcome. Counts where there were fewer than 5 people in a strata were redacted to ensure anonymity. We calculated 95% confidence intervals for the total counts for each month.
Software and reproducibility
Data management was performed using the OpenSAFELY software, Python 3.8 and SQL, and analysis using Python 3.8. All codelists alongside code for data management and analyses can be found at: github.com/opensafely/population-outcomes-burden-research. All software for the OpenSAFELY platform is available for review and re-use at github.com/opensafely-core.
Results
The base population was similar across all months, with an average population of 17,595,257 adults. The SARS-CoV-2 positive population started at zero before the pandemic, before increasing to 209,752 people by October 2020. The population hospitalised with COVID-19 reached 32,372 by October 2020.
Counts of each outcome per month are presented in Figure 1. All outcomes show a decrease in events during the pandemic, with the exception of death, where there was an increase, and ketoacidosis, where uncertainty is higher due to the smaller number of events. Full tables containing the counts on which Figure 1 is based, as well as figures for the base population can be found at github.com/opensafely/population-outcomes-burden-research/tree/master/released_output.
During the peak of the first wave of the pandemic, in April 2020, total deaths increased by 73.4% compared to a year earlier (April 2019), while totals for all other outcomes decreased by between 5.9% and 40.0%. Full counts and changes are in Table 1. Counting only events that occured in patients without a record of SARS-CoV-2 infection or hospitalisation with COVID-19, then reductions are greater still.
The population of patients hospitalised with COVID-19 was 0.14% of the total population in April 2020, rising to 0.18% by October 2020. Hospitalised COVID-19 patients are dramatically overrepresented in the population of patients who had outcome events, gIven that the proportion of events occuring in this population ranges from 10.3% to 33.5%.
Discussion
Summary
Our study has two key findings that are superficially in conflict with each other: a range of key cardiometabolic and respiratory events were substantially more common in patients who also had COVID-19; however the incidence of these conditions in the whole population overall fell substantially during the peak of the COVID-19 pandemic in early 2020, with the reduction in event rates varying between 5.9% and 40% for different outcomes. The proportion of cardiometabolic and respiratory events where the patient was also hospitalised with COVID-19 varied between 10.3 and 33.5%.
Strengths and limitations
A key strength of our analysis is size: our analysis is based on the detailed primary care and summary hospital records of over 17 million people. Our aim was to give a high level overview of how the volume of events changed during the pandemic, and the co-occurrence of COVID-19 and each event: although we observed a high proportion of events within the COVID-19 population, we did not design the study to demonstrate a causal link between the two. Such relationships are likely to be complex: for example, some patients may be admitted with a cardiometabolic or respiratory event, and then acquire COVID-19 within hospital; some patients may develop COVID-19 and then experience a cardiometabolic or respiratory event as a consequence; some patients may have both COVID-19 and a cardiometabolic or respiratory event in the community, with both contributing to their presentation to services. We did not aim to determine the order that events occurred in: however in many cases the order of events cannot be reliably ascertained from the data available at national level, such as SUS or HES data, combined with SGSS test date data. For example it is not possible to reliably determine the time course or order of events within a hospital spell.
Findings in context
A previous study using in-hospital records reported a 40% reduction in the incidence of acute coronary syndromes during the peak of Wave 1 of the pandemic in England, consistent with our findings, but did not explore co-occurrence with COVID-194. Our definition of myocardial infarction was similar, but also measured events recorded in the community and on death certificates: we report a 28.3% reduction in events. A recent preprint using detailed COVID-19 Clinical Information Network/International Severe Acute Respiratory and Emerging Infection Consortium (ISARIC) data alongside sparse SUS/HES data estimated that 20-25% of SARS-CoV-2 infections were hospital-acquired during the first wave of the pandemic in England7.
Implications and future research
Our results suggest the SARS-CoV-2 pandemic is affecting incidence and recording of events in a variety of ways. COVID-19 appears to be strongly associated with various cardiometabolic and respiratory events; but the rate of those events fell very substantially during the peak of the first wave of COVID-19. The reduction in these cardiometabolic and respiratory events across the population may be driven by a range of factors: patients may have been reluctant to present to services with cardiometabolic and respiratory events due to fear of infection during the peak of the pandemic; or there may have been a true reduction in incidence of those events, perhaps caused by a change in exposure to risk factors such as air pollution or strenuous activity; or multiple factors may have combined to produce the reduction in presentations. Competing risk of death is also likely to be important, as many of those who died from COVID-19 would also be at high risk of the measured outcomes.
We therefore present this intermediate data for the wider community to inform study designs and further analyses by our group and others. We suggest two immediate future directions: firstly, more complex analyses of existing national data, consisting of detailed primary care data, and sparse secondary care data such as SUS and HES where the timing and order of events is unclear; secondly, further analysis of timing and order of events in richer datasets covering a smaller population, such as that managed by ISARIC8, or other bespoke data collections. We caution that the intermediate findings reported here should be used to inform the design and interpretation of any studies using a general population comparator to measure the relationship between COVID-19 and other clinical events. The uncertainties identified in the data, and the results, both highlight the need for greater granularity of data related to details and timing of health events occuring in hospital.
Data Availability
All data were linked, stored and analysed securely within the OpenSAFELY platform (https://opensafely.org/). Detailed pseudonymized patient data are potentially re-identifiable and therefore not shared. We rapidly delivered the OpenSAFELY data analysis platform without prior funding to deliver timely analyses of urgent research questions in the context of the global COVID-19 health emergency: now that the platform is established, we are developing a formal process for external users to request access in collaboration with NHS England. Details of this process will be published in due course on the OpenSAFELY website.
https://github.com/opensafely/population-outcomes-burden-research
Conflicts of interest
BG has received research funding from the Laura and John Arnold Foundation, the Wellcome Trust, the NIHR Oxford Biomedical Research Centre, the NHS National Institute for Health Research School of Primary Care Research, the Mohn-Westlake Foundation, Health Data Research UK (HDR-UK), the Good Thinking Foundation, the Health Foundation, and the World Health Organisation; he also receives personal income from speaking and writing for lay audiences on the misuse of science. IJD has received unrestricted research grants and holds shares in GlaxoSmithKline (GSK).
Funding
This work was supported by the Medical Research Council MR/V015737/1 and the Longitudinal Health and Wellbeing strand of the National Core Studies programme. TPP provided technical expertise and infrastructure within their data centre pro bono in the context of a national emergency. The OpenSAFELY software platform is supported by a Wellcome Discretionary Award. BG’s work on clinical informatics is supported by the NIHR Oxford Biomedical Research Centre, the NIHR Applied Research Collaboration Oxford and Thames Valley, the Mohn-Westlake Foundation, and NHS England; all DataLab staff are supported by BG’s grants on this work. LS reports grants from Wellcome, MRC, NIHR, UKRI, British Council, GSK, British Heart Foundation, and Diabetes UK outside this work. JPB is funded by a studentship from GSK. AS is employed by LSHTM on a fellowship sponsored by GSK. KB holds a Sir Henry Dale fellowship jointly funded by Wellcome and the Royal Society (107731/Z/15/Z). HIM is funded by the National Institute for Health Research (NIHR) Health Protection Research Unit in Immunisation, a partnership between Public Health England and LSHTM. AYSW holds a fellowship from BHF. RM holds a Sir Henry Wellcome fellowship. EW holds grants from MRC. RG holds grants from NIHR and MRC. ID holds grants from NIHR and GSK. RM holds a Sir Henry Wellcome Fellowship funded by the Wellcome Trust (201375/Z/16/Z). HF holds a UKRI fellowship. KB holds a Doctoral Research Fellowship from NIHR. The views expressed are those of the authors and not necessarily those of the NIHR, NHS England, Public Health England or the Department of Health and Social Care.
Funders had no role in the study design, collection, analysis, and interpretation of data; in the writing of the report; and in the decision to submit the article for publication.
Data Sharing
All data were linked, stored and analysed securely within the OpenSAFELY platform (https://opensafely.org/). Detailed pseudonymized patient data are potentially re-identifiable and therefore not shared. We rapidly delivered the OpenSAFELY data analysis platform without prior funding to deliver timely analyses of urgent research questions in the context of the global COVID-19 health emergency: now that the platform is established, we are developing a formal process for external users to request access in collaboration with NHS England. Details of this process will be published in due course on the OpenSAFELY website.
Ethical Approval
This study was approved by the Health Research Authority (REC 20/LO/0651) and by the LSHTM Ethics Board (#21863).
Acknowledgements
We are very grateful for all the support received from the TPP Technical Operations team throughout this work; for generous assistance from the information governance and database teams at NHS England / NHSX.
Appendix
Information governance and ethics
NHS England is the data controller; TPP is the data processor; and the key researchers on OpenSAFELY are acting on behalf of NHS England. OpenSAFELY is hosted within the TPP environment which is accredited to the ISO 27001 information security standard and is NHS IG Toolkit compliant;9,10 patient data are pseudonymised for analysis and linkage using industry standard cryptographic hashing techniques; all pseudonymised datasets transmitted for linkage onto OpenSAFELY are encrypted; access to the platform is via a virtual private network (VPN) connection, restricted to a small group of researchers who hold contracts with NHS England and only access the platform to initiate database queries and statistical models. Pseudonymised structured data include demographics, medications prescribed from primary care, diagnoses, and laboratory measures. No free text data are included. All database activity is logged; only aggregate statistical outputs leave the platform environment following best practice for anonymisation of results such as statistical disclosure control for low cell counts.11 The OpenSAFELY research platform adheres to the obligations of the UK General Data Protection Regulation (GDPR) and the Data Protection Act 2018. In March 2020, the Secretary of State for Health and Social Care used powers under the UK Health Service (Control of Patient Information) Regulations 2002 (COPI) to require organisations to process confidential patient information for the purposes of protecting public health, providing healthcare services to the public and monitoring and managing the COVID-19 outbreak and incidents of exposure; this sets aside the requirement for patient consent.12 Taken together, these provide the legal bases to link patient datasets on the OpenSAFELY platform. GP practices, from which the primary care data are obtained, are required to share relevant health information to support the public health response to the pandemic, and have been informed of the OpenSAFELY analytics platform. This study was approved by the Health Research Authority (REC reference 20/LO/0651) and by the LSHTM Ethics Board (ref 21863).