Abstract
Biological evidence suggests ursodeoxycholic acid (UDCA) - a common treatment of cholestatic liver disease - may prevent severe COVID-19 outcomes. With the approval of NHS England, we conducted a population-based cohort study using primary care records, linked to death registration data and hospital records through the OpenSAFELY-TPP platform. We estimated the hazard of COVID-19 hospitalisation or death between 1 March 2020 and 31 December 2022, comparing UDCA treatment to no UDCA treatment in a population with indication. Of 11,320 eligible individuals, 642 were hospitalised or died with COVID-19 during follow-up, 402 (63%) events among UDCA users. After confounder adjustment, UDCA was associated with a 21% (95% CI 7%-33%) relative reduction in the hazard of COVID-19 hospitalisation or death, consistent with an absolute risk reduction of 1.3% (95% CI 1.0%-1.6%). Our findings support calls for clinical trials investigating UDCA as a preventative measure for severe COVID-19 outcomes.
Introduction
Ursodeoxycholic acid (UDCA) is first-line therapy in the treatment of primary biliary cholangitis (PBC) and commonly prescribed for people with primary sclerosing cholangitis (PSC). Both PBC and PSC are comparatively rare liver diseases that can lead to cirrhosis and end-stage liver disease. PBC typically affects females more than males and most commonly presents at 50-60 years of age [1,2], while PSC is more common in men and presents at younger ages, often with comorbid inflammatory bowel disease [3,4]. UDCA has been shown to delay the progression of PBC, is usually prescribed lifelong [1,5], and is generally well-tolerated. A UK audit found that <10% of patients with PBC discontinued UDCA, with nausea, diarrhoea and vomiting the most frequent intolerances [6].
There is some biological evidence suggesting that UDCA protects against SARS-CoV-2 infection [7–9]. The proposed mechanism is that UDCA suppresses the signalling of the farnesoid X receptor (FXR) which reduces expression of angiotensin-converting enzyme 2 (ACE2), a cellular membrane protein which is the main receptor of SARS-CoV-2. Reduced ACE2 limits opportunities for SARS-CoV-2 to enter cells and reduces viral replication after infection. Whilst cohort and registry studies have investigated the association between UDCA exposure and severe COVID-19 outcomes, conflicting results were obtained [7,10–15]. These studies had important limitations relating to sample size, small numbers of outcomes and UDCA exposure measurement at a single time point or based on exposure during the study period. In addition, the extent to which vaccination has modified this association is unclear. The value of clinical trials of UDCA as prophylaxis or treatment for COVID-19, despite strong biological plausibility of benefit, in an era of high vaccination coverage is therefore unclear.
To address this evidence gap, we used routinely-collected data covering 43% of the population in England [16], to estimate the hazard of the composite outcome of COVID-19 related hospitalisation or death between 1 March 2020 and 31 December 2022, comparing use of ursodeoxycholic acid treatment versus no ursodeoxycholic acid treatment among people with PBC or PSC.
Methods
Study design
We conducted a population-based cohort study using primary care records managed by the GP software provider TPP, linked to Office of National Statistics (ONS) death registration data and National Health Service (NHS) Secondary Use Service (SUS) data through OpenSAFELY, a data analytics platform created by our team on behalf of NHS England to address urgent COVID-19 research questions (https://opensafely.org).
Data Source
All data were linked, stored and analysed securely using the OpenSAFELY platform, https://www.opensafely.org/, as part of the NHS England OpenSAFELY COVID-19 service. Data include pseudonymised data such as coded diagnoses, medications and physiological parameters. No free text data were included. All code and codelists are shared openly for review and re-use under MIT open licence (https://github.com/opensafely/udca_covid). Detailed pseudonymised patient data is potentially re-identifiable and therefore not shared.
Study Population
The study population included people with either PBC or PSC, defined as the presence of a SNOMED CT (diagnosis) code for PBC or PSC ≥6 months before index date (1st March 2020), and therefore indicated for treatment with UDCA. Since PBC and PSC are comparatively rare, we included both conditions to increase the power of the study. People were excluded if they had: 1) <18 or >115 years of age, 2) < 6 months of registration in a TPP practice at the index date, which could preclude adequate ascertainment of key covariates, 3) missing information on sex, sustainability and transformation partnership (STP) region (an NHS administrative geographical area) or index of multiple deprivation (IMD), likely indicative of poor data quality or 4) a liver transplant prior to the index date, identified using SNOMED-CT codes in the primary care records or OPCS codes in secondary care records. People were followed from the index date until the earliest of either death, deregistration from their GP or the end of the study period (31st December 2022) (Figure 1).
Study measures
Exposure
The primary exposure was time-varying UDCA status. Baseline exposure status was defined as having at least one UDCA prescription during the 120 days prior to 1st March 2020. Prescriptions during follow-up were additionally identified and exposure status, including switches from unexposed to exposed and vice versa were updated accordingly. Given only the availability of prescription start dates we assumed treatment duration to be 120 days from prescription start date in primary analyses (derived from an assumed prescription length of up to 90 days, and a gap of up to 30-days between prescription refills). If two prescriptions overlapped, the 120-day exposure period restarted on the later prescription; overlapping days were not added to the exposure time in primary analyses. This was informed by a descriptive analysis of the prescribing data and discussions with clinicians.
Outcomes
The primary outcome was a composite of COVID-19 related death or hospitalisation. The secondary outcomes considered COVID-19 related death and COVID-19 related hospitalisation individually. Deaths were identified using linked ONS death registration data. COVID-19 related death was defined as a death where the underlying or contributory cause on the death certificate was COVID-19 (ICD-10 codes U07.1 and U07.2). COVID-19 related hospitalisations, obtained from secondary care SUS data, were defined as any hospitalisation listing a COVID-19 diagnosis in any position (ICD-10 codes U07.1 and U07.2). For the composite outcome, if a person was hospitalised before death, the date of hospitalisation was used.
Covariates
Covariates were identified through literature review and discussions with domain experts. We extracted the following fixed covariates at index date: age, sex, ethnicity, deprivation, sustainability and transformation partnership (STP) region (an NHS administrative geographical area), body mass index, smoking status, and presence of a COVID-19 high-risk diagnosis (i.e., learning difficulties, solid cancer, haematological disease, stem cell transplant, renal disease, immune-mediated inflammatory disorders identified through immunosuppressant drugs and glucocorticoid prescribing, primary immune deficiencies, HIV/AIDS, solid organ transplant, or rare neurological conditions including multiple sclerosis, motor neurone disease, myasthenia gravis or Huntington’s disease). COVID-19 high-risk diagnoses were identified through SNOMED CT codes in primary care records and ICD-10 and OPCS codes in secondary care records. Ethnicity was identified through SNOMED CT codes in primary care records and supplemented with information from secondary care records. Recorded ethnicity was grouped into White versus non-White due to small numbers of people and outcomes in non-White ethnicities. Deprivation was measured using quintiles of the Index of Multiple Deprivation (IMD), a relative measure of deprivation based on a person’s postcode [17]. Body mass index was ascertained from weight measurements within the 10 years prior to index date, restricted to those taken when the patient was aged 16 years or older. Smoking status (never/former/current) was defined by the most recent SNOMED-CT code prior to the index date. Exposure to obeticholic acid (OCA), a second line therapy used in PBC, at index date was identified from linked high cost drug data [18]. This contains prescribing from April 2018 to March 2020, a person was considered exposed if they had at least one prescription between 1st September 2019 and 31st March 2020.
Time-varying covariates were assessed every six months and at date of exposure switching [19]. Time-varying covariates included: COVID-19 vaccination, liver transplant, and liver disease severity. COVID-19 vaccinations were identified using SNOMED-CT codes; people were considered vaccinated from the date of their first vaccination. Liver transplants were identified using the same codelist as for the exclusion criteria (described above). Liver disease severity was identified using SNOMED-CT codes and ICD-10 codes, which included decompensation events including hepatic encephalopathy, ascites and variceal haemorrhage. A person was assumed to have severe disease after the date of the earliest code.
Missing data
On the assumption that both obesity and smoking are more likely to be recorded if present, people with missing body mass index were assumed to be within healthy range, and people with missing smoking information were assumed never smokers, in line with previous work [20]. A missing category was used for people with missing ethnicity information.
Statistical analysis
The study population was described in a flowchart. The characteristics of the population were summarised using descriptive statistics, stratified by UDCA exposure status at index date. Incidence rates by UDCA exposure status were calculated. Cox proportional hazards models, stratified by STP region, were used to estimate hazard ratios (HR) and 95% confidence intervals (CI) for the association between UDCA exposure and each outcome considering unadjusted, age and sex adjusted and fully adjusted models. Robust standard errors were applied.
In sensitivity analyses, we first added overlapping prescription days to exposure time. Second, we varied the assumed duration of UDCA prescriptions to be 90 days. Third, we analysed each population (i.e., PBC or PSC) separately. Fourth, we excluded people with OCA prescriptions at index date. OCA increases FXR levels, as UDCA decreases FXR levels, OCA may potentially have opposing effects on SARS-CoV-2 infection [7].
In secondary analyses, we aimed to estimate treatment effects in a population with high vaccination coverage. We repeated the primary analysis assuming an index date of 1st March 2021, as by this date a large majority of adults aged 60 years and older in the UK had received at least one COVID-19 vaccination [21]. In this analysis, we extended the adjustment for vaccination status by considering a) days since the most recent vaccination at index date, categorised into quintiles and b) a time-varying count of COVID-19 vaccinations over follow-up, this was reassessed, every 6-months and at date of exposure switching. We derived cumulative incidence plots using Royston-Parmer models, standardised to the covariate distribution in the exposed groups, with the baseline hazard modelled using a spline with 2 degrees-of-freedom.
All counts are rounded to the nearest 5 to minimise potential disclosure. Data management was performed using Python 3.8, with analysis carried out using Stata 17. Code for data management and analysis as well as codelists available online (https://github.com/opensafely/udca_covid). All iterations of the pre-specified study protocol are archived (https://github.com/opensafely/udca_covid/docs).
Information governance and ethical approval
NHS England is the data controller of the NHS England OpenSAFELY COVID-19 Service; TPP is the data processor; all study authors using OpenSAFELY have the approval of NHS England [22]. This implementation of OpenSAFELY is hosted within the TPP environment which is accredited to the ISO 27001 information security standard and is NHS IG Toolkit compliant [23]. Further details are in supplementary materials.
Results
We identified 11,305 people with PBC or PSC who met the inclusion criteria (supplementary materials). Of those, 7225 (64%) were exposed to UDCA at index date. Those prescribed UDCA at baseline were slightly older (aged 61-80 years: 55% UDCA vs 45% no UDCA), were more likely to be female (81% UDCA vs 74% no UDCA), and had more severe liver disease at baseline (42% UDCA vs 25% no UDCA) (Table 1).
There were 642 (5.7%) events of COVID-19 related hospitalisations or deaths during 29,834 person-years of follow-up. Individually there were 138 (1.2%) COVID-19 related deaths and 612 (5.4%) COVID-19 related hospitalisations. There were lower rates of COVID-19 related hospitalisation or death in those receiving UDCA users compared to non-users, with a rate of 177 events per 100,000 person-months in UDCA users, versus 184 events per 100,000 person-months in non-users. A similar pattern was seen for the individual outcomes of COVID-19 related hospitalisation and COVID-19 related death (Table 2).
In unadjusted analysis, there was no evidence of an association between UDCA exposure and the composite outcome (HR 0.91, 95% CI: 0.78 to 1.07) (Table 2 and Figure 2). After full adjustment UDCA was associated with a 21% reduction in the hazard of COVID-19 related hospitalisation or death (HR 0.79, 95% CI: 0.67 to 0.93). When analysing COVID-19 hospitalisations and deaths separately, the fully adjusted HRs were 0.81 (95% CI: 0.68 to 0.96) and 0.76 (95% CI: 0.53 to 1.08), respectively. For all analyses, the validity of the proportional assumption was assessed through investigation of Schoenfeld residual plots and there were no observed violations (supplementary materials).
The standardised cumulative incidence for COVID-19 related hospitalisations or deaths was 6.1% (95% CI: 5.2% to 7.2%) among UDCA users and 7.4% (95% CI: 6.2% to 8.8%) among non-users (Figure 3). The absolute cumulative risk difference was −1.35% (95% CI: −1.07% to −1.69%).
In sensitivity analyses, results were robust after adding overlapping days to exposure time (HR 0.81, 95% CI: 0.68 to 0.95), considering a maximum prescription duration of 90 days (HR 0.79, 95% CI: 0.68 to 0.93), and excluding people prescribed OCA (HR 0.80, 95% CI: 0.68 to 0.94). When modelling populations separately, we found associations were primarily driven by the PBC population, likely due to relatively few events in the PSC population (supplementary materials and Figure 2).
In the secondary analysis, there were 11,560 people who met the inclusion criteria as of 1st March 2021. There were 49 COVID-19 related deaths and 407 COVID-19 related hospitalisations, combined to a total of 418 COVID-19 related hospitalisations or deaths. The crude HR was 0.83 (95% CI: 0.68 to 1.01) and after adjustment the HR was 0.71 (95% CI: 0.58 - 0.87), in line with the main analysis (supplementary materials and Figure 2).
Discussion
In this large cohort study of people with PBC or PSC, use of UDCA was associated with a 21% lower hazard of COVID-19 related hospitalisation and death. Our results were consistent with a maximum absolute risk reduction of 1.6% in the context of an absolute risk of 7.2% of COVID-19 related hospitalisation or death among non-users. Our findings were robust to a variety of sensitivity analyses. Our data covered the start of the pandemic, emerging variants, and time before and after most adults were fully vaccinated. We further explored the association in a population where the majority had at least one vaccination, as most of the population is now vaccinated. In a secondary analysis limited to a study period in which a large majority of adults were vaccinated, we showed that UDCA was associated with 29% reduction in COVID-19 related hospitalisation or death.
Findings in context
There have been six observational studies of varying quality [10–15] using routinely-collected and survey data investigating the potential benefits of UDCA exposure to prevent and treat COVID-19. Two studies have shown that UDCA exposure was associated with reduced COVID-19 outcomes, including SARS-CoV-2 infection, COVID-19 hospitalisation, and death due to COVID-19 [7,10,11]. The largest study was a US cohort of 1607 male adults with cirrhosis that found a 48% reduction in severe or critical COVID-19 associated with UDCA use [10]. However, the study population included a variety of liver diseases that may have different susceptibilities to COVID-19, which the authors acknowledge may not have been adequately adjusted for [10,11]. Further, UDCA status was based on any use during follow-up, which may have resulted in exposure misclassification. Our study limited the study population to PSC and PBC to allow for better confounding control, and UDCA exposure was time-updated during follow-up.
Three studies showed no evidence of an association between UDCA and COVID-19 related or all-cause death. These were conducted in populations with established SARS-CoV-2 infection or hospitalised with COVID-19 and the numbers treated with UDCA were small [12–14]. Using a study population with acute SARS-CoV-2 infection focuses the study question on UDCA as treatment of COVID-19. The present study aimed to estimate the potential benefit of using UDCA as preventative therapy, though our design does not fully elucidate the mechanism that drives the benefit observed. Of note, two of these studies did observe a reduced need for continuous positive airway pressure [14] and a lower proportion with ICU admissions [13] in those treated with UDCA compared to no UDCA.
Strengths and limitations
Strengths of this study include the use of a large database of around 28 million records [16] which allowed us to identify a study population of over 11,000 people. We used a specific study population of people with PBC or PSC, which allowed us to identify and address confounding specific to this population. We identified a comprehensive set of potential confounders, and adjusted for these in a time-varying manner where this was deemed important, such as liver disease severity. Our results were robust to sensitivity analyses, including varying the assumed prescription duration, although power was limited when restricting to the PSC population alone. There are also limitations to acknowledge. Firstly, duration of UDCA exposure was estimated as we only had the date of prescription available and we did not have hospital prescribing information. Second, prevalence of exposure at index was lower than expected [6], suggesting the potential for exposure misclassification. However, we implemented time-varying UDCA exposure, reducing misclassification of exposure status over time. Previous studies have either determined exposure at a single time point or based on ‘ever’ exposure during the study period, which increases the potential for exposure misclassification at the time of the outcome. We do not expect exposure misclassification to be differential by outcome status; therefore, any residual misclassification would bias our results towards the null. Third, we did not have information on UDCA dose so could not examine potential dose thresholds or dose-response patterns. Fourth, as is the case in many nationwide electronic health record systems, we did not have reliable test data on SARS-CoV-2 infections covering the whole of the study period, and therefore could not examine this as an additional outcome. Fifth, we examined the potential for competing risk of liver-related death and found that, whilst there were more deaths with the primary cause as liver disease in the UDCA exposure group (n=175, 2.4%) compared to the unexposed group (n=75, 1.8%), the overall proportion experiencing liver-related death was small. In a post-hoc analysis, we removed censoring for liver-related deaths, which resulted in a slight attenuation in the estimated absolute cumulative risk difference (−1.35% versus −1.28%).
Policy implications
Although COVID-19 deaths and hospitalisations have substantially reduced since the height of the pandemic, there are still groups that remain at high risk of severe COVID-19 outcomes, despite vaccination [24]. While treatments have been introduced which reduce risk of severe COVID-19 outcomes after infection [25], vaccinations remain the only preventative measure. Therefore, further preventative measures are important for these groups. UDCA is a widely used, long-term treatment for people with liver disease that has a good safety profile [1] and is off-patent making it a good candidate for repurposing as a preventative measure for severe COVID-19 in high risk groups. Our findings support basic science evidence that UDCA prevents severe COVID-19 [7], though our data could not identify underlying mechanisms. Clinical trials have been called for [7,26], and may help to elucidate the mechanism of action UDCA as a potential preventative measure for severe COVID-19. To our knowledge, only two interventional clinical trials have been registered, one is a single-arm study in healthcare workers investigating COVID-19 infection (ClinicalTrials.gov ID: NCT05659654), the other is in children with COVID-19 infection investigating COVID-19 outcomes (ChiCTR ID: ChiCTR2200067226), both are short-term studies, and results have not yet been published.
Conclusion
Among patients with PBC and PSC, UDCA was associated with clinically-meaningful lower absolute risks and relative hazards of COVID-19 related hospitalisation and death, providing strong evidence that UDCA should be investigated further in observational and interventional studies, particularly in other high risk groups.
Data Availability
All data were linked, stored and analysed securely within the OpenSAFELY platform https://opensafely.org/. All code is shared openly for review and re-use under MIT open license (https://github.com/opensafely/udca_covid).
Funding
The OpenSAFELY Platform is supported by grants from the Wellcome Trust (222097/Z/20/Z) and MRC (MR/V015737/1, MC_PC_20059, MR/W016729/1). In addition, development of OpenSAFELY has been funded by the Longitudinal Health and Wellbeing strand of the National Core Studies programme (MC_PC_20030: MC_PC_20059), the NIHR funded CONVALESCENCE programme (COV-LT-0009), NIHR (NIHR135559, COV-LT2-0073), and the Data and Connectivity National Core Study funded by UK Research and Innovation (MC_PC_20058) and Health Data Research UK (HDRUK2021.000). LAT is funded by an NIHR Research Professorship NIHR302405. VM is funded by an NIHR Doctoral Research Fellowship (NIHR301535).
Conflicts of interest
BG has received research funding from the Bennett Foundation, the Laura and John Arnold Foundation, the NHS National Institute for Health Research (NIHR), the NIHR School of Primary Care Research, NHS England, the NIHR Oxford Biomedical Research Centre, the Mohn-Westlake Foundation, NIHR Applied Research Collaboration Oxford and Thames Valley, the Wellcome Trust, the Good Thinking Foundation, Health Data Research UK, the Health Foundation, the World Health Organisation, UKRI MRC, Asthma UK, the British Lung Foundation, and the Longitudinal Health and Wellbeing strand of the National Core Studies programme; he is a Non-Executive Director at NHS Digital; he also receives personal income from speaking and writing for lay audiences on the misuse of science. BMK is also employed by NHS England working on medicines policy and clinical lead for primary care medicines data. AM is a member of RCGP health informatics group and the NHS Digital GP data Professional Advisory Group, and received consulting fee from Induction Healthcare. LAT has received research funding from MRC, Wellcome, NIHR and GSK, consulted for Bayer in relation to an observational study of chronic kidney disease (unpaid), and is a member of 4 non-industry funded (NIHR/MRC) trial advisory committees (unpaid) and MHRA Expert advisory group (Women’s Health). REC has shares in AstraZeneca. VM received a grant from NIHR. AS is employed by LSHTM on a fellowship sponsored by GSK. JT received an AstraZeneca grant for unrelated COVID-19 research. GFM and RS have received research funding from Intercept Pharmaceuticals and Advanz Pharma. All other authors declare no conflicts of interest.
Acknowledgements
We are very grateful for all the support received from the TPP Technical Operations team throughout this work, and for generous assistance from the information governance and database teams at NHS England and the NHS England Transformation Directorate. North East Commissioning Support Unit provided support on behalf of all Commissioning Support Units to aggregate the high cost drugs data for use in OpenSAFELY studies. We thank Dr Edward Parker for his help with output checking.
Footnotes
↵* Joint senior authorship