Abstract
Routinely collected electronic health records (EHR) offer a valuable opportunity to carry out research on immunisation uptake, effectiveness and safety, using large and representative samples of the population. However, using EHR presents challenges for identifying vaccinated and unvaccinated cohorts. Some vaccinations are delivered in different care settings, so may not be fully recorded in primary care EHR. In contrast to other drugs, vaccines do not require electronic prescription in many settings, which may lead to ambiguous coding of vaccination status and timing. Additionally, for childhood vaccination, there may be further challenges in identifying the study population eligible for vaccination due to changes in immunisation schedules over time, different vaccine indications depending on the context (e.g., tetanus vaccination after exposure) and the lack of full dates of birth in many databases because of data confidentiality restrictions.
In this paper, we describe our approach to tackling methodological issues related to identifying childhood immunisations, using the Clinical Practice Research Datalink (CPRD) Aurum, a UK primary care dataset of EHR, as an example, and we introduce a comprehensive algorithm to support high-quality studies of childhood vaccination. We show that a broad variety of considerations is important to identify vaccines in EHR and offer guidance on decisions to ascertain vaccination status, such as considering data source and delivery systems (e.g., primary or secondary care), using a wide range of medical codes in combination to identify vaccination events, and using appropriate wash-out periods and quality checks to deal with issues of over-recording and backdating in EHR.
Our algorithm produced estimates of vaccination coverage that are comparable to official national estimates in England. This paper aims to improve transparency, quality, comparability and reproducibility of studies on immunisations.
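As an illustration of the wash-out idea mentioned above, the following minimal Python sketch collapses repeated records of the same vaccine that fall close together in time into a single vaccination event. It is not the authors' algorithm: the function name `collapse_within_washout`, the 14-day window and the example dates are all assumptions chosen for illustration only.

```python
from datetime import date


def collapse_within_washout(event_dates, washout_days=14):
    """Collapse records lying within a wash-out window into one event.

    Hypothetical helper: a record is kept only if it falls more than
    `washout_days` after the most recently kept record. The 14-day
    default is an illustrative assumption, not a value from the paper.
    """
    kept = []
    # Sorting a set removes exact duplicates and orders records in time.
    for d in sorted(set(event_dates)):
        if not kept or (d - kept[-1]).days > washout_days:
            kept.append(d)
    return kept


# Illustrative records for one child and one vaccine (made-up dates):
# a duplicate entry, a re-recording one week later, and a later dose.
records = [
    date(2015, 3, 2),
    date(2015, 3, 2),
    date(2015, 3, 9),
    date(2015, 4, 6),
]
doses = collapse_within_washout(records)
```

In this sketch the window is measured from the last retained record rather than from the last raw record, so a long chain of closely spaced re-recordings cannot merge two genuinely separate doses into one.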
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This study is funded by the National Institute for Health and Care Research (NIHR) Health Protection Research Unit in Vaccines and Immunisation (NIHR200929), a partnership between UK Health Security Agency and the London School of Hygiene and Tropical Medicine. The views expressed are those of the author(s) and not necessarily those of the NIHR, UK Health Security Agency or the Department of Health and Social Care. CWG is supported by a Wellcome Career Development Award (225868/Z/22/Z).
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
We received data governance approval from Clinical Practice Research Datalink (CPRD) under protocol number 22_001706 and ethical approval from the London School of Hygiene and Tropical Medicine research ethics committee (reference number 27651).
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Footnotes
* joint last authorship
Data Availability
The study uses data from the Clinical Practice Research Datalink (CPRD). CPRD does not allow the sharing of patient-level data. The data specification for the CPRD data set is available at: https://cprd.com/cprd-aurum-may-2022-dataset. The code lists can be found at: https://github.com/Eyedeet/vaccine_methods_ehr_public/tree/main/codelists.