Abstract
Objective Growing numbers of academic medical centers offer patient cohort discovery tools to their researchers, yet the performance of systems for this use case is not well understood. The objective of this research was to assess patient-level information retrieval (IR) methods using electronic health records (EHR), and to investigate the interplay between commonly used IR approaches and the structure of the cohort definition.
Materials and Methods Using the Cranfield IR evaluation methodology, we developed a test collection based on 56 test topics characterizing patient cohort requests for various clinical studies. Test collection data was derived from patient records originating from OHSU’s EHR data warehouse. Automated IR tasks were performed, varying four different parameters for a total of 48 permutations, with performance measured using B-Pref. We subsequently created 56 structured Boolean queries for the 56 topics for performance comparisons. Finally, we designed 59 taxonomy characteristics to classify the structure of the 56 topics. Six topic complexity measures were derived from these characteristics for further evaluation using a beta regression simulation.
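For readers unfamiliar with the metric, the following is a minimal sketch of how B-Pref can be computed for a single topic. It follows the commonly used formulation in which only judged patients count and the penalty is normalized by min(R, N); this is an assumption for illustration and not necessarily the exact variant used in this study. The function name `bpref` and the patient identifiers are hypothetical.

```python
def bpref(ranking, relevant, nonrelevant):
    """B-Pref for one topic (illustrative sketch, not the study's code).

    ranking     -- list of retrieved patient IDs, best first
    relevant    -- set of IDs judged relevant
    nonrelevant -- set of IDs judged not relevant
    Unjudged IDs in the ranking are ignored.
    """
    R = len(relevant)
    N = len(nonrelevant)
    if R == 0:
        return 0.0  # metric is undefined without relevant judgments; return 0 by convention here

    denom = min(R, N)
    nonrel_seen = 0  # judged non-relevant patients ranked above the current position
    total = 0.0
    for pid in ranking:
        if pid in relevant:
            if denom == 0:
                # no judged non-relevant patients exist, so there is nothing to penalize
                total += 1.0
            else:
                total += 1.0 - min(nonrel_seen, R) / denom
        elif pid in nonrelevant:
            nonrel_seen += 1
    # relevant patients that were never retrieved contribute 0
    return total / R


# Hypothetical judgments: 3 relevant, 2 non-relevant, 1 unjudged ("u1")
score = bpref(["p1", "u1", "p2", "p3", "p4"], {"p1", "p3", "p5"}, {"p2", "p4"})
print(round(score, 3))
```

Because unjudged patients are skipped rather than treated as non-relevant, the measure is suited to test collections with incomplete relevance judgments.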
Results The best-performing word-based automated query parameter settings achieved a mean B-Pref of 0.167 across all 56 topics. The way a topic was structured (topic representation) had the largest impact on performance. Performance varied widely across topics, as did sensitivity to parameter settings. Structured queries generally performed better than automated queries on measures of recall and precision, but were still not able to recall all relevant patients found by the automated queries. We also found strong performance associations with the six complexity measures created from the topic taxonomy, and interactions with automated query parameter settings.
Conclusion While word-based automated methods of cohort retrieval offer an attractive alternative to the labor-intensive approach currently used for this task at many medical centers, we generally found suboptimal performance in the methods tested for this study. Some of the characteristics derived from a query taxonomy could lead to improved selection of approaches based on the structure of the topic of interest. Insights gained here will help guide future work to develop new methods for patient-level cohort discovery with EHR data.
Competing Interest Statement
Steven Chamberlin, Aaron Cohen, and William Hersh have research funding from Alnylam Pharmaceuticals that is unrelated to the work described in this paper.
Funding Statement
This work was supported by NIH Grant 1R01LM011934 from the National Library of Medicine.
Author Declarations
All relevant ethical guidelines have been followed and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
Any clinical trials involved have been registered with an ICMJE-approved registry such as ClinicalTrials.gov and the trial ID is included in the manuscript.
Not Applicable
I have followed all appropriate research reporting guidelines and uploaded the relevant Equator, ICMJE or other checklist(s) as supplementary files, if applicable.
Not Applicable