PT - JOURNAL ARTICLE AU - Anna O. Basile AU - Anurag Verma AU - Leigh Anne Tang AU - Marina Serper AU - Andrew Scanga AU - Ava Farrell AU - Brittney Destin AU - Rotonya M. Carr AU - Anuli Anyanwu-Ofili AU - Gunaretnam Rajagopal AU - Abraham Krikhely AU - Marc Bessler AU - Muredach P. Reilly AU - Marylyn D. Ritchie AU - Joshua Denny AU - Nicholas P. Tatonetti AU - Julia Wattacheril TI - Rapid Identification and Phenotyping of Nonalcoholic Fatty Liver Disease Patients Using an Automated Algorithmic Approach in Diverse, Urban Healthcare Systems AID - 10.1101/2021.04.27.21256139 DP - 2021 Jan 01 TA - medRxiv PG - 2021.04.27.21256139 4099 - http://medrxiv.org/content/early/2021/04/30/2021.04.27.21256139.short 4100 - http://medrxiv.org/content/early/2021/04/30/2021.04.27.21256139.full AB - Background and Aims Nonalcoholic Fatty Liver Disease (NAFLD) is the most common global cause of chronic liver disease. Therapeutic interventions are rapidly advancing for its inflammatory phenotype, nonalcoholic steatohepatitis (NASH). Diagnosis codes alone fail to accurately recognize at-risk patients. The objective of the present work is to identify NAFLD patients within large electronic health record (EHR) databases for targeted intervention based on clinically relevant phenotypes.Methods We present a rule-based phenotype algorithm for the rapid identification of NAFLD patients developed using EHRs from 5.8 million adult patients at Columbia University Irving Medical Center (CUIMC). The algorithm was developed using the Observational Medical Outcomes Partnership (OMOP) Common Data Model, and queries multiple structured and unstructured data elements, including diagnosis codes, laboratory measurements, radiology and pathology modalities.Results Our approach identified 16,060 CUIMC NAFLD patients with 170 having a biopsy-proven NASH diagnosis. Fibrosis scoring on patients without histology identified 943 with scores indicative of advanced fibrosis (FIB-4, APRI, NAFLD) in 2 of the scoring metrics. The algorithm was validated at two independent healthcare systems, University of Pennsylvania Healthcare System (UPHS) and Vanderbilt Medical Center (VUMC), where 20,779 and 19,575 NAFLD patients were identified, respectively. Clinical chart review identified a high positive predictive value (PPV) for the algorithm across all healthcare systems: 91% at CUIMC, 75% at UPHS, and 85% at VUMC.Conclusions Our rule-based algorithm provides an accurate, automated approach for rapidly identifying and sub-phenotyping NAFLD patients within a large EHR system. This highlights the clinical potential algorithms have in discovering NAFLD patients at highest risk for disease progression for diagnostic and therapeutic intervention.Data Transparency Statement Algorithmic code is available for academic, non-commercial collaborations by request to the corresponding authors.Background and Context NAFLD is the leading form of liver disease worldwide with a rising prevalence in the population. Current means of identification are complex and dependent on provider recognition of clinical risk factors.New Findings We present an accurate (mean PPV=84%) and cross-institution validated, rule-based algorithm for the high-throughput, rapid identification of NAFLD patients across diverse EHR systems comprising approximately 12.1 million patients. The majority of patients were previously unidentified.Limitations Inaccessible imaging and histologic data (performed outside the healthcare system) limited our ability to verify hepatic steatosis and resulted in low sensitivity for the final step of the algorithm.Impact Our NAFLD algorithm provides an accurate means of rapidly identifying NAFLD in large EHR systems to target patients at greatest risk for disease progression and clinical outcomes towards diagnostic and therapeutic interventions.Short Summary NAFLD, the leading cause of liver disease globally, is often under-recognized in at-risk individuals. Here we present a rapid, non-invasive algorithm for identifying patients within large health systems who are at greatest risk for disease progression and clinical decompensation for diagnostic and therapeutic intervention.Competing Interest StatementJW has received research support from Janssen, Galectin, Intercept, Genfit, Shire, Conatus, Zydus, and is on the advisory board for Astra Zeneca/MedImmune, AMRA. RMC has received research support from Intercept Pharmaceuticals and Merck, Inc. MS is a consultant for Gilead, Inc. AK is a speaker and proctor for Intuitive, reviewer for surgical videos for Crowd Sourced Assessment of Technical Skills (CSATs), and a consultant for Johnson and Johnson and Surgical Specialties Corporation. AV, LT, AS, AF, BD, AA, GR, AK, MB, MPR, MDR, JD, and NPT have nothing to disclose. Patent for algorithm to Columbia University Trustees; 2021 The Trustees of Columbia University in the City of New York. The owner has no objection to reproduction of the work for academic non-commercial purposes, but otherwise reserves all copyright rights whatsoever. JW, NPT and AOB are co-inventors.Funding StatementFunding was provided by Janssen Research and Development in collaboration with Columbia University Irving Medical Center. The sponsor was involved in study concept and design. This publication was supported by the National Center for Advancing Translational Sciences, National Institutes of Health, through Grant Number UL1TR001873. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institute of Health (NIH).Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:This study was approved by the Columbia University Institutional Review Boards of Columbia University Irving Medical Center, University of Pennsylvania Health System and Vanderbilt University Medical Center.All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesData in the study were extracted from patient electronic health records at Columbia University Irving Medical Center, University of Pennsylvania Healthcare System (UPHS), and Vanderbilt Medical Center (VUMC). These data are not available for public use due to institutional privacy policies and federal regulations.EHRElectronic Health RecordNAFLDnonalcoholic fatty liver diseaseNASHnonalcoholic steatohepatitisOMOPObservational Medical Outcomes Partnership