PT - JOURNAL ARTICLE AU - Walsh, Colin G. AU - Wilimitis, Drew AU - Chen, Qingxia AU - Wright, Aileen AU - Kolli, Jhansi AU - Robinson, Katelyn AU - Ripperger, Michael A. AU - Johnson, Kevin B. AU - Carrell, David AU - Desai, Rishi J. AU - Mosholder, Andrew AU - Dharmarajan, Sai AU - Adimadhyam, Sruthi AU - Fabbri, Daniel AU - Stojanovic, Danijela AU - Matheny, Michael E. AU - Bejan, Cosmin A. TI - Scalable Incident Detection via Natural Language Processing and Probabilistic Language Models AID - 10.1101/2023.11.30.23299249 DP - 2023 Jan 01 TA - medRxiv PG - 2023.11.30.23299249 4099 - http://medrxiv.org/content/early/2023/12/01/2023.11.30.23299249.short 4100 - http://medrxiv.org/content/early/2023/12/01/2023.11.30.23299249.full AB - Post marketing safety surveillance depends in part on the ability to detect concerning clinical events at scale. Spontaneous reporting might be an effective component of safety surveillance, but it requires awareness and understanding among healthcare professionals to achieve its potential. Reliance on readily available structured data such as diagnostic codes risk under-coding and imprecision. Clinical textual data might bridge these gaps, and natural language processing (NLP) has been shown to aid in scalable phenotyping across healthcare records in multiple clinical domains. In this study, we developed and validated a novel incident phenotyping approach using unstructured clinical textual data agnostic to Electronic Health Record (EHR) and note type. It’s based on a published, validated approach (PheRe) used to ascertain social determinants of health and suicidality across entire healthcare records. To demonstrate generalizability, we validated this approach on two separate phenotypes that share common challenges with respect to accurate ascertainment: 1) suicide attempt; 2) sleep-related behaviors. With samples of 89,428 records and 35,863 records for suicide attempt and sleep-related behaviors, respectively, we conducted silver standard (diagnostic coding) and gold standard (manual chart review) validation. We showed Area Under the Precision-Recall Curve of ∼ 0.77 (95% CI 0.75-0.78) for suicide attempt and AUPR ∼ 0.31 (95% CI 0.28-0.34) for sleep-related behaviors. We also evaluated performance by coded race and demonstrated differences in performance by race were dissimilar across phenotypes and require algorithmovigilance and debiasing prior to implementation.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis study was funded by the U.S. Food and Drug Administration's Sentinel Initiative. All investigators were supported on FDA WO2006. Dr. Walsh is also supported in part by NIMH R01MH121455 and R01MH116269.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:The IRB of Vanderbilt University Medical Center gave ethical approval for this work.I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.YesBecause data include sensitive PHI, data are not available for dissemination outside the study team.