Abstract
INTRODUCTION Early detection of patients with cognitive impairment may facilitate care for individuals in this population. Natural language processing (NLP) is a potential approach to identifying patients with cognitive impairment from electronic health records (EHR).
METHODS We used three machine learning algorithms (logistic regression, multilayer perceptron, and random forest) using clinical terms extracted by NLP to predict cognitive impairment in a cohort of 199 patients. Cognitive impairment was defined as a mini-mental status exams (MMSE) score <24.
RESULTS NLP identified 69 (35%) patients with cognitive impairment and ICD codes identified 44 (22%). Using MMSE as a reference standard, NLP sensitivity was 35%, specificity 66%, precision 41%, and NPV 61%. The random forest method had the best test parameters; sensitivity 95%, specificity 100%, precision 100%, and NPV 97%
DISCUSSION NLP can identify adults with cognitive impairment with moderate test performance that is enhanced with machine learning.
Competing Interest Statement
TVV was part of launching Clinithink and retains a financial interest in the company.
Funding Statement
This research was funded by a grant from the National Institute on Aging, R01AG066471.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
IRB of Icahn School of Medicine at Mount Sinai gave ethical approval for this work.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
The Chan Zuckerberg Initiative, Cold Spring Harbor Laboratory, the Sergey Brin Family Foundation, California Institute of Technology, Centre National de la Recherche Scientifique, Fred Hutchinson Cancer Center, Imperial College London, Massachusetts Institute of Technology, Stanford University, University of Washington, and Vrije Universiteit Amsterdam.