RT Journal Article SR Electronic T1 AI-based differential diagnosis of dementia etiologies on multimodal data JF medRxiv FD Cold Spring Harbor Laboratory Press SP 2024.02.08.24302531 DO 10.1101/2024.02.08.24302531 A1 Xue, Chonghua A1 Kowshik, Sahana S. A1 Lteif, Diala A1 Puducheri, Shreyas A1 Zhou, Olivia T. A1 Walia, Anika S. A1 Guney, Osman B. A1 Zhang, J. Diana A1 Pham, Serena T. A1 Kaliaev, Artem A1 Carlota Andreu-Arasa, V. A1 Dwyer, Brigid C. A1 Farris, Chad W. A1 Hao, Honglin A1 Kedar, Sachin A1 Mian, Asim Z. A1 Murman, Daniel L. A1 O’Shea, Sarah A. A1 Paul, Aaron B. A1 Rohatgi, Saurabh A1 Saint-Hilaire, Marie-Helene A1 Sartor, Emmett A. A1 Setty, Bindu N. A1 Small, Juan E. A1 Swaminathan, Arun A1 Taraschenko, Olga A1 Yuan, Jing A1 Zhou, Yan A1 Zhu, Shuhan A1 Karjadi, Cody A1 Alvin Ang, Ting Fang A1 Bargal, Sarah A. A1 Plummer, Bryan A. A1 Poston, Kathleen L. A1 Ahangaran, Meysam A1 Au, Rhoda A1 Kolachalama, Vijaya B. YR 2024 UL http://medrxiv.org/content/early/2024/02/11/2024.02.08.24302531.abstract AB Differential diagnosis of dementia, with its overlapping symptomatology, remains a significant challenge in neurology. Here we present an algorithmic framework employing state-of-the-art techniques such as transformers as well as self-supervised frameworks and harnessing a broad array of data including demographics, person-level and family medical history, medication use, neuropsychological exams, functional evaluations, and multimodal neuroimaging to identify the etiologies contributing to dementia in individuals. The study utilized 9 independent, geographically diverse datasets, including the National Alzheimer’s Coordinating Center with 45,349 participants, the Alzheimer’s Disease Neuroimaging Initiative encompassing 1,821 participants, and the Frontotemporal Lobar Degeneration Neuroimaging Initiative comprising 253 participants.
Additionally, the Parkinson’s Progression Marker Initiative with 198 participants, the Australian Imaging, Biomarker and Lifestyle Flagship Study of Ageing cohort including 661 participants, the Open Access Series of Imaging Studies dataset with 491 participants, and the 4 Repeat Tauopathy Neuroimaging Initiative comprising 80 participants were used. The study also included two in-house datasets: one from the Lewy Body Dementia Center for Excellence at Stanford University with 182 participants, and another from the Framingham Heart Study including 1,651 individuals. Our model traverses the intricate spectrum of dementia by mirroring real-world clinical settings, aligning diagnoses with similar management strategies, and delivering robust predictions, even in the face of incomplete data. On the testing cohort, our model achieved a micro-averaged area under the receiver operating characteristic curve (AUROC) of 0.93, and a micro-averaged area under precision-recall curve (AUPR) of 0.87, in classifying individuals with normal cognition, mild cognitive impairment and dementia. Also, the micro-averaged AUROC was 0.95 and micro-averaged AUPR was 0.68 in differentiating 10 distinct dementia etiologies, defined through a consensus among a team of neurologists. One key strength lies in our model’s capability to address mixed dementias, a prevalent challenge in clinical practice, and the incorporation of interpretability techniques further unveiled vital disease-specific patterns. On a randomly selected subset (n = 100), our model differentiated true positive and true negative cases across 12 out of 13 categories (p < 0.01), as opposed to the neurologists’ expertise in identifying 9 out of these 13 categories (p < 0.01). Furthermore, the model’s correlations with different proteinopathies were substantiated through postmortem analyses.
This included a significant association with the global Alzheimer’s disease neuropathologic change (ADNC) score (p < 0.001), and notable correlations with TDP-pathology, the presence of old microinfarcts, arteriosclerosis, and Prion disease (all with p < 0.05). Our framework has the potential to be integrated as a screening tool for dementia in various clinical settings and drug trials, with promising implications for person-level management. Research in context. Systematic review: Previous studies have demonstrated that models utilizing multimodal data can differentiate individuals across the dementia spectrum, identifying those with normal cognition (NC), mild cognitive impairment (MCI), and dementia (DE). Some studies have also ventured beyond this tripartite classification, aiming to differentiate Alzheimer’s disease (AD) from other forms of non-AD dementia. Majority of these investigations have approached the task as a binary classification, primarily focusing on the distinction between AD and other dementia types. Also, limited studies have effectively tackled the intricate challenge of diagnosing mixed dementia, which is a common and complex issue encountered in clinical practice. Methods and findings: Employing multimodal data from 9 distinct cohorts, encompassing 50,686 participants, we developed an algorithmic framework that leverages transformers and self-supervised learning to facilitate differential dementia diagnoses. This model adeptly classifies individuals into 13 curated diagnostic categories, each tailored to reflect real-world clinical needs. These categories comprehensively cover the cognitive spectrum, ranging from NC, MCI to DE, and extend to 10 distinct dementia types. Our model demonstrates the capability to accurately diagnose dementia, even with incomplete data, and efficiently manage cases involving multiple co-occurring dementia conditions, a common occurrence in clinical practice.
It has shown commendable performance, surpassing expert clinical assessments, and its predictions have been corroborated by postmortem data, particularly in relation to various proteinopathies. Interpretation: Our work provides a robust and adaptable framework for comprehensive dementia screening for drug trials and in various clinical settings, ranging from primary care to memory clinics. Competing Interest Statement: The authors have declared no competing interest. Funding Statement: This project was supported by various grants. Author Declarations: I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained. Yes. I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals. Yes. I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance). Yes. I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable. Yes. All data produced in the present study are available upon reasonable request to the authors.