RT Journal Article SR Electronic T1 Contrastive Self-supervised Learning for Neurodegenerative Disorder Classification JF medRxiv FD Cold Spring Harbor Laboratory Press SP 2024.07.03.24309882 DO 10.1101/2024.07.03.24309882 A1 Gryshchuk, Vadym A1 Singh, Devesh A1 Teipel, Stefan A1 Dyrba, Martin A1 , YR 2024 UL http://medrxiv.org/content/early/2024/07/04/2024.07.03.24309882.abstract AB Neurodegenerative diseases such as Alzheimer’s disease (AD) or frontotemporal lobar degeneration (FTLD) involve specific loss of brain volume, detectable in vivo using T1-weighted MRI scans. Supervised machine learning approaches classifying neurodegenerative diseases require diagnostic-labels for each sample. However, it can be difficult to obtain expert labels for a large amount of data. Self-supervised learning (SSL) offers an alternative for training machine learning models without data-labels. We investigated if the SSL models can applied to distinguish between different neurodegenerative disorders in an interpretable manner. Our method comprises a feature extractor and a downstream classification head. A deep convolutional neural network trained in a contrastive self-supervised way serves as the feature extractor, learning latent representation, while the classifier head is a single-layer perceptron. We used N=2694 T1-weighted MRI scans from four data cohorts: two ADNI datasets, AIBL and FTLDNI, including cognitively normal controls (CN), cases with prodromal and clinical AD, as well as FTLD cases differentiated into its sub-types. Our results showed that the feature extractor trained in a self-supervised way provides generalizable and robust representations for the downstream classification. For AD vs. CN, our model achieves 82% balanced accuracy on the test subset and 80% on an independent holdout dataset. Similarly, the Behavioral variant of frontotemporal dementia (BV) vs. CN model attains an 88% balanced accuracy on the test subset. The average feature attribution heatmaps obtained by the Integrated Gradient method highlighted hallmark regions, i.e., temporal gray matter atrophy for AD, and insular atrophy for BV. In conclusion, our models perform comparably to state-of-the-art supervised deep learning approaches. This suggests that the SSL methodology can successfully make use of unannotated neuroimaging datasets as training data while remaining robust and interpretable.Competing Interest StatementS.T. served as member of advisory boards of Lilly, Eisai, and Biogen, and is member of the independent data safety and monitoring board of the study ENVISION (Biogen). V.G., D.S. and M.D. declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.Funding StatementThis study was supported by the German Research Foundation (Deutsche Forschungsgemeinschaft, DFG) under grant DY151/2-1, project ID 454834942.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:The data used for this study is publicly available and can be obtained from the respective repositories: Alzheimer's Disease Neuroimaging Initiative (ADNI) http://adni.loni.usc.edu/data-samples/access-data, Australian Imaging Biomarkers and Lifestyle flagship study of ageing (AIBL) https://aibl.csiro.au, and Frontotemporal Lobar Degeneration Neuroimaging Initiative (FTLDNI) https://memory.ucsf.edu/research-trials/research/allftd. Data collection of the respective neuroimaging initiatives was approved by the internal review boards of each of the participating study sites. See https://adni.loni.usc.edu and https://aibl.csiro.au for details. All initiatives met common ethical standards in the collection of the data such as the Declaration of Helsinki. Analysis of the data was approved by the internal review board of the Rostock University Medical Center, reference number A 2020-0182.I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.YesThe data used for this study is publicly available and can be obtained from the respective repositories: Alzheimer's Disease Neuroimaging Initiative (ADNI) http://adni.loni.usc.edu/data-samples/access-data, Australian Imaging Biomarkers and Lifestyle flagship study of ageing (AIBL) https://aibl.csiro.au, and Frontotemporal Lobar Degeneration Neuroimaging Initiative (FTLDNI) https://memory.ucsf.edu/research-trials/research/allftd. http://adni.loni.usc.edu/data-samples/access-data https://aibl.csiro.au https://memory.ucsf.edu/research-trials/research/allftd ADAlzheimer’s diseaseADNIAlzheimer’s Disease Neuroimaging InitiativeAIBLAustralian Imaging, Biomarker & Lifestyle Flagship Study of AgeingBNBatch normalisationBVbehavioral variant of frontotemporal dementiaCNCognitively normal participantsCNNConvolutional neural networkConvNeXTA highly optimized CNN model architecture recently introduced by Liu et al. (2022b)DZNEDeutsches Zentrum für Neurodegenerative Erkrankungen (German Center for Neurodegenerative Diseases)FTLDFrontotemporal lobar degenerationFTLDNIFrontotemporal Lobar Degeneration Neuroimaging InitiativeGELUGaussian error linear unitsGrad-CAMGradient-weighted class activation mappingIGIntegrated gradientsInfoNCEA form a contrastive loss metric, where NCE stands for Noise-Contrastive EstimationLNLayer-wise normalizationLRPLayer-wise relevance propagationMCCMatthews correlation coefficientMCIMild cognitive impairmentMNIMontreal Neurological InstituteMRIMagnetic resonance imagingNNCLRNearest-Neighbor Contrastive LearningPNFAProgressive non-fluent aphasiaSLSupervised learningSSLSelf-supervised learningSVsemantic variant of frontotemporal dementiaViTVision TransformersXAIExplainable artificial intelligence