Mild cognitive impairment cases affect the predictive power of Alzheimer’s disease diagnostic models using routine clinical variables

Caitlin A. Finney; Alzheimer’s Disease Neuroimaging Initiative; Artur Shvetcov

doi:10.1101/2025.02.04.25321694

Abstract

Diagnostic models using primary care routine clinical variables have been limited in their ability to identify Alzheimer’s disease (AD) patients. In this study we sought to better understand the effect of mild cognitive impairment (MCI) on the predictive performance of AD diagnostic models. We sourced data from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) cohort. CatBoost was used to assess the utility of routine clinical variables that are accessible to primary care physicians, such as hematological and blood tests and medical history, in multiclass classification between healthy controls, MCI, and AD. Our results indicated that MCI indeed affected the predictive performance of AD diagnostic models. Of three subgroups of MCI that we found, this finding was driven by a subgroup of MCI patients that likely have prodromal AD. Future research should focus on distinguishing MCI from prodromal AD as the utmost priority for improving translational AD diagnostic models for primary care physicians.

Introduction

Alzheimer’s disease (AD), the most prevalent dementia accounting for 50-70% of cases, is the leading cause of disability among adults over age 65 ¹. With a rapidly increasing prevalence, AD is expected to cost the world economy more than 14.5 trillion international dollars over the next 30 years to 2050 ². Improving the ability to accurately diagnose AD is of the utmost importance, as it ensures that patients receive appropriate support and interventions and have the time to employ lifestyle adjustments that prolong independence and quality of life ³.

Despite this, the timely and accurate diagnosis of AD remains challenging. To date, a significant number of diagnostic tools and models of AD have focused on the use of magnetic resonance imaging (MRI) and positron emission tomography (PET) scans to detect early indicators of AD pathology including Aβ plaques ^4–12. Others have relied on cerebrospinal fluid (CSF) and plasma biomarkers including Aβ₄₂/Aβ₄₀ ratio, total tau protein, phosphorylated tau 181 (p-tau₁₈₁), p-tau₂₃₁, p-tau₂₁₇, neurofilament light (NfL) and glial fibrillary acidic protein (GFAP) ^13,14. Although these tests show promise for AD diagnostics, there are many practical limitations that prevent widespread clinical implementation. Both neuroimaging scans and biomarker assays are associated with a high cost both to the healthcare system and patient, low availability, and high wait times, especially for those patients in rural areas ^14–17. Further, collecting CSF is an invasive procedure that requires specific technical medical expertise ¹⁴. In line with these challenges, a recent Alzheimer’s Association Primary Care Physician Dementia Care Training Survey found that half of primary care physicians do not feel that they have the local specialist resources to meet patient demand ¹⁸. In fact, primary care physicians remain better able to identify those without AD than those with it ^19,20. For many patients with AD, primary care physicians are the first point of contact with the healthcare system making them essential for patient triage, diagnosis, and management ¹⁵. Therefore, ensuring that primary care physicians have the skills and tools required for AD diagnostics is critical.

To improve diagnostic capabilities among primary care physicians, previous studies have used machine learning to examine the potential of routine, easy-to-obtain clinical measures. For example, these models include the Cardiovascular Risk Factors, Aging, and Dementia (CAIDE) ²¹, Study on Aging, Cognition and Dementia (AgeCoDe) ²², Australian National University Alzheimer’s Disease Risk Index (ANU-ADRI) ²³, Rapid Assessment of Dementia Risk (RADaR) for older adults ²⁴, and Brief Dementia Screening Indicator (BDSI)²⁵. There are limitations to these previous diagnostic models, however, as they report low sensitivities and positive predictive values (PPV), indicating that they are unable to reliably identify someone with AD. This was confirmed by a recent study showing existing machine learning-based diagnostic models of AD miss 84-91% of incident AD cases and therefore have limited clinical utility ²⁶.

The factors underlying the inability of these models to reliably diagnose AD remain unclear. One possibility is the presence of patients with mild cognitive impairment (MCI), which affects 10-15% of the population over age 65 ²⁷. Although AD first manifests clinically as MCI, not all patients with MCI will go on to develop AD ^27–29. Further, MCI is known to be characterized by heterogeneous patients with multiple etiologies associated with differing clinical presentations ²⁸. In line with this, there is evidence that MCI cases themselves are difficult to predict. Previous studies have reported precision, recall, and sensitivity metrics at or near chance levels for MCI classification ^30,31, distinguishing MCI from AD ³², and predicting conversion from MCI to AD ^33–36. This suggests that these models would have low practical utility in the clinic and are likely to mislabel most MCI cases. Combined, this highlights that MCI cases may affect the predictive power of translational AD diagnostic models.

Leveraging data from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) cohort, we sought to better understand the effects of MCI cases on the predictive performance of AD diagnostic models using easy-to-obtain clinical variables.

Results

CatBoost can identify healthy controls and MCI but not AD cases

Using baseline diagnosis, we identified patients in the ADNI cohort with AD (N = 181), MCI (N = 473), and healthy controls (N = 220). There were no differences across the groups with respect to age or sex distributions, with all groups having more males than females (Supplementary Table 1). The dataset was then randomly split into 80% training and validation and 20% withheld testing datasets. We included 120 features of routine, easy-to-obtain clinical variables (Supplementary Table 1). Using CatBoost, we first sought to determine whether any of these features were able to differentiate between healthy control, MCI, and AD patients. Our multiclass classification model was able to successfully identify healthy controls, showing high performance metrics > 0.82 (Table 1). Performance metrics for MCI identification were lower, however, at > 0.75 (Table 1). In contrast to healthy controls, performance metrics suggested that the model was largely unable to successfully differentiate AD cases, with a very low sensitivity of 0.63 and PPV of 0.65 (Table 1).

Table 1. Performance metrics of a CatBoost model for diagnosing healthy controls, MCI, and AD using 120 clinical variables as features

Given that our model was able to successfully identify healthy control cases, we hypothesized that it was having difficulty differentiating between MCI and AD cases, specifically. A confusion matrix showed that this was indeed the case (Figure 1A). The model misclassified only 9 healthy controls as MCI patients. This increased, however, to a misclassification of 13 MCI cases as AD and 12 AD cases as MCI (Figure 1A).

Figure 1.

Performance of a CatBoost model and feature importance for diagnosis of healthy control, MCI, and AD. (A) Confusion matrix showing the true (correct) and false (incorrect) prediction of healthy control, MCI, and AD. (B) Absolute SHAP values and (C) heat map of the contribution of the main features for predicting healthy controls. (D) Absolute SHAP values and (E) heat map of the contribution of the main features for predicting MCI. (F) Absolute SHAP values and (G) heat map of the contribution of the main features for predicting AD. Abbreviations: APP: augmented pulse pressure; BAT126: vitamin B12; BMI: body mass index; GDMEMORY: item 10 of Geriatric Depression Scale; GDTOTAL: total score Geriatric Depression Scale; HMT3: red blood cell count; HMT4: mean corpuscular volume; HMT7: white blood cell count; HMT11: eosinophils; HMT15: neutrophils; HMT17: white blood cell count; HMT100: mean corpuscular hemoglobin; HMT102: mean corpuscular hemoglobin concentration; LDELTOTAL: total number of story units recalled on Logical Memory-Delayed Recall; LIMMTOTAL: total number of story units recalled on Logical Memory-Immediate Recall; MHPSYCH: psychiatric medical history; PTEDUCAT: education; PTNOTRT: retirement; RCT3: gamma-glutamyl transferase; RCT: aspartate aminotransferase (serum glutamic-oxaloacetic transaminase); RCT8: serum uric acid; RCT20: cholesterol; RCT1407: alkaline phosphatase; RCT1408: lactate dehydrogenase; VSPULSE: seated pulse rate (per minute); VSRESP: respirations (per minute).

To better understand which features were being used by our CatBoost model to predict between the groups, we performed a SHAP analysis. The five most important features for identifying healthy controls included the Logical Memory-Delayed Recall, item 10 of the Geriatric Depression Scale that asks about memory problems, years of education, Logical Memory-Immediate Recall, and total score on the Geriatric Depression Scale (Figure 1B,C). For MCI cases, the total score on the Geriatric Depression Scale was replaced by eosinophils (Figure 1D,E) and by psychiatric medical history for AD (Figure 1F,G).

Feature selection slightly improves the predictive performance of CatBoost for identifying MCI and AD patients

To identify if we could improve the performance of the CatBoost model, we performed filter-based feature selection to include only those variables significantly associated with the outcome (diagnosis). This resulted in 19 features being included (Supplementary Table 2).

As before, our model was successfully able to identify healthy controls, with only a small reduction in PPV (Table 2). There was also a negligible reduction, relative to the model with 120 features, in the predictive performance for identifying MCI cases (Table 2). For AD cases, however, filter-based feature selection improved the predictive performance, with sensitivity increasing from 0.63 to 0.74 (Table 2).

Table 2. Performance metrics of a CatBoost model for diagnosing healthy controls, MCI, and AD following feature selection

These changes in predictive performance were also reflected in the confusion matrix. Here, 11 healthy controls were misclassified as MCI whereas only 9 cases of MCI were misclassified as AD and 17 AD cases misclassified as MCI (Figure 2A). This resulted in a significant drop in PPV of AD cases from 0.65 down to 0.60 here.

Figure 2.

Performance of the CatBoost model and feature importance for diagnosis of healthy control, MCI, and AD after filter-based feature selection. (A) Confusion matrix showing the true (correct) and false (incorrect) prediction of healthy control, MCI, and AD. (B) Absolute SHAP values and (C) heat map of the contribution of the main features for predicting healthy controls. (D) Absolute SHAP values and (E) heat map of the contribution of the main features for predicting MCI. (F) Absolute SHAP values and (G) heat map of the contribution of the main features for predicting AD. Abbreviations: apoe: apolipoprotein E genotype; GDMEMORY: item 10 of Geriatric Depression Scale; GDTOTAL: total score Geriatric Depression Scale; HMT8: neutrophils; HMT15: percent neutrophils; HMT16: lymphocytes; HMT100: mean corpuscular hemoglobin; HSI: heart stress index; LDELTOTAL: total number of story units recalled on Logical Memory-Delayed Recall; LIMMTOTAL: total number of story units recalled on Logical Memory-Immediate Recall; MHPSYCH: psychiatric medical history; NXGAIT: gait on neurological exam; PPR: pulse to pressure ratio; PTEDUCAT: education; VSPULSE: seated pulse rate (per minute).

Across healthy controls, MCI, and AD, a SHAP analysis indicated that the most important features for predicting all three groups were Logical Memory-Delayed Recall, Logical Memory-Immediate Recall, and years of education (Figure 2B-G). For healthy controls, additional features included memory-related item 10 and total score of the Geriatric Depression Scale (Figure 2B-C). For MCI, item 10 of the Geriatric Depression Scale and mean corpuscular hemoglobin were also important (Figure 2D-E). The two additional features that were important for predicting AD patients were psychiatric medical history and percent neutrophils.

Poor predictive performance of CatBoost models is driven by a subgroup of MCI patients that are characteristically similar to those with AD

Given that MCI is made up of a highly heterogeneous group of patients ^27–29, we hypothesized that MCI patients may be affecting the predictive performance of AD diagnostic models. To identify if this was the case, we first tested the ability of CatBoost to distinguish between only healthy control and AD patients using the 19 features previously identified using filter-based feature selection. Reducing our model to a simple binary classification resulted in the ability to readily distinguish between healthy control and AD patients as indicated by high performance metrics (>0.98; Table 3).

Table 3. Performance metrics of CatBoost for diagnosing healthy control or AD patients

We next sought to understand why MCI patients were affecting predictive performance of our diagnostic models. Using a principal component analysis (PCA) based on our 19 identified features, we found that while there was clear group separation between healthy control and AD patients, MCI patients were distributed across both clusters (Figure 3A). We identified that the MCI group could be divided into three distinct subclusters: one that overlapped healthy controls (MCI-Healthy), one that significantly overlapped with AD patients (MCI-AD), and a third that was distinct yet more closely related to AD (MCI-MCI; Figure 3B). This was further confirmed by a hierarchical cluster dendrogram (Figure 3C) and inertia plot (Figure 3D).

Figure 3.

Identification of the distinct subgroups of MCI patients. (A) Principal component analysis showing clear separation between healthy control and AD patients but significant overlap of MCI. (B) Principal component analysis showing that MCI forms three distinct subclusters (MCI-Healthy, MCI-AD, and MCI-MCI) that have varying amounts of overlap with healthy control and AD. (C) Hierarchical cluster dendrogram showing that MCI-Healthy is closely related to healthy control, MCI-AD is closely related to AD and that MCI-MCI is distinct but more related to AD. (D) Elbow plot showing the relationship between the number of clusters and the within-cluster sum of squared distances (inertia). The dotted line represents the optimal number of clusters.

We then looked to characterize the three subgroups of MCI using the top significantly different features (see Supplementary Table 3 for statistical analyses). This showed that the three MCI subgroups differed on apolipoprotein E ε4 (APOE4) genotype (Figure 4A), lymphocyte count (Figure 4B), neutrophils (Figure 4C-D), seated pulse rate (Figure 4E), Geriatric Depression Scale item 10 on memory complaints (Figure 4F), and their total scores on the Logical Memory Immediate Recall (Figure 4G) and Delayed Recall (Figure 4H) tests. We found that MCI-Healthy was the most distinct group across these features relative to MCI-MCI and MCI-AD. The MCI-Healthy group had a lower percentage of people with at least one APOE4 allele, lower neutrophils, and a lower percentage of people who replied yes to having memory complaints on the Geriatric Depression Scale. They also had higher lymphocytes and higher total scores on the Logical Memory tests.

Figure 4.

Comparison between the top significantly (p < 0.001) different variables between the three MCI groups (MCI-Healthy, MCI-MCI, and MCI-AD). (A) Percent of patients with at least one APOE4 allele. (B) Lymphocyte count in blood. (C) Neutrophil count in blood. (D) Percent of neutrophils in blood. (E) Seated pulse rate per minute. (F) Percent of patients that replied yes to Geriatric Depression Scale item 10 (Do you feel you have more problems with memory than most?). (G) Total score on the Logical Memory Delayed Recall test. (H) Total score on the Logical Memory Immediate Recall test.

There were also differences across the three MCI subgroups with how stable their cognitive diagnosis was or whether it changed over time. The MCI-Healthy subgroup was the most likely to maintain a stable MCI state (Figure 5A) or revert to being cognitively healthy (Figure 5B) over time. The MCI-MCI subgroup was slightly more likely than the MCI-AD subgroup to maintain MCI (Figure 5A) or revert to healthy (Figure 5B). The MCI-Healthy subgroup relative to both the MCI-MCI (χ² = 13.294, p = 0.0039) and MCI-AD groups (χ² = 36.483, p < 0.0001) was also less likely to progress to AD. There were no significant differences in the rate of progression to AD between MCI-MCI and MCI-AD subgroups (Figure 5C).

Figure 5.

Percent of MCI patients with cognitive diagnosis stability or change over time across the three subgroups: MCI-Healthy, MCI-MCI, and MCI-AD. (A) Percent of MCI patients with a stable diagnosis of MCI over time. (B) Percent of MCI patients who revert to cognitively healthy status over time. (C) Percent of MCI patients who progress to AD over time.

Finally, we sought to identify which, if any, of the three MCI subgroups was driving poor predictive performance. To do this, we implemented three CatBoost models where one of the three MCI subgroups was removed and determined the models’ ability to identify healthy controls, MCI, and AD patients. When we removed either the MCI-Healthy or MCI-MCI groups, our CatBoost models were able to identify healthy controls, as before, but were still unable to identify AD patients (Table 4). When we removed the MCI-AD subgroup, however, our CatBoost model was able to identify all patient types (Table 4), suggesting that the MCI-AD subgroup, specifically, drives poor predictive performance of our diagnostic models.

Table 4. Performance metrics of CatBoost for diagnosing healthy control, MCI, or AD patients after removing specific MCI subgroups

Discussion

There is an urgent need to better support primary care physicians in their clinical decision making on MCI and AD. To do this, we leveraged data from the ADNI cohort and used machine learning to determine the diagnostic potential of 120 easy-to-obtain clinical measures for MCI and AD.

Using all 120 measures as features, we found that while our model could readily identify healthy control cases, it was unable to identify MCI and AD patients and had a high level of confusion between these two diagnostic categories. Using filter-based feature selection, we narrowed down the measures to 19 that were highly correlated with the outcome. Although this led to a degree of improvement in the ability to diagnose MCI and AD, sensitivities were still low, ranging from 0.71 to 0.74. This suggests that our model would likely miss roughly 26-29% of MCI and AD cases. Of note, however, was that our model outperformed existing ones ^21–25,30–32 and we therefore sought to identify the features that were most important for diagnostic prediction. Using SHAP analysis, we showed that important features included Logical Memory Delayed and Immediate Recall scores, years of education, responses to item 10 (memory) and total score on the Geriatric Depression Scale, eosinophils, and psychiatric medical history.

These findings are in line with previous research. Higher Geriatric Depression Scale scores are associated with faster cognitive decline and increased risk of MCI and AD ^37–41. In line with this, AD and MCI patients have been shown to have increased incidence of psychiatric issues including depressive, apathy, and anxiety disorders ^42–44. Low levels of education are also known to be associated with an increased risk of MCI and AD ^45,46.

Peripheral neutrophil activation and lymphocytes, as well as a neutrophil-to-lymphocyte ratio, have been widely implicated in MCI and AD ^47–52. Lower levels of hemoglobin in blood have also been linked to decreased cognitive function and AD ^52,53. Although many studies show the importance of Logical Memory Immediate and Delayed Recall test scores for identifying healthy controls, MCI, and AD patients ^54,55, others have indicated that they have a limited diagnostic accuracy ⁵⁶. The reasons for these different findings are not clear and warrant further research.

An important finding of our study was that our models were confusing MCI and AD cases, specifically. To better understand why this was the case, we showed that MCI could be divided into three distinct subgroups: a subgroup that overlapped with healthy controls (MCI-Healthy), a subgroup that overlapped with AD (MCI-AD), and a subgroup that fell in between but was more closely related to AD (MCI-MCI). When we characterized these three subgroups, we found that they differed on key measures including APOE genotype, lymphocytes and neutrophils, seated pulse rate, and memory as measured by item 10 of the Geriatric Depression Scale and the Logical Memory Delayed and Immediate Recall tests. To date, there has been little consensus in the literature about how many subtypes of MCI exist ⁵⁷. In our study, we showed that in the ADNI cohort there are three distinct subgroups. Other studies, however, report anywhere from two to four or five subtypes, ranging from amnesic to non-amnesic MCI ^28,58–61. The discrepancies may lie in the use of a priori definitions of MCI, largely based on the number and type of cognitive domains that are impaired, versus our approach of data-driven post hoc definitions. Although a complete comparative assessment of both approaches was outside the scope of our current study, in part because sufficiently powered data are not available, future research would benefit from examining the merits and pitfalls of both.

We also found significant differences in the long-term trajectories of patients within each of the three MCI subgroups. MCI-AD patients were less likely to maintain a stable MCI diagnosis or to revert to being cognitively healthy relative to the MCI-Healthy and MCI-MCI subgroups. They were also more likely to progress to AD over time. Our MCI-AD group appears to overlap with amnesic MCI previously reported in the literature. For example, studies show that patients with amnesic MCI are more likely to progress to AD over time ^62–64, overlapping with our findings here for MCI-AD. Risk of amnesic MCI also increases in patients with at least one copy of the APOE ε4 genetic variant ⁶⁰, in line with our finding that the MCI-AD subgroup has the highest percentage of patients with this APOE genotype. Further, MCI-AD may be indicative of prodromal AD, as impaired delayed recall is reported to be the earliest cognitive change in this group ²⁹.

Interestingly, we showed that the MCI-AD group, specifically, largely drove our models’ low ability to distinguish between MCI and AD. This finding has important implications because it suggests that only one subtype of MCI leads to poor performance of diagnostic AD models. Based on the data presented here, it is not clear how to get around this issue in practice. One solution is to further characterize these three subtypes of MCI and develop models that can distinguish between them as well as AD and healthy control cases. To do this, however, there is a need to prioritize the establishment of substantially larger, well-defined (i.e. many clinical measures obtained) cohorts of patients with MCI and to follow their trajectories over time. This is an especially important consideration in the context of supporting primary care physicians’ ability to undertake diagnostics. Primary care physicians have particular difficulty in correctly identifying MCI cases in their patients, a difficulty that is exacerbated by inadequate infrastructure, resources, and equipment ^15,65,66. Current diagnostic methods for MCI largely rely on clinical judgement and include subjective or objective cognitive impairment ^28,29,67 with a preservation of basic daily functioning ^28,29, which distinguishes it from AD. However, in practice, there appear to be limitations to clinical judgement. A recent study of Medicare data from the US representing >54,000 practices and >226,000 primary care physicians showed that only 0.1% of physicians and practices have MCI diagnosis rates within the expected range ⁶⁸. Overall, our work highlights the importance of continuing to focus on differentiating MCI cases from prodromal AD to improve the translatability of effective diagnostic models of AD that can be used by primary care physicians in the clinic.

There are some additional considerations with respect to our findings and implementing them in practice. The first is whether primary care physicians are motivated to diagnose MCI, as some previous studies have shown that there is a low motivation due to a perceived lack of benefits for the patients ⁶⁶. This is likely driven by a lack of effective treatments that physicians can offer their patients. It is still important, however, to acknowledge that by providing an MCI, or even an MCI subtype, diagnosis, primary care physicians can provide patients and their families with assurances that what they are experiencing has a name and give them the knowledge required to plan for the future ⁶⁹. A second consideration of our work is that primary care physicians have reported feeling that they do not have sufficient time during a brief consultation to perform broad cognitive assessments ⁶⁶. Further, some feel that they lack the neuropsychological training needed to complete this type of testing, which may limit widespread translation into the clinic ^15,67. Our results suggest that rather than needing to learn many complex neuropsychological questionnaires, primary care physicians only need to use the Logical Memory Immediate and Delayed Recall tests and the Geriatric Depression Scale. A final limitation of our work is that we did not examine the diagnostic predictive capabilities of any AD biomarkers. For example, previous studies have linked MCI subtypes with changes in cerebrospinal fluid (CSF) total tau ⁵⁹ and phospho-tau181 ⁷⁰. The exclusion of these data, however, was deliberate on our part as biomarkers from plasma and CSF are still not widely used, nor are they recommended for use, in clinical practice, especially in the context of MCI ^15,71. Further, primary care physicians lack the specialized equipment and expertise to routinely collect CSF samples from patients and AD biomarker panels remain expensive. Despite this, as AD biomarkers increase in popularity and become more widely available, future work would benefit from examining the diagnostic potential of these in the context of differentiating between subtypes of MCI, AD, and healthy controls.

In conclusion, we have identified that a particular subgroup of MCI affects the predictive performance of AD diagnostic models using primary care routine clinical variables. This subgroup (MCI-AD) was characteristically the most similar to AD and, in line with this, was significantly more likely to progress to AD over time relative to the other MCI subgroups. These findings suggest that MCI-AD cases are likely representative of patients with prodromal AD. This work highlights the importance of AD diagnostic models focusing specifically on differentiating MCI cases from prodromal AD cases (who are also diagnosed as MCI) as the main way to improve their diagnostic predictive power and translatability.

Methods

Data and Patients from the ADNI Cohort

This retrospective study used data generated from the ADNI cohort, publicly available at https://ida.loni.usc.edu/ using data downloaded in April 2024. Patients in the ADNI cohort were diagnosed as either cognitively healthy, MCI, or AD based on the presence of subjective memory complaints, an MMSE score, and CDR ⁷². Healthy controls had no subjective memory complaints, MMSE range of 24-30, and CDR of 0. Patients with MCI had subjective memory complaints, MMSE 24-30 and CDR of 0.5. AD patients similarly had subjective memory complaints, a lower MMSE of 20-26, CDR of greater than 0.5, and met the criteria for probable AD based on the National Institute of Neurological and Communicative Disorders and Stroke–Alzheimer’s Disease and Related Disorders Association (NINCDS-ADRDA) criteria ⁷². Supplementary Table 1 shows the demographic characteristics of this cohort. We included continuous (e.g. weight, height, BMI, routine blood tests) and categorical (e.g. APOE genotype, physical exam) variables obtained at patients’ baseline visits that we determined would be appropriate for a primary care physician to complete (Supplementary Table 1). We also created additional variables based on cardiovascular metrics including pulse and blood pressure. These included augmented pulse pressure (systolic BP - diastolic BP) * ls, pulse to pressure ratio, heart stress index, mean arterial pressure, and systolic blood pressure to diastolic blood pressure ratio. The ADNI cohort study was approved by the institutional review boards of the participating ADNI centers, and all patients provided informed consent.
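As a minimal sketch of how derived cardiovascular variables of this kind can be computed from one set of vital signs, the Python snippet below calculates pulse pressure, mean arterial pressure, and the systolic to diastolic ratio. The helper name `derived_bp_features` is hypothetical, and the mean arterial pressure formula (diastolic BP plus one third of pulse pressure) is the common clinical approximation, assumed here rather than taken from the study. The other derived variables are left out because their definitions are not stated in the text.

```python
def derived_bp_features(systolic: float, diastolic: float) -> dict[str, float]:
    """Return illustrative blood pressure features (hypothetical helper)."""
    if diastolic <= 0 or systolic <= diastolic:
        raise ValueError("expected systolic > diastolic > 0")
    pulse_pressure = systolic - diastolic
    return {
        "pulse_pressure": pulse_pressure,
        # Assumed textbook approximation; not confirmed as the study's formula.
        "mean_arterial_pressure": diastolic + pulse_pressure / 3,
        "sbp_dbp_ratio": systolic / diastolic,
    }


# Example reading in mmHg (invented values for illustration only).
features = derived_bp_features(systolic=120, diastolic=80)
```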

Statistical Analyses and Machine Learning

In our initial experiments, all 120 clinical variables (Supplementary Table 1) were used as features in our multi-class diagnostic model. We also applied a filter-based feature selection method to determine the clinical variables that were highly associated with the outcome (i.e. diagnosis). Here, we used a Kruskal-Wallis test (p < 0.01) for identifying significant continuous features and Cramér’s V (> 0.11) for categorical ones.
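The two filters can be illustrated with a small pure-Python sketch. It is not the study's code: the function names and toy data are invented, the Kruskal-Wallis helper is restricted to three groups (so the chi-square reference distribution has 2 degrees of freedom and its upper tail is exactly exp(-H/2)), and Cramér’s V is computed without bias correction. Only the thresholds (p < 0.01 and V > 0.11) come from the text.

```python
import math
from collections import Counter


def kruskal_wallis_3(groups):
    """Kruskal-Wallis H test for exactly three groups, with tie correction."""
    if len(groups) != 3:
        raise ValueError("this sketch only handles three groups")
    pooled = sorted(v for g in groups for v in g)
    n = len(pooled)
    # Average rank for each distinct value (tied values share the mean rank).
    rank_of = {}
    i = 0
    while i < n:
        j = i
        while j < n and pooled[j] == pooled[i]:
            j += 1
        rank_of[pooled[i]] = (i + 1 + j) / 2  # mean of ranks i+1 .. j
        i = j
    h = 12 / (n * (n + 1)) * sum(
        sum(rank_of[v] for v in g) ** 2 / len(g) for g in groups
    ) - 3 * (n + 1)
    ties = Counter(pooled).values()
    h /= 1 - sum(t**3 - t for t in ties) / (n**3 - n)
    # Chi-square survival function with 2 degrees of freedom.
    return h, math.exp(-h / 2)


def cramers_v(xs, ys):
    """Cramér's V between two categorical sequences (no bias correction)."""
    n = len(xs)
    joint, row, col = Counter(zip(xs, ys)), Counter(xs), Counter(ys)
    chi2 = 0.0
    for r in row:
        for c in col:
            expected = row[r] * col[c] / n
            chi2 += (joint[(r, c)] - expected) ** 2 / expected
    return math.sqrt(chi2 / (n * (min(len(row), len(col)) - 1)))


# Toy continuous feature (e.g. a recall score) in three diagnostic groups.
hc, mci, ad = [14, 15, 16, 13, 17], [8, 9, 10, 7, 11], [2, 3, 1, 4, 5]
h_stat, p_value = kruskal_wallis_3([hc, mci, ad])
keep_continuous = p_value < 0.01

# Toy categorical feature (e.g. carrier status) against diagnosis.
diagnosis = ["HC"] * 4 + ["AD"] * 4
carrier = [0, 0, 0, 1, 1, 1, 1, 0]
v = cramers_v(diagnosis, carrier)
keep_categorical = v > 0.11
```

In practice the same quantities would more typically be obtained from a statistics library; the hand-written version is used here only to make each step explicit.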

To evaluate the diagnostic potential of the clinical variables, we used CatBoost ⁷³, a gradient boosting framework optimized for both categorical and continuous variables. The complete dataset was initially split randomly into an 80% training set and a 20% held-out testing set to evaluate the final models’ performance. To address class imbalances across healthy control, MCI, and AD patients, we performed oversampling using the “imblearn” library ⁷⁴ where the underrepresented class was randomly resampled. During model development, the training set was further randomly subdivided into training and validation datasets using stratified k-fold cross-validation with four folds to ensure balanced representation across classes and mitigate potential biases. Hyperparameter fine-tuning was performed using the “optuna” library ⁷⁵, which employs a Bayesian optimization approach to efficiently search for the optimal combination of parameters. These included the following, with ranges in brackets: maximum number of trees that can be built (iterations; 100-2000), learning rate (0.01-0.3), coefficient at the L2 regularization term of the cost function (1-10), Bayesian bootstrap parameter (0.5-10.0), randomness for scoring splits (1.0-3.0), depth of the trees (3-10), minimum number of training samples in a leaf (1-100), and percentage of features to use at each split selection (0.5-1.0).
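The resampling scheme can be sketched in plain Python as follows. This is an illustration under stated assumptions rather than the study's pipeline: the helper names and toy labels are invented, the "imblearn" and "catboost" libraries are not used, and oversampling is applied only to the training portion of each fold, which is a design choice assumed here because the text does not specify the order of the two steps.

```python
import random
from collections import Counter, defaultdict


def stratified_folds(labels, n_splits=4, seed=0):
    """Assign each sample index to a fold, keeping class proportions similar."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)
    folds = [[] for _ in range(n_splits)]
    for indices in by_class.values():
        rng.shuffle(indices)
        # Deal the shuffled indices of each class round-robin across folds.
        for position, idx in enumerate(indices):
            folds[position % n_splits].append(idx)
    return folds


def random_oversample(indices, labels, seed=0):
    """Resample smaller classes with replacement up to the largest class size."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for idx in indices:
        by_class[labels[idx]].append(idx)
    target = max(len(members) for members in by_class.values())
    balanced = []
    for members in by_class.values():
        balanced.extend(members)
        balanced.extend(rng.choices(members, k=target - len(members)))
    return balanced


# Toy labels with an imbalance across three diagnostic groups (invented).
labels = ["HC"] * 20 + ["MCI"] * 40 + ["AD"] * 16
folds = stratified_folds(labels, n_splits=4)
for k, validation_idx in enumerate(folds):
    train_idx = [i for j, fold in enumerate(folds) if j != k for i in fold]
    # Oversample the training part only, so validation data stays untouched.
    balanced_train = random_oversample(train_idx, labels, seed=k)
    counts = Counter(labels[i] for i in balanced_train)
```

Keeping the validation fold free of resampled duplicates avoids scoring a model on copies of cases it was trained on.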

To identify the clustering across healthy controls, MCI, and AD patients, we first performed a principal component analysis (PCA) using the filter-based selected features. We then identified subclusters (subgroups) of MCI patients by taking principal component (PC) 1 and PC2 and using Gaussian mixture models for clustering. We confirmed these results using a hierarchical cluster dendrogram. All experiments and optimizations were conducted using Python (v.3.11.7) with the libraries listed above and “pandas”, “numpy”, “matplotlib”, “seaborn”, “sklearn”, “catboost”, and “scipy” and Google Colab’s GPU-accelerated environment, which facilitated faster model training and evaluation. Source code is available at https://github.com/Art83/adni_mci.
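A minimal sketch of the PCA and Gaussian mixture step, using scikit-learn on synthetic data, is shown below. The data, the standardization step, the random seeds, and the `n_init` setting are assumptions made for illustration; only the use of PC1 and PC2 and of three mixture components follows the description above.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.mixture import GaussianMixture
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)
# Synthetic stand-in for the selected features: three well-separated blobs.
centers = np.array([[0.0] * 5, [6.0] * 5, [-6.0, 6.0, -6.0, 6.0, -6.0]])
X = np.vstack([rng.normal(loc=c, scale=1.0, size=(60, 5)) for c in centers])

# Project the standardized features onto the first two principal components.
pcs = PCA(n_components=2).fit_transform(StandardScaler().fit_transform(X))

# Fit a three-component Gaussian mixture on PC1 and PC2 and assign subgroups.
gmm = GaussianMixture(n_components=3, n_init=5, random_state=0).fit(pcs)
subgroup = gmm.predict(pcs)
```

With real data the number of components would need to be justified separately, for example with an elbow plot or an information criterion.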

Model evaluation

Performance of the machine learning models in this study was evaluated using a 20% held-out testing dataset. For all models, we report performance metrics including sensitivity (proportion of positive cases correctly identified), specificity (proportion of negative controls correctly identified), PPV (or precision; number of true positive cases / total number of predicted positive cases (true and false)), negative predictive value (NPV; number of true negative cases / total number of predicted negative cases (true and false)), and AUC (ability to distinguish between positive and negative cases). In the current study, we used specificity, NPV, sensitivity, and PPV as the main indicators of model performance. We note that while AUC is a commonly used performance metric in machine learning studies, it provides only limited insight into model performance ⁷⁶. This is further supported by previous AD predictive models that report high AUCs but low sensitivities and are therefore unable to identify incident AD cases ²⁶. We used a SHAP (SHapley Additive exPlanations) analysis to evaluate the relative contribution of features to our model performance ⁷⁷. This allowed us to identify those specific features that are most likely to act as translational diagnostic variables for primary care physicians. SHAP was done in Python (v.3.11.7) using the “shap” library.
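To make the four main metrics concrete, the following minimal sketch computes them from predicted and true labels. It assumes a one-vs-rest reading of the three-class problem (each diagnosis in turn treated as the "positive" class), which this section does not specify; the function name and the toy labels are hypothetical and are not study data.

```python
def one_vs_rest_metrics(y_true, y_pred, positive):
    """Sensitivity, specificity, PPV and NPV for one class against the rest."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)

    def ratio(num, den):
        # Undefined when the denominator is zero (e.g. no predicted positives).
        return num / den if den else float("nan")

    return {
        "sensitivity": ratio(tp, tp + fn),  # true positives / all actual positives
        "specificity": ratio(tn, tn + fp),  # true negatives / all actual negatives
        "ppv": ratio(tp, tp + fp),          # true positives / all predicted positives
        "npv": ratio(tn, tn + fn),          # true negatives / all predicted negatives
    }


# Toy labels for illustration only (HC = healthy control).
y_true = ["HC", "HC", "HC", "MCI", "MCI", "MCI", "AD", "AD", "AD", "AD"]
y_pred = ["HC", "HC", "MCI", "MCI", "MCI", "AD", "AD", "AD", "MCI", "AD"]
ad_metrics = one_vs_rest_metrics(y_true, y_pred, positive="AD")
```

In this toy example one AD case is predicted as MCI and one MCI case as AD, so AD sensitivity and PPV both fall to 0.75 while specificity and NPV stay at 5/6.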

Data Availability

The data used in this study is from the ADNI cohort and is available at https://ida.loni.usc.edu/.

Code Availability

Source code is available on GitHub at https://github.com/Art83/adni_mci.

Author Contributions

C.A.F. and A.S. jointly contributed to the concept and design, interpretation of data and critical review of the manuscript for important intellectual content. A.S. performed the statistical analyses. C.A.F. acquired the data and wrote the manuscript. All authors reviewed the manuscript.

Competing Interests

The authors declare no financial or non-financial competing interests.

Acknowledgements

The authors are grateful to the Alzheimer’s Disease Neuroimaging Initiative for providing the data and to all the patients and their families for their involvement in the study. This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors. C.A.F. receives salary support from the Neil & Norma Hill Foundation, Annemarie & Arturo Gandioli-Fumagalli Foundation, Perpetual Foundation – John Williams Endowment, and the Hillcrest Foundation.

References

1. World Health Organization. Dementia. (2023).
2. Chen, S. et al. The global macroeconomic burden of Alzheimer’s disease and other dementias: Estimates and projections for 152 countries or territories. The Lancet Global Health 12, E1534–E1543 (2024).
3. Porsteinsson, A. P., Isaacson, R. S., Knox, S., Sabbagh, M. N. & Rubino, I. Diagnosis of early Alzheimer’s disease: Clinical practice in 2021. The Journal of Prevention of Alzheimer’s Disease 8, 371–386 (2021).
4. Pellegrini, E. et al. Machine learning of neuroimaging for assisted diagnosis of cognitive impairment and dementia: A systematic review. Alzheimer’s & Dementia: Diagnosis, Assessment & Disease Monitoring 10, 519–535 (2018).
5. Javeed, A. et al. Machine learning for dementia prediction: A systematic review and future research directions. Journal of Medical Systems 47 (2023).
6. Chen, R. & Herskovits, E. H. Machine-learning techniques for building a diagnostic model for very mild dementia. NeuroImage 52, 234–244 (2010).
7. Davatzikos, C., Bhatt, P., Shaw, L. M., Batmanghelich, K. N. & Trojanowski, J. Q. Prediction of MCI to AD conversion, via MRI, CSF biomarkers, and pattern classification. Neurobiology of Aging 32, e19–e27 (2011).
8. Kang, W. et al. Multi-model and multi-slice ensemble learning architecture based on 2D convolutional neural networks for Alzheimer’s disease diagnosis. Computers in Biology and Medicine 136 (2021).
9. Li, F. et al. A robust deep model for improved classification of AD/MCI patients. IEEE Journal of Biomedical and Health Informatics 19, 1610–1616 (2015).
10. Pinaya, W. H. L. et al. Using normative modelling to detect disease progression in mild cognitive impairment and Alzheimer’s disease in a cross-sectional multi-cohort study. Scientific Reports 11 (2021).
11. Qiu, S. et al. Fusion of deep learning models of MRI scans, mini-mental state examination, and logical memory test enhances diagnosis of mild cognitive impairment. Alzheimer’s & Dementia: Diagnosis, Assessment & Disease Monitoring 10, 737–749 (2018).
12. Vemuri, P. et al. MRI and CSF biomarkers in normal, MCI, and AD subjects. Neurology 73, 287–293 (2009).
13. Bjerke, M. & Engelborghs, S. Cerebrospinal fluid biomarkers for early and differential Alzheimer’s disease diagnosis. Journal of Alzheimer’s Disease 62, 1199–1209 (2018).
14. Schindler, S. E. & Atri, A. The role of cerebrospinal fluid and other biomarker modalities in Alzheimer’s disease diagnostic revolution. Nature Aging 3, 460–462 (2023).
15. Liss, J. L. et al. Practical recommendations for timely, accurate diagnosis of symptomatic Alzheimer’s disease (MCI and dementia) in primary care: A review and synthesis. Journal of Internal Medicine 290, 310–334 (2021).
16. Leming, M. J. et al. Challenges of implementing computer-aided diagnostic models for neuroimages in a clinical setting. NPJ Digital Medicine 6, 129 (2023).
17. Mansfield, E., Noble, N., Sanson-Fisher, R., Mazza, D. & Bryant, J. Primary care physicians’ perceived barriers to optimal dementia care: A systematic review. The Gerontologist 59, 697–708 (2018).
18. Alzheimer’s Association. Alzheimer’s Association facts and figures. Alzheimer’s & Dementia, 391–460 (2020).
19. Creavin, S. T. et al. Clinical judgement by primary care physicians for the diagnosis of all-cause dementia or cognitive impairment in symptomatic people. Cochrane Database of Systematic Reviews 6 (2022).
20. Valcour, V. G., Masaki, K. H., Curb, D. & Blanchette, P. L. The detection of dementia in primary care setting. Archives of Internal Medicine 160, 2964–2968 (2000).
21. Kivipelto, M. et al. Risk score for the prediction of dementia risk in 20 years among middle aged people: A longitudinal, population-based study. The Lancet Neurology 5, 735–741 (2006).
22. Luck, T. et al. Risk factors for incident mild cognitive impairment - Results from the German study on ageing, cognition and dementia in primary care patients (AgeCoDe). Acta Psychiatrica Scandinavica 121, 260–272 (2010).
23. Anstey, K. J. et al. A self-report risk index to predict occurrence of dementia in three independent cohorts of older adults: The ANU-ADRI. PLoS One 9, e86141 (2014).
24. Capuano, A. W. et al. Derivation and validation of the rapid assessment of dementia risk (RADaR) for older adults. PLoS One 17, e0265379 (2022).
25. Barnes, D. E. et al. Development and validation of a brief dementia screening indicator for primary care. Alzheimer’s & Dementia 10, 656–665 (2014).
26. Kivimaki, M. et al. Estimating dementia risk using multifactorial prediction models. JAMA Network Open 6, e2318132 (2023).
27. Anderson, N. D. State of the science on mild cognitive impairment (MCI). CNS Spectrums 24 (2019).
28. Winblad, B. et al. Mild cognitive impairment - beyond controversies, towards a consensus: Report of the International Working Group on Mild Cognitive Impairment. Journal of Internal Medicine 256, 240–246 (2004).
29. Knopman, D. S., Boeve, B. F. & Petersen, R. C. Essentials of the proper diagnoses of mild cognitive impairment, dementia, and major subtypes of dementia. Mayo Clinic Proceedings 78, 1290–1308 (2003).
30. Jitsuishi, T. & Yamaguchi, A. Searching for optimal machine learning model to classify mild cognitive impairment (MCI) subtypes using multimodal MRI data. Scientific Reports 12 (2022).
31. Dyrba, M. et al. Predicting prodromal Alzheimer’s disease in subjects with mild cognitive impairment using machine learning classification of multimodal multicenter diffusion-tensor and magnetic resonance imaging data. Journal of Neuroimaging 25 (2015).
32. Bucholc, M., Titarenko, S., Ding, X., Canavan, C. & Chen, T. A hybrid machine learning approach for prediction of conversion from mild cognitive impairment to dementia. Expert Systems with Applications 217 (2023).
33. Mallo, S. C. et al. Neuropsychiatric symptoms as predictors of conversion from MCI to dementia: A machine learning approach. International Psychogeriatrics 32, 381–392 (2019).
34. Moradi, E. et al. Machine learning framework for early MRI-based Alzheimer’s conversion prediction in MCI subjects. NeuroImage 104, 398–412 (2015).
35. Zhang, T. et al. Predicting MCI to AD conversion using integrated sMRI and rs-fMRI: Machine learning and graph theory approach. Frontiers in Aging Neuroscience 13 (2021).
36. Amoroso, N. et al. Deep learning reveals Alzheimer’s disease onset in MCI subjects: Results from an international challenge. Journal of Neuroscience Methods 302, 3–9 (2018).
37. Wang, Z.-T. et al. Associations of the rate of change in Geriatric Depression Scale with amyloid and cerebral glucose metabolism in cognitively normal older adults: A longitudinal study. Journal of Affective Disorders 280, 77–84 (2021).
38. Van der Mussele, S. et al. Depression in mild cognitive impairment is associated with progression to Alzheimer’s disease: A longitudinal study. Journal of Alzheimer’s Disease 42, 1239–1250 (2014).
39. Lee, C. H., Kim, D. H. & Moon, Y. S. Differential associations between depression and cognitive function in MCI and AD: A cross-sectional study. International Psychogeriatrics 31 (2019).
40. Modrego, P. J. & Ferrandez, J. Depression in patients with mild cognitive impairment increases the risk of developing dementia of Alzheimer type: A prospective cohort study. Archives of Neurology 61, 1290–1293 (2004).
41. Steffens, D. C., McQuoid, D. R. & Potter, G. G. Amnesic mild cognitive impairment and incident dementia and Alzheimer’s disease in geriatric depression. International Psychogeriatrics 26, 2029–2036 (2014).
42. Di Iulio, F. et al. Occurrence of neuropsychiatric symptoms and psychiatric disorders in mild Alzheimer’s disease and mild cognitive impairment subtypes. International Psychogeriatrics 22, 629–640 (2010).
43. Palmer, K. et al. Predictors of progression from mild cognitive impairment to Alzheimer disease. Neurology 68 (2007).
44. Rosenberg, P. B. et al. The association of neuropsychiatric symptoms in MCI with incident dementia and Alzheimer disease. The American Journal of Geriatric Psychiatry 21, 685–695 (2013).
45. Sattler, C., Toro, P., Schonknecht, P. & Schroder, J. Cognitive activity, education, and socioeconomic status as preventive factors for mild cognitive impairment and Alzheimer’s disease. Psychiatry Research 196, 90–95 (2012).
46. Matyas, N. et al. Continuing education for the prevention of mild cognitive impairment and Alzheimer’s-type dementia: A systematic review and overview of systematic reviews. BMJ Open 9 (2019).
47. Wu, C.-Y. et al. Neutrophil activation in Alzheimer’s disease and mild cognitive impairment: A systematic review and meta-analysis of protein markers in blood and cerebrospinal fluid. Ageing Research Reviews 62 (2020).
48. Dong, Y. et al. Neutrophil hyperactivation correlates with Alzheimer’s disease progression. Annals of Neurology 83, 387–405 (2018).
49. Dong, X., Nao, J., Shi, J. & Zheng, D. Predictive value of routine peripheral blood biomarkers in Alzheimer’s disease. Frontiers in Aging Neuroscience 11 (2019).
50. Kalelioglu, T. et al. The neutrophil and platelet to lymphocyte ratios in people with subjective, mild cognitive impairment and early Alzheimer’s disease. European Psychiatry 41, 655–655 (2017).
51. Sayed, A. et al. The neutrophil-to-lymphocyte ratio in Alzheimer’s disease: Current understanding and potential applications. Journal of Neuroimmunology 349 (2020).
52. Huang, L.-T., Zhang, C.-P., Wang, Y.-B. & Wang, J.-H. Association of peripheral blood cell profile with Alzheimer’s disease: A meta-analysis. Frontiers in Aging Neuroscience 14 (2022).
53. Winchester, L. M., Powell, J., Lovestone, S. & Nevado-Holgado, A. J. Red blood cell indices and anemia as causative factors for cognitive function deficits and for Alzheimer’s disease. Genome Medicine 10 (2018).
54. Fokuoh, E. et al. Longitudinal analysis of APOE-ε4 genotype with the logical memory delayed recall score in Alzheimer’s disease. Journal of Genetics 100 (2021).
55. Warren, S. L., Moustafa, A. A., Alashwal, H. & Alzheimer’s Disease Neuroimaging Initiative. Harnessing forgetfulness: Can episodic-memory tests predict early Alzheimer’s disease? Experimental Brain Research 239, 2925–2937 (2021).
56. Chapman, K. R. et al. Mini Mental State Examination and Logical Memory scores for entry into Alzheimer’s disease trials. Alzheimer’s Research & Therapy 8 (2016).
57. Stephan, B. C. M. et al. The neuropathological profile of mild cognitive impairment (MCI): A systematic review. Molecular Psychiatry 17, 1056–1076 (2012).
58. Busse, A., Hensel, A., Guhne, U., Angermeyer, M. C. & Riedel-Heller, S. G. Mild cognitive impairment: Long-term course of four clinical subtypes. Neurology 67, 2176–2185 (2006).
59. Eliassen, C. F. et al. Biomarkers in subtypes of mild cognitive impairment and subjective cognitive decline. Brain and Behavior 7 (2017).
60. Michaud, T. L., Su, D., Siahpush, M. & Murman, D. L. The risk of incident mild cognitive impairment and progression to dementia considering mild cognitive impairment subtypes. Dementia and Geriatric Cognitive Disorders 7, 15–29 (2017).
61. Katabathula, S., David, P. B. & Xu, R. Comorbidity-driven multi-modal subtype analysis in mild cognitive impairment of Alzheimer’s disease. Alzheimer’s & Dementia 19, 1428–1439 (2023).
62. Yaffe, K., Petersen, R. C., Lindquist, K., Kramer, J. & Miller, B. Subtype of mild cognitive impairment and progression to dementia and death. Dementia and Geriatric Cognitive Disorders 22, 312–319 (2006).
63. Fischer, P. et al. Conversion from subtypes of mild cognitive impairment to Alzheimer dementia. Neurology 6, 288–291 (2007).
64. Gauthier, S. et al. Mild cognitive impairment. The Lancet 367, 1262–1270 (2006).
65. Mitchell, A. J., Meader, N. & Pentzek, M. Clinical recognition of dementia and cognitive impairment in primary care: A meta-analysis of physician accuracy. Acta Psychiatrica Scandinavica 124, 165–183 (2011).
66. Sabbagh, M. N. et al. Early detection of mild cognitive impairment (MCI) in primary care. The Journal of Prevention of Alzheimer’s Disease 7, 165–170 (2020).
67. Bradfield, N. I. Mild cognitive impairment: Diagnosis and subtypes. Clinical EEG and Neuroscience 54 (2021).
68. Liu, Y., Jun, H., Becker, A., Wallick, C. & Mattke, S. Detection rates of mild cognitive impairment in primary care for the United States Medicare population. The Journal of Prevention of Alzheimer’s Disease 11, 7–12 (2023).
69. Knopman, D. S. & Petersen, R. C. Mild cognitive impairment and mild dementia: A clinical perspective. Mayo Clinic Proceedings 89, 1452–1459 (2014).
70. Karikari, T. K. et al. Blood phosphorylated tau 181 as a biomarker for Alzheimer’s disease: A diagnostic performance and prediction modelling study using data from four prospective cohorts. The Lancet Neurology 19, 422–433 (2020).
71. Petersen, R. C. et al. Practice guideline update summary: Mild cognitive impairment: Report of the guideline development, dissemination, and implementation subcommittee of the American Academy of Neurology. Neurology 90, 126–135 (2018).
72. Petersen, R. C. et al. Alzheimer’s Disease Neuroimaging Initiative (ADNI): Clinical characterization. Neurology 74, 201–209 (2009).
73. Dorogush, A. V., Ershov, V. & Gulin, A. CatBoost: Gradient boosting with categorical features support. arXiv (2018).
74. Lemaitre, G., Nogueira, F. & Aridas, C. K. Imbalanced-learn: A Python toolbox to tackle the curse of imbalanced datasets in machine learning. Journal of Machine Learning Research 18, 1–5 (2017).
75. Akiba, T., Sano, S., Yanase, T., Ohta, T. & Koyama, M. Optuna: A next-generation hyperparameter optimization framework. arXiv (2019).
76. Roberts, M., Hazan, A., Dittmer, S., Rudd, J. H. F. & Schonlieb, C.-B. The curious case of the test set AUROC. Nature Machine Intelligence 6, 373–376 (2024).
77. Lundberg, S. M. & Lee, S. I. in 31st Conference on Neural Information Processing Systems (NIPS).

Posted February 06, 2025.

Subject Area

Neurology


[1] 1.↵
World Health Organization. Dementia. (2023).

[2] 2.↵
Chen, S. et al. The global macroeconomic burden of Alzheimer’s disease and other dementias: Estimates and projections for 152 countries or territories. The Lancet Global Health 12, E1534–E1543 (2024).
OpenUrl

[3] 3.↵
Porsteinsson, A. P., Isaacson, R. S., Knox, S., Sabbagh, M. N. & Rubino, I. Diagnosis of early Alzheimer’s disease: Clinical practice in 2021. The Journal of Prevention of Alzheimer’s Disease 8, 371–386 (2021).
OpenUrl

[4] 4.↵
Pellegrini, E. et al. Machine learning of neuroimaging for assisted diagnosis of cognitive impairment and dementia: A systematic review. Alzheimer’s & Dementia: Diagnosis, Assessment & Disease Monitoring 10, 519–535 (2018).
OpenUrl

[5] 5.
Javeed, A. et al. Machine learning for dementia prediction: A systematic review and future research directions. Journal of Medical Systems 47 (2023).

[6] 6.
Chen, R. & Herskovits, E. H. Machine-learning techniques for building a diagnostic model for very mild dementia. NeuroImage 52, 234–244 (2010).
OpenUrl CrossRef PubMed

[7] 7.
Davatzikos, C., Bhatt, P., Shaw, L. M., Batmanghelich, K. N. & Trojanowski, J. Q. Prediction of MCI to AD conversion, via MRI, CSF biomarkers, and pattern classification. Neurobiology of Aging 32, e19–e27 (2011).
OpenUrl

[8] 8.
Kang, W. et al. Multi-model and multi-slice ensemble learning architecture based on 2D convolutional neural networks for Alzheimer’s disease diagnosis. Computers in Biology and Medicine 136 (2021).

[9] 9.
Li, F. et al. A robust deep model for improved classification of AD/MCI patients. IEEE Journal of Biomedical and Health Informatics 19, 1610–1616 (2015).
OpenUrl

[10] 10.
Pinaya, W. H. L., et al. Using normative modelling to detect disease progression in mild cognitive impairment and Alzheimer’s disease in a cross-sectional mutli-cohort study. Scientific Reports 11 (2021).

[11] 11.
Qiu, S. et al. Fusion of deep learning models of MRI scans, mini-mental state examination, and logical memory test enhances diagnosis of mild cognitive impairment. Alzheimer’s & Dementia: Diagnosis, Assessment & Disease Monitoring 10, 737–749 (2018).
OpenUrl

[12] 12.↵
Vemuri, P. et al. MRI and CSF biomarkers in normal, MCI, and AD subjects. Neurology 73, 287–293 (2009).
OpenUrl CrossRef PubMed

[13] 13.↵
Bjerke, M. & Engelbourghs, S. Cerebrospinal fluid biomarkers for early and differential Alzheimer’s disease diagnosis. Journal of Alzheimer’s Disease 62, 1199–1209 (2018).
OpenUrl

[14] 14.↵
Schindler, S. E. & Atri, A. The role of cerebrospinal fluid and other biomarker modalities in Alzheimer’s disease diagnostic revolution. Nature Aging 3, 460–462 (2023).
OpenUrl PubMed

[15] 15.↵
Liss, J. L. et al. Practical recommendations for timely, accurate diagnosis of symptomatic Alzheimer’s disease (MCI and dementia) in primary care: A review and synthesis. Journal of Internal Medicine 290, 310–334 (2021).
OpenUrl CrossRef PubMed

[16] 16.
Leming, M. J. et al. Challenges of implementing computer-aided diagnostic models for neuroimages in a clinical setting NPJ Digital Medicine 6, 129 (2023).
OpenUrl PubMed

[17] 17.↵
Mansfield, E., Noble, N., Sanson-Fisher, R., Mazza, D. & Bryant, J. Primary care physicians’ perceived barriers to optimal dementia care: A systematic review. The Gerontologist 59, 697–708 (2018).
OpenUrl

[18] 18.↵
Alzheimer’s Association. Alzheimer’s Association facts and figures. Alzheimer’s & Dementia, 391–460 (2020).

[19] 19.↵
Creavin, S. T. et al. Clinical judgement by primary care physicians for the diagnosis of all-cause dementia or cognitive impairment in symptomatic people. Cochrane Database of Systematic Reviews 6 (2022).

[20] 20.↵
Valcour, V. G., Masaki, K. H., Curb, D. & Blanchette, P. L. The detection of dementia in primary care setting. Archives of Internal Medicine 160, 2964–2968 (2000).
OpenUrl CrossRef PubMed Web of Science

[21] 21.↵
Kivipelto, M. et al. Risk score for the prediction of dementia risk in 20 years among middle aged people: A longitudinal, population-based study. The Lancet Neurology 5, 735–741 (2006).
OpenUrl PubMed

[22] 22.↵
Luck, T. et al. Risk factors for incident mild cognitive impairment - Results from the German study on ageing, cognition and dementia in primary care patients (AgeCoDe). Acta Psychiatrica Scandinavica 121, 260–272 (2010).
OpenUrl CrossRef PubMed

[23] 23.↵
Anstey, K. J. et al. A self-report risk index to predict occurrence of dementia in three independent cohorts of older adults: The ANU-ADRI. PLoS One 9, e86141 (2014).
OpenUrl CrossRef PubMed

[24] 24.↵
Capuano, A. W. et al. Derivation and validation of the rapid assessment of dementai risk (RADaR) for older adults. PLoS One 17, e0265379 (2022).
OpenUrl CrossRef PubMed

[25] 25.↵
Barnes, D. E. et al. Development and validation of a brief dementia screening indicator for primary care. Alzheimer’s & Dementia 10, 656–665 (2014).
OpenUrl

[26] 26.↵
Kivimaki, M. et al. Estimating dementia risk using multifactorial prediction models. JAMA Network Open 6, e2318132 (2023).
OpenUrl

[27] 27.↵
Anderson, N. D. State of the science on mild cognitive impairment (MCI). CNS Spectrums 24 (2019).

[28] 28.↵
Winblad, B. et al. Mild cognitive impairment - beyond controversies, towards a consensus: Report of the International Working Group on Mild Cognitive Impairment. Journal of Internal Medicine 256, 240–246 (2004).
OpenUrl CrossRef PubMed Web of Science

[29] 29.↵
Knopman, D. S., Boeve, B. F. & Petersen, R. C. Essentials of the proper diagnoses of mild cognitive impairment, dementia, and major subtypes of dementia. Mayo Clinic Proceedings 78, 1290–1308 (2003).
OpenUrl CrossRef PubMed Web of Science

[30] 30.↵
Jitsuishi, T. & Yamaguchi, A. Searching for optimal machine learning model to classify mild cognitive impairment (MCI) subtypes using multimodal MRI data. Scientific Reports 12 (2022).

[31] 31.↵
Dyrba, M. et al. Predicting prodromal Alzheimer’s disease in subjects with mild cognitive impairment using machine learning classification of multimodal multicenter diffusion-tensor and magnetic resonance imaging data. Journal of Neuroimaging 25 (2015).

[32] 32.↵
Bucholc, M., Titarenko, S., Ding, X., Canavan, C. & Chen, T. A hybrid machine learning approach for prediction of conversion from mild cognitive impairment to dementia. Expert Systems with Applications 217 (2023).

[33] 33.↵
Mallo, S. C. et al. Neuropsychiatric symptoms as predictors of conversion from MCI to dementia: A machine learning approach. International Psychogeriatrics 32, 381–392 (2019).
OpenUrl

[34] 34.
Moradi, E. et al. Machine learning framework for early MRI-based Alzheimer’s conversion prediction in MCI subjects. NeuroImage 104, 398–412 (2015).
OpenUrl CrossRef PubMed

[35] 35.
Zhang, T. et al. Predicting MCI to AD coversion using integrated sMRI and rs-fMRI: Machine learning and graph theory approach. Frontiers in Aging Neuroscience 13 (2021).

[36] 36.↵
Amoroso, N. et al. Deep learning reveals Alzheimer’s disease onset in MCI subjects: Results from an international challenge. Journal of Neuroscience Methods 302, 3–9 (2018).
OpenUrl PubMed

[37] 37.↵
Wang, Z.-T. et al. Associations of the rate of change in Geriatric Depression Scale with amyloid and cerebral glucose metabolism in cognitively normal older adults: A longitudinal study. Journal of Affective Disorders 280, 77–84 (2021).
OpenUrl

[38] 38.
Van der Mussele, S. et al. Depression in mild cognitive impairment is associated with progression to Alzheimer’s disease: A longitudinal study. Journal of Alzheimer’s Disease 42, 1239–1250 (2014).
OpenUrl

[39] 39.
Lee, C. H., Kim, D. H. & Moon, Y. S. Differential associations between depression and cognitive function in MCI and AD: A cross-sectional study. International Psychogeriatrics 31 (2019).

[40] 40.
Modrego, P. J. & Ferrandez, J. Depression in patients with mild cognitive impairment increases the risk of developing dementia of Alzheimer type: A prospective cohort study. Archives of Neurology 61, 1290–1293 (2004).
OpenUrl CrossRef PubMed Web of Science

[41] 41.↵
Steffens, D. C., McQuoid, D. R. & Potter, G. G. Amnesic mild cognitive impairment and incident dementia and Alzheimer’s disease in geriatric depression. International Psychogeriatrics 26, 2029–2036 (2014).
OpenUrl PubMed

[42] 42.↵
Di Iulio, F. et al. Occurrence of neuropsychiatric sympoms and psychiatric disorders in mild Alzheimer’s disease and mild cognitive impairment subtypes. International Psychogeriatrics 22, 629–640 (2010).
OpenUrl CrossRef PubMed

[43] 43.
Palmer, K. et al. Predictors of progression from mild cognitive impairment to Alzheimer disease. Neurology 68 (2007).

[44] 44.↵
Rosenberg, P. B. et al. The association of neuropsychiatric symptoms in MCI with incident dementia and Alzheimer disease. The American Journal of Geriatric Psychiatry 21, 685–695 (2013).
OpenUrl CrossRef PubMed

[45] 45.↵
Sattler, C., Toro, P., Schonknecht, P. & Schroder, J. Cognitive activity, education, and socioeconomic status as preventive factors for mild cognitive impairment and Alzheimer’s disease. Psychiatry Research 196, 90–95 (2012).
OpenUrl CrossRef PubMed Web of Science

[46] 46.↵
Matyas, N. et al. Continuing education for the prevention of mild cognitive impairment and Alzheimer’s-type dementia: A systematic review and overview of systematic reviews. BMJ Open 9 (2019).

[47] 47.↵
Wu, C.-Y. et al. Neutrophil activation in Alzheimer’s disease and mild cognitive impairment: A systematic review and meta-analysis of protein markers in blood and cerebrospinal fluid. Ageing Research Reviews 62 (2020).

[48] 48.
Dong, Y. et al. Neutrophil hyperactivation correlates with Alzheimer’s disease progression. Annals of Neurology 83, 387–405 (2018).
OpenUrl CrossRef PubMed

[49] 49.
Dong, X., Nao, J., Shi, J. & Zheng, D. Predictive value of routine peripheral blood biomarkers in Alzheimer’s disease. Frontiers in Aging Neuroscience 11 (2019).

[50] 50.
Kalelioglu, T. et al. The neutrophil and platelet to lymphocyte ratios in people with subjective, mild cognitive impairment and early Alzheimer’s disaese. European Psychiatry 41, 655–655 (2017).
OpenUrl

[51] 51.
Sayed, A. et al. The neutrophil-to-lymphocyte ratio in Alzheimer’s disease: Current understanding and potential applications. Journal of Neuroimmunology 349 (2020).

[52] 52.↵
Huang, L.-T., Zhang, C.-P., Wang, Y.-B. & Wang, J.-H. Association of peripheral blood cell profile with Alzheimer’s disease: A meta-analysis. Frontiers in Aging Neuroscience 14 (2022).

[53] 53.↵
Winchester, L. M., Powell, J., Lovestone, S. & Nevado-Holgado, A. J. Red blood cell indices and anemia as causative factors for cognitive function deficits and for Alzheimer’s disease. Genome Medicine 10 (2018).

[54] 54.↵
Fokuoh, E. et al. Longitudinal analysis of APOE-ε4 genotype with the logical memory delayed recall score in Alzheimer’s disease. Journal of Genetics 100 (2021).

[55] 55.↵
Warren, S. L., Moustafa, A. A., Alashwal, H. & Alzheimer’s Disease Neuroimaging Initiative. Harnessing forgetfulness: Can episodic-memory tests predict early Alzheimer’s disease? Experimental Brain Research 239, 2925–2937 (2021).
OpenUrl PubMed

[56] 56.↵
Chapman, K. R. et al. Mini Mental State Examination and Logical Memory scores for entry into Alzheimer’s disease trials. Alzheimer’s Research & Therapy 8 (2016).

[57] 57.↵
Stephan, B. C. M. et al. The neuropathological profile of mild cognitive impairment (MCI): A systematic review. Molecular Psychiatry 17, 1056–1076 (2012).
OpenUrl CrossRef PubMed Web of Science

[58] 58.↵
Busse, A., Hensel, A., Guhne, U., Angermeyer, M. C. & Riedel-Heller, S. G. Mild cognitive impairment: Long-term course of four clinical subtypes. Neurology 67, 2176–2185 (2006).
OpenUrl CrossRef PubMed

[59] 59.↵
Eliassen, C. F. et al. Biomarkers in subtypes of mild cognitive impairment and subjective cognitive decline. Brain and Behavior 7 (2017).

[60] 60.↵
Michaud, T. L., Su, D., Siahpush, M. & Murman, D. L. The risk of incident mild cognitive impairment and progression to dementia considering mild cognitive impairment subtypes. Dementia and Geriatric Cognitive Disorders 7, 15–29 (2017).
OpenUrl

[61] 61.↵
Katabathula, S., David, P. B. & Xu, R. Comorbidity-driven multi-modal subtype analysis in mild cognitive impairment of Alzheimer’s disease. Alzheimer’s & Dementia 19, 1428–1439 (2023).
OpenUrl

[62] 62.↵
Yaffe, K., Petersen, R. C., Lindquist, K., Kramer, J. & Miller, B. Subtype of mild cognitive impairment and progression to dementia and death. Dementia and Geriatric Cognitive Disorders 22, 312–319 (2006).
OpenUrl CrossRef PubMed Web of Science

[63] 63.
Fischer, P. et al. Conversion from subtypes of mild cognitive impairment to Alzheimer dementia. Neurology 6, 288–291 (2007).
OpenUrl PubMed

[64] 64.↵
Gauthier, S. et al. Mild cognitive impairment. The Lancet 367, 1262–1270 (2006).
OpenUrl

[65] 65.↵
Mitchell, A. J., Meader, N. & Pentzek, M. Clinical recognition of dementia and cognitive impairment in primary care: A meta-analysis of physician accuracy. Acta Psychiatrica Scandinavica 124, 165–183 (2011).
OpenUrl CrossRef PubMed

[66]
Sabbagh, M. N. et al. Early detection of mild cognitive impairment (MCI) in primary care. The Journal of Prevention of Alzheimer’s Disease 7, 165–170 (2020).

[67]
Bradfield, N. I. Mild cognitive impairment: Diagnosis and subtypes. Clinical EEG and Neuroscience 54 (2021).

[68]
Liu, Y., Jun, H., Becker, A., Wallick, C. & Mattke, S. Detection rates of mild cognitive impairment in primary care for the United States Medicare population. The Journal of Prevention of Alzheimer’s Disease 11, 7–12 (2023).

[69]
Knopman, D. S. & Petersen, R. C. Mild cognitive impairment and mild dementia: A clinical perspective. Mayo Clinic Proceedings 89, 1452–1459 (2014).

[70]
Karikari, T. K. et al. Blood phosphorylated tau 181 as a biomarker for Alzheimer’s disease: A diagnostic performance and prediction modelling study using data from four prospective cohorts. The Lancet Neurology 19, 422–433 (2020).

[71]
Petersen, R. C. et al. Practice guideline update summary: Mild cognitive impairment: Report of the guideline development, dissemination, and implementation subcommittee of the American Academy of Neurology. Neurology 90, 126–135 (2018).

[72]
Petersen, R. C. et al. Alzheimer’s Disease Neuroimaging Initiative (ADNI): Clinical characterization. Neurology 74, 201–209 (2009).

[73]
Dorogush, A. V., Ershov, V. & Gulin, A. CatBoost: Gradient boosting with categorical features support. arXiv (2018).

[74]
Lemaitre, G., Nogueira, F. & Aridas, C. K. Imbalanced-learn: A Python toolbox to tackle the curse of imbalanced datasets in machine learning. Journal of Machine Learning Research 18, 1–5 (2017).

[75]
Akiba, T., Sano, S., Yanase, T., Ohta, T. & Koyama, M. Optuna: A next-generation hyperparameter optimization framework. arXiv (2019).

[76]
Roberts, M., Hazan, A., Dittmer, S., Rudd, J. H. F. & Schonlieb, C.-B. The curious case of the test set AUROC. Nature Machine Intelligence 6, 373–376 (2024).

[77]
Lundberg, S. M. & Lee, S. I. in 31st Conference on Neural Information Processing Systems (NIPS).
