Stricker Learning Span criterion validity: a remote self-administered multi-device compatible digital word list memory measure shows similar ability to differentiate amyloid and tau PET-defined biomarker groups as in-person Auditory Verbal Learning Test

Nikki H. Stricker; John L. Stricker; Ryan D. Frank; Winnie Z. Fan; Teresa J. Christianson; Jay S. Patel; Aimee J. Karstens; Walter K. Kremers; Mary M. Machulda; Julie A. Fields; Jonathan Graff-Radford; Clifford R. Jack; David S. Knopman; Michelle M. Mielke; Ronald C. Petersen

doi:10.1101/2022.10.26.22281534

Abstract

Objective The Stricker Learning Span (SLS) is a computer-adaptive digital word list memory test specifically designed for remote assessment and self-administration on a web-based multi-device platform (Mayo Test Drive). We aimed to establish criterion validity of the SLS by comparing its ability to differentiate biomarker-defined groups to the person-administered Rey’s Auditory Verbal Learning Test (AVLT).

Participants and Methods Participants (N=353; mean age=71, SD=11; 93% cognitively unimpaired [CU]) completed the AVLT during an in-person visit, the SLS remotely (within 3 months) and had brain amyloid and tau PET scans available (within 3 years). Overlapping groups were formed for 1) those on the Alzheimer’s disease (AD) continuum (amyloid PET positive, A+, n=125) or not (A-, n=228), and those with biological AD (amyloid and tau PET positive, A+T+, n=55) vs no evidence of AD pathology (A-T-, n=195). Analyses were repeated among CU participants only.

Results The SLS and AVLT showed similar ability to differentiate biomarker-defined groups when comparing AUROCs (p’s>.05). In logistic regression models, SLS contributed significantly to predicting biomarker group beyond age, education and sex, including when limited to CU participants. Medium (A- vs A+) to large (A-T- vs A+T+) unadjusted effect sizes were observed for both SLS and AVLT. Learning and delay variables were similar in terms of ability to separate biomarker groups.

Conclusions Remotely administered SLS performed similarly to in-person-administered AVLT in its ability to separate biomarker-defined groups, providing evidence of criterion validity. Results suggest the SLS may be sensitive to detecting subtle objective cognitive decline in preclinical AD.

1. Introduction

The need for efficient and scalable approaches for identifying individuals at risk for preclinical and prodromal Alzheimer’s disease (PAD) is paramount to ongoing clinical trial efforts, emerging decentralized trials, and for identifying individuals who will most benefit from currently available pharmacologic or behavioral treatments, or those on the horizon^1,2. Self-administered cognitive measures that can be completed “remotely” (i.e., outside of a typical clinical setting, including at home) are a critical component of an early PAD detection strategy^3,4. Frequently, tests originally designed for and validated within clinic settings are converted to remote use to increase access for those unable to readily visit research centers or due to necessity during the COVID-pandemic^5-7. The limitation of this “conversion” approach is that tests are not developed specifically with remote self-administration as a priority for test design decisions, which can contribute to mixed findings when performance is compared across settings^8-10. There is an urgent need for valid self-administered cognitive assessment tools designed specifically for remote use. Verbal memory measures are among the most sensitive to early changes in the Alzheimer’s disease (AD) process¹¹ but are also challenging to adapt to remote, self-administered methods⁵.

The Stricker Learning Span (SLS) is a digital computer-adaptive word list memory test specifically designed for remote assessment¹². The SLS is administered via a web-based multi-device platform designed for unsupervised self-administration of digital cognitive tests, Mayo Test Drive (MTD): test Development through Rapid Iteration, Validation and Expansion (DRIVE)¹³. Recent work has highlighted learning as a key deficit in PAD, conceptualized as a failure to benefit from repeated exposure¹⁴ or a lack of benefit from practice^15,16. In line with this, SLS test design emphasizes learning. The SLS paradigm was influenced by cognitive science principles and neural network process simulations¹². The SLS stresses the contextual system during learning through use of high frequency word stimuli and variations in word item-level imagery to increase difficulty.

Preliminary support for the feasibility, reliability and validity of the SLS was previously reported in an all-female older adult sample¹³. Whereas that prior study used traditional approaches to test validation, the current study aimed to establish test validity using a novel approach to avoid the inherent issues with existing validation approaches. For example, one common approach is to correlate a new test with existing cognitive tests. However, existing tests, while well established, are themselves imperfect measures of hypothesized underlying constructs¹⁷. Another frequent approach is to establish validity by examining the ability of a new test to differentiate clinically defined groups. In the AD field, for example, it is common to establish the clinical validity of a new test by comparing individuals who are cognitively “normal” or unimpaired to individuals with mild cognitive impairment (MCI) or dementia; however, this introduces circularity because the use of cognitive tests is central to establishing those syndromal classifications. In vivo biomarkers offer an alternative ground truth for test validation studies that is completely independent of cognitive test performance. This is akin to validation with neuropathological diagnosis at autopsy given the correspondence between antemortem PET imaging and autopsy findings but has the notable benefit of being feasible during life^18,19. A research framework is now available to use available AD biomarkers to characterize participants using the amyloid (A), tau (T) and neurodegeneration (N), or the AT(N), system²⁰. Imaging biomarkers of N are considered non-specific to AD and will not be included in the current manuscript to limit the number of subgroups. Individuals with evidence of elevated amyloid (A+) are considered to show Alzheimer’s pathologic change. An in vivo biological diagnosis of AD is defined by the presence of both A+ and elevated tau (T+).

The objective of this study was to determine the criterion validity of the SLS. Critically, this validation study was limited to unsupervised completion of the SLS in a remote environment outside of a typical clinical research setting. Our primary study hypothesis (Aim 1) was that remotely administered SLS and in-person-administered Rey’s Auditory Verbal Learning Test (AVLT) would differentiate AD biomarker-defined groups similarly. This hypothesis is tested in groups defined by biomarker status alone (A+ vs A- and A+T+ vs A-T-) to avoid circularity. Secondary hypotheses included that the SLS would be sensitive to preclinical AD in analyses limited to CU participants (Aim 2), SLS and AVLT would show significant correlations to support convergent validity (Aim 3), and that word list learning vs. delay indices would show similar sensitivity to biologically defined AD (A-T- vs A+T+; Aim 4).

2. Methods

Most participants were recruited from the Mayo Clinic Study of Aging (MCSA), a longitudinal population-based study of aging among Olmsted County, Minnesota, residents. Participants are randomly sampled by age- and sex-stratified groups using the resources of the Rochester Epidemiology Project medical records-linkage system, which links the medical records from all county providers²¹. All participants are without dementia at MCSA enrollment. Exclusion criteria include terminal illness or hospice. Participants complete study visits every 15 months that include a physician exam, study coordinator interview, and neuropsychological testing²². The physician exam includes a medical history review, complete neurological exam, and the Short Test of Mental Status (STMS)²³. The study coordinator interview with an informant includes the Clinical Dementia Rating® scale²⁴. Participants complete multi-domain battery of nine neuropsychological tests administered by a psychometrist²². The interviewing study coordinator, examining physician, and neuropsychologist initially each make an independent diagnostic determination. A final diagnosis of cognitively unimpaired, MCI²⁵ or dementia²⁶ is then achieved through consensus agreement^22,25. The diagnostic evaluation does not consider prior clinical information, prior diagnoses, SLS performance, or knowledge of biomarkers.

To enrich the sample for participants with cognitive impairment, additional participants were recruited from the Mayo Alzheimer’s Disease Research Center (ADRC).

The study protocols were approved by the Mayo Clinic and Olmsted Medical Center Institutional Review Boards. All participants provided informed consent.

In Vivo Neuroimaging Markers of Amyloid and Tau

The most recent imaging available ±3 years of baseline SLS was used. Amyloid and tau positivity is determined using Pittsburgh Compound B PET (PiB-PET) and tau PET (flortaucipir)^27-29. PET images are acquired using a GE Discovery RX or DXT PET/CT scanner. A global cortical PiB PET standard uptake value ratio (SUVR) is computed by calculating the median uptake over voxels in the prefrontal, orbitofrontal, parietal, temporal, anterior cingulate, and posterior cingulate/precuneus regions of interest (ROIs) for each participant and dividing this by the median uptake over voxels in the cerebellar crus gray matter; for tau PET we utilize median uptake over the voxels in the meta regions consisting of entorhinal, amygdala, parahippocampal, fusiform, inferior temporal, and middle temporal ROIs normalized to the cerebellar crus gray matter²⁸. Cutoffs to determine amyloid and tau positivity are SUVR ≥ 1.48 (centiloid 22)³⁰ and ≥ 1.25²⁸ respectively, to maintain consistency with our past Cogstate-focused work^31-33.

Person-administered AVLT completed in clinic

The psychometrist reads a list of 15 words (List A) aloud and asks the participant to repeat back as many words as they can recall. This is repeated 5 times (learning trials 1-5 total). A distractor list (List B) is presented, followed by short-delay (Trial 6) of list A words. Recall of List A is again tested after 30-minutes (30-minute delay), followed by written recognition^34,35. The primary variable for this study is sum of trials (trials 1–5 total + trial 6 + 30-min recall; range 0-105), which is sensitive to early changes in memory³⁶. Additional variables include correct words on trials 1-5 total and trial 5 (thought to reflect learning), as well as 30-minute delay. Long-term percentage retention (AVLT 30-minute delay / Trial 5), thought to reflect delayed memory or storage/savings, is also reported.

Self-administered SLS completed remotely (not in clinic)

All participants completed the SLS remotely and without supervision or assistance. Participants followed a link provided in an email to complete the test session. The SLS is administered via the Mayo Test Drive (MTD) platform¹³.

The SLS is a 5-trial adaptive list learning task. Single words are visually presented sequentially during learning trials. After each list presentation, memory for the word list is tested with 4-choice recognition. Following a computer adaptive testing approach, the SLS starts with 8 items and then the number of words either stays the same, increases by 5 or decreases by 2 according to pre-specified rules based on percentage of correct responses to extend the floor and ceiling relative to traditional word list memory tests (range 2-23 words; Figure 1). Short delay follows the Symbols Test^13,37,38; all items presented on any learning trial are tested during delay (range 8-23).

Figure 1.

Stricker Learning Span (SLS) computer adaptive testing approach provides an expanded ceiling and floor relative to traditional word list memory tests. Figure used with permission of Mayo Foundation for Medical Education and Research; all rights reserved.

The SLS uses common, high frequency words that are easier to recall, but harder to recognize³⁹, as previously described¹². There are 23 item bins with 4 words each, and words within a bin have similar imagery ratings⁴⁰. Each successive item bin has lower imagery ratings, thus increasing the difficulty of subsequent items. Most (90%) of the 92 total words used are on the Dolch sight words reading list (preschool through Grade 3), with half at the preschool level⁴¹. Each test session randomly selects 1 word from each item bin as the “target,” the 3 others serve as the foils and a 23-item target word list is generated, even if not all items are presented due to the adaptive procedure. To reduce recency effects, the order of item presentation is randomized for each trial and the last item presented is never the first tested. The primary variable is sum of trials (total correct across learning trials 1-5 plus delay, range 0-108). Secondary variables include maximum (max) learning span across any trial (range 0-23), total correct across learning trials (1-5 total, range 0-85), and total correct short delay (range 0-23). Percent retention (max span / delay) is also reported to allow within-test comparisons, but this measure is not meant to be compared to AVLT percent retention as differences are expected based on differences in test design.

Inclusion Criteria

To be included in this study, participants had to have both SLS sum of trials and AVLT sum of trials available and an amyloid PET scan within 3 years. Participants who completed the SLS as of 7/7/22 were included in this manuscript. Available linked data as of 8/22/22 were included.

Statistical methods

Demographics and clinical characteristics were descriptively summarized using counts and percentages for categorical data and means and standard deviations for continuous data. Data distributions across groups (A- vs A+ and A-T- vs A+T+) were compared using chi-square tests for categorical variables and linear model ANOVA tests for continuous variables. Pearson correlation coefficients were used to characterize the linear relationship between AVLT and SLS variables. Unadjusted and adjusted Hedge’s G with weighted and pooled standard deviation was used to assess effect size for group comparisons. Unadjusted and adjusted logistic regression models were used to determine the predictive accuracy of AVLT and SLS sum of trials in predicting abnormal amyloid PET (A+ vs A-) and abnormal amyloid and tau PET (A+T+ vs A-T-). To formally compare the ability of both tests to differentiate biomarker-defined groups, the AUROC from models with AVLT were directly compared to models with SLS⁴². For models that adjusted for demographic variables, age, sex, and education were the adjustment terms for both Hedge’s G and logistic regression models. A 2-sided p-value <0.05 was considered statistically significant. All analyses were performed using R version 4.1.2.

3. Results

3.1 Participant Characteristics and Convergent Validity

The mean age of the 353 participants was 71.8 (SD=10.8) years, mean education was 15.7 (SD=2.4), 53.5% were male, 98.0% were White and 92.6% were cognitively unimpaired (Table 1). MTD remote testing was completed within half a month (mean) of the in-person visit. Participant characteristics by biomarker subgroups are reported in Table 2. As expected, based on known increases in A+ and T+ rates with increased age^27-29, biomarker positive groups were older than biomarker negative groups (p’s < .05). Biomarker positive and negative groups showed similar years of education and sex distribution (p’s > .05).

View this table:

Table 1.

Participant demographic, clinical and neuroimaging characteristics and devices used for Mayo Test Drive (mean, SD except where otherwise noted) for all participants (N=353).

View this table:

Table 2.

Demographic and clinical characteristics of biomarker subgroups and means (SDs) for cognitive variables.

SLS sum of trials and AVLT sum of trials were strongly correlated (r = 0.62, p < .001). Additional correlations are reported in Table 3 and Supplemental Figure 1.

View this table:

Table 3.

Convergent Validity: Pearson Correlations (r) between remotely administered Stricker Learning Span measures and in-person administered Auditory Verbal Learning Test measures for all participants. Table used with permission of Mayo Foundation for Medical Education and Research; all rights reserved.

3.2 The SLS shows similar ability to differentiate PET-defined biomarker groups compared to the AVLT (Aim 1, all participants)

3.2.1 AUROC comparisons

Total AUROC values for SLS sum of trials vs. AVLT sum of trials were similar for differentiating biomarker groups (p’s > .05; Table 4, Figure 2). This similarity was seen for all pairwise AUROC comparisons (adjusted and unadjusted models; A- vs A+ and A-T- vs A+T+).

View this table:

Table 4.

Logistic regression analysis predicting a) A+ vs. A- and b) A+T+ vs A-T- for both unadjusted models (cognition as the only predictor, on left) and models that also include age, sex, education, in addition to cognition (on right). The p-value comparing the AUROCs of the remotely administered Stricker Learning Span and in-person administered Auditory Verbal Learning Test provides a direct comparison of test performance for differentiating biomarker-defined groups.

Figure 2.

AUROC (95% CI) values for remotely-administered SLS sum of trials and in-person administered AVLT sum of trials.

Note. All models significantly differentiate biomarker groups better than chance (no AUROC confidence intervals include 0.5). Stricker Learning Span (SLS) sum of trials = 1-5 total correct + delay. Auditory Verbal Learning Test (AVLT) sum of trials = 1-5 total + Trial 6 + 30-minute delay recall. Unadj. = unadjusted models. Adj. = model adjusts for age, education and sex. Figure used with permission of Mayo Foundation for Medical Education and Research; all rights reserved.

3.2.1a Amyloid Groups

Unadjusted models using only the primary cognitive variable as the predictor show that both the SLS and AVLT significantly differentiate A- vs A+ (AUROCs of 0.63 and 0.64, respectively). Adjusted models that include demographic variables increase the overall AUROC values of the full model (both AUROCs=0.76), and both the SLS and AVLT significantly improve biomarker group prediction over and above the demographic variables.

3.2.1b Amyloid and Tau Groups

Unadjusted models using only the primary cognitive variable as the predictor show that both the SLS and AVLT significantly differentiate individuals without AD biomarkers (A-T-) from those with biological AD (A+T+; AUROCs of 0.72-0.73). Adjusted models that include demographic variables increase the overall AUROC values of the full model (AUROCs of 0.83-0.84), and both the SLS and AVLT significantly improve biomarker group prediction over and above the demographic variables.

3.2.2 Descriptive effect sizes from group difference analyses (Table 5)

3.2.2a Amyloid Groups

Unadjusted analyses showed medium group effect sizes for both SLS sum of trials (g = -0.51) and AVLT sum of trials (g = -0.55) for differentiating A- and A+ groups (Figure 3). Effect sizes after adjusting for demographics were decreased and small in magnitude (g’s = -0.26 to -0.27), and group differences remained significant.

View this table:

Table 5.

Unadjusted and adjusted Hedge’s g effect sizes (95% CI) between biomarker subgroup for all variables.

Figure 3.

Hedge’s g effect sizes with confidence intervals to show the magnitude of group differences for remotely administered Stricker Learning Span (SLS, red shades) and in-person Auditory Verbal Learning Test (AVLT) measures (blue shades) across biomarker groups (A- vs A+ top; A-T- vs A+T+ bottom) in all participants (left) and Cognitively Unimpaired participants (right).

Note. Groups do not significantly differ when the CI includes 0 (dashed line). Unadj. = unadjusted models. Adj. = model adjusts for age, education and sex. Remote SLS Sum of Trials = SLS trials 1-5 correct + delay correct. In-person AVLT Sum of Trials = AVLT 1-5 correct + Trial 6 (short-delay) correct + 30-minute recall correct). See Tables 4 and 5 for direction of effect sizes and confidence intervals. Figure used with permission of Mayo Foundation for Medical Education and Research; all rights reserved.

3.2.2b Amyloid and Tau Groups

Unadjusted analyses showed large group difference effect sizes for SLS sum of trials and AVLT sum of trials (both g’s = -0.88) for differentiating A- T- and A+T+ groups (Figure 3). Effect sizes after adjusting for demographics were decreased but remained moderate in magnitude and significant for both SLS sum of trials (g = -0.53) and AVLT sum of trials (g = -0.51).

3.3 The SLS shows similar ability to differentiate PET-defined biomarker groups compared to the AVLT for preclinical AD (CU participants only; Aim 2)

Because the majority (93%) of participants in the overall sample are CU, limiting analyses only to CU participants had relatively little impact on the results. Findings show that the SLS is sensitive to preclinical AD, consistent with our Aim 2 hypothesis.

3.3.1 AUROC comparisons

Total AUROC values for SLS sum of trials vs. AVLT sum of trials were similar for differentiating biomarker groups in CU participants (p’s > .05 for each pairwise comparison; Table 4).

3.3.1a Amyloid Groups

Unadjusted models using only the primary cognitive variable as the predictor show that both the SLS and AVLT significantly differentiate A- vs A+ (both AUROCs = 0.63). Adjusted models that include demographic variables increase the overall AUROC values of the full model (AUROCs=0.76-0.77), and both the SLS and AVLT significantly improve biomarker group prediction over and above the demographic variables.

3.3.1b Amyloid and Tau Groups

Adjusted models that include demographic variables increase the overall AUROC values of the full model (AUROCs = 0.81-0.83). The SLS significantly improved biomarker group prediction over and above the demographic variables; the AVLT approached significance (p=0.06).

3.3.2 Descriptive effect sizes from group difference analyses (Table 5)

3.3.2a Amyloid Groups

Unadjusted analyses showed small to medium group effect sizes for both SLS sum of trials (g = -0.47) and AVLT sum of trials (g = -0.48) for differentiating A- and A+ groups (Figure 3). Effect sizes after adjusting for demographics were decreased and small in magnitude (g’s = -0.18 to -0.22), and group differences did not reach significance.

3.3.2b Amyloid and Tau Groups

Unadjusted analyses showed medium group difference effect sizes for SLS sum of trials (g = -0.74) and AVLT sum of trials (g = -0.62) for differentiating A-T- and A+T+ groups (Figure 3). Effect sizes after adjusting for demographics were decreased (small in magnitude) with SLS sum of trials remaining significant (g = -0.37) and AVLT sum of trials no longer reaching significance (g = -0.25).

3.4 Learning measures show similar ability as delay memory measures to differentiate A-T- and A+T+ groups for both the SLS and the AVLT

3.4.1 All participants

All secondary SLS and AVLT variables show results in the expected direction with lower performance in the A+T+ compared to A-T-group (p’s < .05 for both unadjusted and adjusted analyses), with generally similar effect sizes across comparable SLS and AVLT variable pairs (Table 5). Within-test descriptive comparisons show that learning variables (1-5 total) show effect sizes that are similar in magnitude as delay variables. For example, SLS 1-5 (g = -.86) and delay (g = -.88) both show large unadjusted effect sizes, and AVLT 1-5 (g = -.82) and delay (g = -.86) also show large unadjusted effect sizes (Figure 4). Trials 1-5 total (g=-.86 SLS and -.82 AVLT) may be a slightly more advantageous learning measure for group discrimination relative to SLS max span (−.81) or AVLT Trial 5 (−.74). Similarly, comparison of two different types of delayed memory measures suggests a slight advantage for delay total correct relative to retention (−.88 vs -.55 for SLS, respectively and -.86 vs -.79 for AVLT, respectively). See Figure 5 for a visualization of trial-by-trial data for both the SLS and AVLT. See Supplemental Tables 1 and 2 for results of other memory tests.

Figure 4.

Hedge’s g unadjusted effect sizes comparing ability of learning and delay trials to differentiate individuals without AD biomarkers from those with biological AD (A-T- vs A+T+).

Note. Stricker Learning Span (SLS) Trials 1-5 = trials 1-5 total correct; Auditory Verbal Learning Test (AVLT) Trials 1-5 = 1-5 total correct; AVLT Delay = 30-minute delayed recall. Figure used with permission of Mayo Foundation for Medical Education and Research; all rights reserved.

Figure 5.

Stricker Learning Span (SLS, left panel) and Auditory Verbal Learning Test (AVLT, right panel) learning slopes and delayed memory performances across all participants without AD biomarkers (A-T-, blue) and all participants with biological AD (A+T+, red).

Note. Sum of trials is the primary variable for each test; these figures show the data comprising sum of trials for each measure (sum of trials = total correct items for all trials displayed in each respective figure). SLS items presented differ by trial based on computer adaptive testing rules, thus the highest possible number of presented across trials vary (see Figure 1 for ceiling and floor values for Trials 1-5; the delay trial can range from 8-23). The SLS uses 4-choice recognition to test item memory. In contrast, 15 words are presented each time for AVLT trials (constant range of 0-15) and item memory is tested with free recall. See Supplemental Table 2 for numeric results. AVLT Trial 6 = short-delay. AVLT delay = 30-minute delay. Figure used with permission of Mayo Foundation for Medical Education and Research; all rights reserved.

Data Availability

The data that support the findings of this study are available from the corresponding author upon reasonable request and with appropriate institutional approvals.

4. Discussion

This study followed a novel approach to test validation and established the criterion validity of an unsupervised computer adaptive word list memory test (SLS) completed outside of a clinic setting. The SLS differentiates AD biomarker-defined groups as well as a traditional word list recall test (AVLT) administered by trained psychometrists in a clinic setting. Specifically, our Aim 1 hypothesis was supported by AUROC comparisons that showed remotely administered SLS sum of trials and in-person-administered AVLT sum of trials have comparable ability to differentiate individuals on the Alzheimer’s continuum (A+) or not (A-) and individuals meeting a research framework for a biological diagnosis of AD (A+T+) or not (A-T-) in a predominantly cognitively unimpaired sample.

In line with our prior results showing that the AVLT has the potential to be useful for detecting subtle objective cognitive decline in preclinical AD ³³, our current results extend this prior AVLT finding and suggest that the SLS also has promise in this regard (Aim 2). Specifically, when limiting the sample to CU participants, our AUROC results show that the SLS by itself could help predict, better than chance, which individuals had elevated brain amyloid vs. did not and which had elevated brain amyloid and tau vs did not. In contrast, our prior work examining the utility of the Learning/Working Memory index comprised of visual recognition and working memory tasks from the Cogstate Brief Battery administered in clinic did not significantly differentiate biomarker groups better than chance (CU A-T- vs CU A+T+ or CU A- T- vs CU A+T-) and showed that the AVLT was significantly better than the Learning/Working Memory Index for differentiating CU A-T- vs CU A+T- when comparing total AUROCs. While the predictive ability of the SLS by itself is relatively modest, predictive ability improves when demographic variables are added to the model and the SLS continues to show an independent effect over and above demographic variables. For example, a model with age, sex, education and SLS sum of trials together had an AUROC of 0.83 for predicting A+T+ status in CU participants. Thus, our current results suggest that the SLS could be a scalable, easily accessible addition to a multivariable model approach that improves overall prediction of AD risk. Given its capacity for remote self-administration, the SLS would be a good candidate screening measure in non-specialty care settings to use in combination with plasma biomarkers to inform the need for further work-up.

The highly overlapping results for the ability of the SLS and AVLT to discriminate AD biomarker groups are particularly interesting given that although the SLS was designed to mimic the sensitivity of the AVLT, it was not designed to be a one-to-one adaptation of the AVLT. Results of correlation analyses align with this intent and support our hypothesis that the SLS and AVLT would show a significant correlation (r = 0.62 observed), and therefore further support convergent validity (Aim 3). Our initial pilot study similarly supported the convergent validity of the SLS, but slightly lower correlation coefficients were observed in that study, likely due to homogeneity of that sample (e.g., all female, restricted age range, excluded individuals with dementia), and potentially due to a longer duration between in-person and remote testing (average 10 months)¹³. The current results are particularly notable given that an exact computerized replication of the AVLT facilitated by audio recording and speech recognition to allow self-administered on an iPad showed only slightly higher correlations (r=0.63-0.70) with the AVLT as typically administered in a well-controlled cross-over design completed in a clinic setting⁴³.

We also qualitatively compared commonly derived supraspan wordlist indices within each test given that the learning indices may have equivalent utility for the early detection of AD as delayed memory indices^44,45. Results supported our hypothesis that word list learning measures would show similar sensitivity as word list delay memory measures to biologically defined AD (A-T- vs A+T+; Aim 4). The effect sizes of learning trials and delayed memory are similar (Figure 4). We chose to focus on delay items correct instead of percent retention (i.e., savings) for the comparison to learning indices. It is important to note that retention is largely dependent on specific test design characteristics including the influence of serial position effects^46-48. The SLS randomizes word order to minimize the recency effect often observed in individuals with Alzheimer’s dementia that leads to an over-estimate of true forgetting⁴⁹. For example, learning to criterion studies demonstrate that individuals in the early stages of AD take a longer time to reach criterion, reflective of lower learning ability, whereas once learning is equated by reaching criterion rates of forgetting are similar to healthy control participants^48,50,51. Accordingly, SLS learning indices (1-5, max span) demonstrate a larger effect size than SLS retention in biologically defined AD versus those without AD biomarkers, whereas AVLT retention effect size is more similar to AVLT learning indices.

This study has several strengths. Most studies examining the ability of cognitive measures to differentiate biomarker groups have focused on differentiation of A+ versus A-individuals^52,53. Our inclusion of tau status defined by PET imaging, in addition to amyloid, is a strength. Our population-based sample helps to increase generalizability to clinical settings where comorbidities are common. Our approach of reporting both unadjusted and adjusted effect sizes helps illustrate the robust biomarker-group difference effect sizes observed in unadjusted analyses. For example, the SLS showed a large group difference effect size across A-T- and A+T+ groups in all participants (−0.88), that decreased to a medium effect size when controlling for demographics (−0.53) and was further attenuated when limited to CU only (−0.74 unadjusted, -0.37 adjusted). There is growing evidence that cognitive decline is not a normal part of the aging process, but rather, is reflective of previously undetected neuropathologies that increase in prevalence with increasing age^54-56. To maximize the utility of cognitive measures for informing risk of AD biomarker positivity, we recommend the use of raw scores and argue that the effect of age should not be routinely “adjusted” away as it decreases the predictive power of cognition.

Several limitations should also be noted. First, because this is a population-based study, the racial and ethnic characteristics reflect that of Olmsted County from which participants are randomly sampled, resulting in a predominantly White, Non-Hispanic sample. Second, the predominantly CU composition of this sample may have decreased AUROC values and magnitude of effect sizes for differentiating biomarker positive and negative groups, as suggested by the highly similar results seen when limiting analyses only to CU participants. A more balanced sample design, with more inclusion of individuals with mild to moderate dementia, could suggest greater utility of these memory measures for identifying individuals at risk for biomarker positivity; the current results are more relevant for preclinical detection given that 93% of our sample is CU. Third, a majority of the sample (83%) had prior exposure to the AVLT given the longitudinal nature of the MCSA and ADRC studies, thus practice effects could have impacted the ability of the AVLT to discriminate biomarker groups. Because biomarker negative participants benefit more from practice effects than biomarker positive participants, it is possible this could have amplified group difference effects for the AVLT^15,57. Future work is needed to replicate these results in a setting where both the SLS and AVLT are baseline administrations. Similarly, the entirely unsupervised and remote approach for the SLS could dampen the sensitivity of the SLS as the results presented in this study include all available remote data. While we capture participant-reported information about test interference, noise in the test environment, and participant comments that can provide additional information about test interruptions or environmental considerations, because our goal was to establish the robust criterion validity of the SLS “in the wild,” we did not apply any exclusionary criterion based on this information in the present study. Another reason we did not want to prematurely apply such exclusionary criteria is that individuals who are less able to follow instructions provided to establish the recommended test environment may be more likely to have cognitive impairment. Thus, increased likelihood of lower test performance in an uncontrolled environment, worsened by environmental distractions, could also be related to risk of cognitive decline. If cognitive screening / risk for cognitive decline is the goal, worse performance in remote settings may help identify risk in a way not captured by controlled clinical settings, adding an element of ecological validity in terms of ability to adapt to a new task without assistance. Future work will examine whether and to what degree these factors may influence test performance, as increased distractions in the home environment can negatively impact performance⁵⁸. Finally, while use of a strictly biomarker-defined ground truth is a novel aspect of this study, in vivo PET biomarkers also have some limitations such as high but imperfect reliability (manifested by “noise” in the trajectories of imaging results over time in some individuals), and the fact that PET measures of amyloid and tau pathology have a sensitivity floor and medically significant pathology can exist that lies beneath this detection threshold⁵⁹. Also, we adopted a liberal window for inclusion of available biomarker data to allow for some missed scanning opportunities during the COVID-19 pandemic, to maximize the sample size and because of the generally good stability of amyloid and tau classifications⁶⁰, but this also decreases study precision relative to a narrower time window.

In summary, SLS test design prioritized remote assessment needs and a computer-adaptive approach¹². Even though the SLS is not a direct adaptation of the AVLT, our results show highly similar ability of the remotely-administered SLS and in-person-administered AVLT to differentiate AD biomarker-defined groups. These results challenge preconceived notions about memory assessment by showing that creative use of a recognition memory paradigm that emphasizes learning in an all-remote unsupervised sample differentiates AD biomarker-defined groups as effectively as a traditional word list memory measure based on free recall responses.

Data Availability

The data that support the findings of this study are available from the corresponding author upon reasonable request and with appropriate institutional approvals.

Acknowledgment

The authors wish to thank the participants and staff at the Mayo Clinic Study of Aging and Mayo Alzheimer’s Disease Research Center. Research reported in this publication was supported by the Kevin Merszei Career Development Award in Neurodegenerative Diseases Research IHO Janet Vittone, MD, the Rochester Epidemiology Project (R01 AG034676), the National Institute on Aging of the National Institutes of Health (grant numbers R21AG073967, P30 AG062677, P50 AG016574, U01 AG006786, RF1 AG55151, R01 AG041851, R37 AG011378), the Robert Wood Johnson Foundation, The Elsie and Marvin Dekelboum Family Foundation, GHR Foundation, Alzheimer’s Association, and the Mayo Foundation for Education and Research. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. A Mayo Clinic invention disclosure has been submitted for the Stricker Learning Span and the Mayo Test Drive platform (NHS, JLS). We have no other conflicts of interest to disclose related to this work.

The following disclosures are present, outside the scope of this work: NHS has received research support from NIH and Biogen. MMMi has consulted for Biogen, Eli Lilly, Lab Corp, Merck, and Siemens Healthineers and receives research support from NIH. MMMa and JAF receive research support from NIH. DSK serves on a Data Safety Monitoring Board for the DIAN-TU study and is an investigator in clinical trials sponsored by Lilly Pharmaceuticals, Biogen, and the University of Southern California. RCP has served as a consultant for Hoffman-La Roche Inc., Merck Inc., Genentech Inc., Biogen Inc., Eisai, Inc. and Nestle, Inc. He also receives NIH support. CRJ serves on an independent data monitoring board for Roche but he receives no personal compensation from any commercial entity. CRJ receives research support from NIH and the Alexander Family Alzheimer’s Disease Research Professorship of the Mayo Clinic.

References

↵
Cummings, J., Lee, G., Zhong, K., Fonseca, J. & Taghva, K. Alzheimer’s disease drug development pipeline: 2021. Alzheimers Dement (N Y) 7, e12179, doi:10.1002/trc2.12179 (2021).
OpenUrl CrossRef
↵
Dorsey, E. R., Kluger, B. & Lipset, C. H. The New Normal in Clinical Trials: Decentralized Studies. Ann. Neurol. 88, 863–866, doi:10.1002/ana.25892 (2020).
OpenUrl CrossRef
↵
Sabbagh, M. et al. Early Detection of Mild Cognitive Impairment MCI in an At Home Setting. J Prev Alzheimers Dis, doi:10.14283/jpad.2020.21 (2020).
OpenUrl CrossRef
↵
Sabbagh, M. N. et al. Rationale for Early Diagnosis of Mild Cognitive Impairment (MCI) Supported by Emerging Digital Technologies. J Prev Alzheimers Dis 7, 158–164, doi:10.14283/jpad.2020.19 (2020).
OpenUrl CrossRef
↵
1. Gregory G. Brown,
2. Tricia Z. King,
3. Kathleen Y. Haaland, &
4. Bruce Crosson
Bauer, R. M. & Bilder, R. M. in APA Handbook of Neuropsychology Vol. 2: Neuroscience and Neuromethods (eds Gregory G. Brown, Tricia Z. King, Kathleen Y. Haaland, & Bruce Crosson) (American Psychological Association, 2023, in press).
Mackin, R. S. et al. Reliability and Validity of a Home-Based Self-Administered Computerized Test of Learning and Memory Using Speech Recognition. Neuropsychol. Dev. Cogn. B Aging Neuropsychol. Cogn., 1–15, doi:10.1080/13825585.2021.1927961 (2021).
OpenUrl CrossRef
↵
Marra, D. E., Hamlet, K. M., Bauer, R. M. & Bowers, D. Validity of teleneuropsychology for older adults in response to COVID-19: A systematic and critical review. Clin. Neuropsychol., 1–42, doi:10.1080/13854046.2020.1769192 (2020).
OpenUrl CrossRef
↵
Cromer, J. A. et al. Comparison of Cognitive Performance on the Cogstate Brief Battery When Taken In-Clinic, In-Group, and Unsupervised. Clin. Neuropsychol. 29, 542–558, doi:10.1080/13854046.2015.1054437 (2015).
OpenUrl CrossRef
Mielke, M. M. et al. Performance of the CogState computerized battery in the Mayo Clinic Study on Aging. Alzheimer’s & Dementia 11, 1367–1376, doi:10.1016/j.jalz.2015.01.008 (2015).
OpenUrl CrossRef PubMed
↵
Stricker, N. H. et al. Longitudinal Comparison of in Clinic and at Home Administration of the Cogstate Brief Battery and Demonstrated Practice Effects in the Mayo Clinic Study of Aging. The journal of prevention of Alzheimer’s disease 7, 21–28, doi:10.14283/jpad.2019.35 (2020).
OpenUrl CrossRef
↵
Caselli, R. J. et al. Neuropsychological decline up to 20 years before incident mild cognitive impairment. Alzheimers Dement 16, 512–523, doi:10.1016/j.jalz.2019.09.085 (2020).
OpenUrl CrossRef
↵
Stricker, J. L. et al. Neural network process simulations support a distributed memory system and aid design of a novel computer adaptive digital memory test for preclinical and prodromal Alzheimer’s disease. Neuropsychology, doi:10.1037/neu0000847 (2022).
OpenUrl CrossRef
↵
Stricker, N. H. et al. A novel computer adaptive word list memory test optimized for remote assessment: Psychometric properties and associations with neurodegenerative biomarkers in older women without dementia. Alzheimers Dement (Amst) 14, e12299, doi:10.1002/dad2.12299 (2022).
OpenUrl CrossRef
↵
Lim, Y. Y. et al. Association of deficits in short-term learning and Aβ and hippoampal volume in cognitively normal adults. Neurology, 10.1212/WNL.0000000000010728, doi:10.1212/wnl.0000000000010728 (2020).
OpenUrl CrossRef
↵
Machulda, M. M. et al. Practice effects and longitudinal cognitive change in clinically normal older adults differ by Alzheimer imaging biomarker status. Clin. Neuropsychol. 31, 99–117, doi:10.1080/13854046.2016.1241303 (2017).
OpenUrl CrossRef
↵
Duff, K. et al. Short-Term Practice Effects and Amyloid Deposition: Providing Information Above and Beyond Baseline Cognition. J Prev Alzheimers Dis 4, 87–92, doi:10.14283/jpad.2017.9 (2017).
OpenUrl CrossRef
↵
Bilder, R. M. & Reise, S. P. Neuropsychological tests of the future: How do we get there from here? Clin. Neuropsychol. 33, 220–245, doi:10.1080/13854046.2018.1521993 (2019).
OpenUrl CrossRef
↵
Wolters, E. E. et al. Clinical validity of increased cortical uptake of [(18)F]flortaucipir on PET as a biomarker for Alzheimer’s disease in the context of a structured 5-phase biomarker development framework. Eur. J. Nucl. Med. Mol. Imaging 48, 2097–2109, doi:10.1007/s00259-020-05118-w (2021).
OpenUrl CrossRef
↵
Chiotis, K. et al. Clinical validity of increased cortical uptake of amyloid ligands on PET as a biomarker for Alzheimer’s disease in the context of a structured 5-phase development framework. Neurobiol. Aging 52, 214–227, doi:10.1016/j.neurobiolaging.2016.07.012 (2017).
OpenUrl CrossRef
↵
Jack, C. R., Jr.. et al. NIA-AA Research Framework: Toward a biological definition of Alzheimer’s disease. Alzheimers Dement 14, 535–562, doi:10.1016/j.jalz.2018.02.018 (2018).
OpenUrl CrossRef PubMed
↵
St Sauver, J. L. et al. Data resource profile: the Rochester Epidemiology Project (REP) medical records-linkage system. Int. J. Epidemiol. 41, 1614–1624, doi:10.1093/ije/dys195 (2012).
OpenUrl CrossRef PubMed Web of Science
↵
Roberts, R. O. et al. The Mayo Clinic Study of Aging: Design and sampling, participation, baseline measures and sample characteristics. Neuroepidemiology 30, 58–69, doi:10.1159/000115751 (2008).
OpenUrl CrossRef PubMed Web of Science
↵
Kokmen, E., Smith, G. E., Petersen, R. C., Tangalos, E. & Ivnik, R. C. The short test of mental status: Correlations with standardized psychometric testing. Arch. Neurol. 48, 725–728, doi:10.1001/archneur.1991.00530190071018 (1991).
OpenUrl CrossRef PubMed Web of Science
↵
Morris, J. C. The Clinical Dementia Rating (CDR): Current version and scoring rules. Neurology 43, 2412–2414, doi:10.1212/WNL.43.11.2412-a (1993).
OpenUrl CrossRef PubMed
↵
Petersen, R. C. Mild cognitive impairment as a diagnostic entity. J. Intern. Med. 256, 183–194, doi:10.1111/j.1365-2796.2004.01388.x. (2004).
OpenUrl CrossRef PubMed Web of Science
↵
American Psychiatric Association. Diagnostic and Statistical Manual of Mental Disorders (DSM-IV). 4th edn, (American Psychiatric Association, 1994).
↵
Jack, C. R., Jr.. et al. 11C PiB and structural MRI provide complementary information in imaging of Alzheimer’s disease and amnestic mild cognitive impairment. Brain 131, 665–680 (2008).
OpenUrl CrossRef PubMed Web of Science
↵
Jack, C. R., Jr.. et al. Defining imaging biomarker cut points for brain aging and Alzheimer’s disease. Alzheimers Dement 13, 205–216, doi:10.1016/j.jalz.2016.08.005 (2017).
OpenUrl CrossRef PubMed
↵
Vemuri, P. et al. Tau-PET uptake: Regional variation in average SUVR and impact of amyloid deposition. Alzheimers Dement (Amst) 6, 21–30, doi:10.1016/j.dadm.2016.12.010 (2017).
OpenUrl CrossRef
↵
Klunk, W. E. et al. The Centiloid Project: standardizing quantitative amyloid plaque estimation by PET. Alzheimers Dement 11, 1-15 e11-14, doi:10.1016/j.jalz.2014.07.003 (2015).
OpenUrl CrossRef PubMed
↵
Alden, E. C. et al. Diagnostic accuracy of the Cogstate Brief Battery for prevalent MCI and prodromal AD (MCI A(+) T(+)) in a population-based sample. Alzheimers Dement 17, 584–594, doi:10.1002/alz.12219 (2021).
OpenUrl CrossRef
Pudumjee, S. B. et al. A Comparison of Cross-Sectional and Longitudinal Methods of Defining Objective Subtle Cognitive Decline in Preclinical Alzheimer’s Disease Based on Cogstate One Card Learning Accuracy Performance. J. Alzheimers Dis. 83, 861–877, doi:10.3233/JAD-210251 (2021).
OpenUrl CrossRef
↵
Stricker, N. H. et al. Diagnostic and Prognostic Accuracy of the Cogstate Brief Battery and Auditory Verbal Learning Test in Preclinical Alzheimer’s Disease and Incident Mild Cognitive Impairment: Implications for Defining Subtle Objective Cognitive Impairment J. Alzheimers Dis. 76, 261–274, doi:10.3233/JAD-200087 (2020).
OpenUrl CrossRef
↵
Stricker, N. H. et al. Mayo Normative Studies: Regression-Based Normative Data for the Auditory Verbal Learning Test for Ages 30–91 Years and the Importance of Adjusting for Sex. J. Int. Neuropsychol. Soc., 1–16, doi:10.1017/S1355617720000752 (2020).
OpenUrl CrossRef
↵
Ferman, T. J. et al. Mayo’s Older African American Normative Studies: Auditory Verbal Learning Test norms for African American elders. Clin. Neuropsychol. 19, 214–228, doi:10.1080/13854040590945300 (2005).
OpenUrl CrossRef PubMed Web of Science
↵
Jack, C. R., Jr.. et al. Age, Sex, and APOE epsilon4 Effects on Memory, Brain Structure, and beta-Amyloid Across the Adult Life Span. JAMA Neurol 72, 511–519, doi:10.1001/jamaneurol.2014.4821 (2015).
OpenUrl CrossRef PubMed
↵
Sliwinski, M. J. et al. Reliability and Validity of Ambulatory Cognitive Assessments. Assessment 25, 14–30, doi:10.1177/1073191116643164 (2016).
OpenUrl CrossRef PubMed
↵
Ohman, F., Hassenstab, J., Berron, D., Scholl, M. & Papp, K. V. Current advances in digital cognitive assessment for preclinical Alzheimer’s disease. Alzheimers Dement (Amst) 13, e12217, doi:10.1002/dad2.12217 (2021).
OpenUrl CrossRef
↵
Lohnas, L. J. & Kahana, M. J. Parametric effects of word frequency in memory for mixed frequency lists. J. Exp. Psychol. Learn. Mem. Cogn. 39, 1943–1946, doi:10.1037/a0033669 (2013).
OpenUrl CrossRef PubMed
↵
Clark, J. M. & Paivio, A. Extensions of the Paivio, Yuille, and Madigan (1968) norms. Behav. Res. Methods Instrum. Comput. 36, 371–383, doi:10.3758/BF03195584 (2004).
OpenUrl CrossRef
↵
Dolch, E. W. A basic sight vocabulary. The Elementary School Journal 36, 456–460, doi:10.1086/457353 (1936).
OpenUrl CrossRef
↵
Therneau, T. M. A package for survival analysis in R, < https://CRAN.Rproject.org/package=survival> (2021).
↵
Morrison, R. L. et al. A computerized, self-administered test of verbal episodic memory in elderly patients with mild cognitive impairment and healthy participants: A randomized, crossover, validation study. Alzheimers Dement (Amst) 10, 647–656, doi:10.1016/j.dadm.2018.08.010 (2018).
OpenUrl CrossRef
↵
Belleville, S. et al. Neuropsychological Measures that Predict Progression from Mild Cognitive Impairment to Alzheimer’s type dementia in Older Adults: a Systematic Review and Meta-Analysis. Neuropsychol. Rev. 27, 328–353, doi:10.1007/s11065-017-9361-5 (2017).
OpenUrl CrossRef
↵
Weissberger, G. H. et al. Diagnostic Accuracy of Memory Measures in Alzheimer’s Dementia and Mild Cognitive Impairment: a Systematic Review and Meta-Analysis. Neuropsychol. Rev. 27, 354–388, doi:10.1007/s11065-017-9360-6 (2017).
OpenUrl CrossRef PubMed
↵
1. K. W. Spence &
2. J. T. Spence
Atkinson, R. C. & Shiffrin, R. M. in The psychology of learning and motivation: II. (ed K. W. Spence & J. T. Spence) (Academic Press, 1968).
Gavett, B. E. & Horwitz, J. E. Immediate list recall as a measure of short-term episodic memory: insights from the serial position effect and item response theory. Arch. Clin. Neuropsychol. 27, 125–135, doi:10.1093/arclin/acr104 (2012).
OpenUrl CrossRef PubMed
↵
Greene, J. D., Baddeley, A. D. & Hodges, J. R. Analysis of the episodic memory deficit in early Alzheimer’s disease: evidence from the doors and people test. Neuropsychologia 34, 537–551, doi:10.1016/0028-3932(95)00151-4 (1996).
OpenUrl CrossRef PubMed Web of Science
↵
Cunha, C., Guerreiro, M., de Mendonca, A., Oliveira, P. E. & Santana, I. Serial position effects in Alzheimer’s disease, mild cognitive impairment, and normal aging: predictive value for conversion to dementia. J. Clin. Exp. Neuropsychol. 34, 841–852, doi:10.1080/13803395.2012.689814 (2012).
OpenUrl CrossRef PubMed
↵
Grober, E. & Kawas, C. Learning and retention in preclinical and early Alzheimer’s disease. Psychol. Aging 12, 183–188, doi:10.1037//0882-7974.12.1.183 (1997).
OpenUrl CrossRef PubMed Web of Science
↵
Stamate, A., Logie, R. H., Baddeley, A. D. & Della Sala, S. Forgetting in Alzheimer’s disease: Is it fast? Is it affected by repeated retrieval? Neuropsychologia 138, 107351, doi:10.1016/j.neuropsychologia.2020.107351 (2020).
OpenUrl CrossRef
↵
Duke Han, S., Nguyen, C. P., Stricker, N. H. & Nation, D. A. Detectable Neuropsychological Differences in Early Preclinical Alzheimer’s Disease: A Meta-Analysis. Neuropsychol. Rev. 27, 305–325, doi:10.1007/s11065-017-9345-5 (2017).
OpenUrl CrossRef
↵
Baker, J. E. et al. Cognitive impairment and decline in cognitively normal older adults with high amyloid-beta: A meta-analysis. Alzheimers Dement (Amst) 6, 108–121, doi:10.1016/j.dadm.2016.09.002 (2017).
OpenUrl CrossRef
↵
Boyle, P. A. et al. To what degree is late life cognitive decline driven by age-related neuropathologies? Brain 144, 2166–2175, doi:10.1093/brain/awab092 (2021).
OpenUrl CrossRef
Harrington, K. D. et al. Estimates of age-related memory decline are inflated by unrecognized Alzheimer’s disease. Neurobiol. Aging 70, 170–179, doi:10.1016/j.neurobiolaging.2018.06.005 (2018).
OpenUrl CrossRef
↵
Bos, I. et al. Amyloid-beta, Tau, and Cognition in Cognitively Normal Older Individuals: Examining the Necessity to Adjust for Biomarker Status in Normative Data. Front. Aging Neurosci. 10, 193, doi:10.3389/fnagi.2018.00193 (2018).
OpenUrl CrossRef
↵
Alden, E. C. et al. Mayo normative studies: A conditional normative model for longitudinal change on the Auditory Verbal Learning Test and preliminary validation in preclinical Alzheimer’s disease. Alzheimers Dement (Amst) 14, e12325, doi:10.1002/dad2.12325 (2022).
OpenUrl CrossRef
↵
Madero, E. N. et al. Environmental Distractions during Unsupervised Remote Digital Cognitive Assessment. The Journal of Prevention of Alzheimer’s Disease, 1–4, doi:10.14283/jpad.2021.9 (2021).
OpenUrl CrossRef
↵
Lee, J. et al. The Overlap Index as a means of evaluating early tau-PET signal reliability. J. Nucl. Med., doi:10.2967/jnumed.121.263136 (2022).
OpenUrl Abstract/FREE Full Text
↵
Jack, C. R., Jr.. et al. Associations of Amyloid, Tau, and Neurodegeneration Biomarker Profiles With Rates of Memory Decline Among Individuals Without Dementia. JAMA 321, 2316–2325, doi:10.1001/jama.2019.7437 (2019).
OpenUrl CrossRef PubMed
Papp, K. V. et al. The Computerized Cognitive Composite (C3) in an Alzheimer’s Disease Secondary Prevention Trial. J Prev Alzheimers Dis 8, 59–67, doi:10.14283/jpad.2020.38 (2021).
OpenUrl CrossRef

View the discussion thread.

Posted November 01, 2022.

Download PDF

Supplementary Material

Data/Code

Citation Tools

Subject Area

Neurology

Subject Areas

All Articles

Addiction Medicine (323)
Allergy and Immunology (627)
Anesthesia (163)
Cardiovascular Medicine (2367)
Dentistry and Oral Medicine (288)
Dermatology (206)
Emergency Medicine (379)
Endocrinology (including Diabetes Mellitus and Metabolic Disease) (835)
Epidemiology (11765)
Forensic Medicine (10)
Gastroenterology (702)
Genetic and Genomic Medicine (3731)
Geriatric Medicine (348)
Health Economics (633)
Health Informatics (2392)
Health Policy (929)
Health Systems and Quality Improvement (896)
Hematology (340)
HIV/AIDS (780)
Infectious Diseases (except HIV/AIDS) (13303)
Intensive Care and Critical Care Medicine (767)
Medical Education (365)
Medical Ethics (104)
Nephrology (398)
Neurology (3493)
Nursing (198)
Nutrition (523)
Obstetrics and Gynecology (673)
Occupational and Environmental Health (662)
Oncology (1819)
Ophthalmology (535)
Orthopedics (218)
Otolaryngology (287)
Pain Medicine (232)
Palliative Medicine (66)
Pathology (446)
Pediatrics (1032)
Pharmacology and Therapeutics (426)
Primary Care Research (420)
Psychiatry and Clinical Psychology (3172)
Public and Global Health (6135)
Radiology and Imaging (1279)
Rehabilitation Medicine and Physical Therapy (746)
Respiratory Medicine (825)
Rheumatology (379)
Sexual and Reproductive Health (372)
Sports Medicine (322)
Surgery (401)
Toxicology (50)
Transplantation (172)
Urology (145)

[1] ↵
Cummings, J., Lee, G., Zhong, K., Fonseca, J. & Taghva, K. Alzheimer’s disease drug development pipeline: 2021. Alzheimers Dement (N Y) 7, e12179, doi:10.1002/trc2.12179 (2021).
OpenUrl CrossRef

[2] ↵
Dorsey, E. R., Kluger, B. & Lipset, C. H. The New Normal in Clinical Trials: Decentralized Studies. Ann. Neurol. 88, 863–866, doi:10.1002/ana.25892 (2020).
OpenUrl CrossRef

[3] ↵
Sabbagh, M. et al. Early Detection of Mild Cognitive Impairment MCI in an At Home Setting. J Prev Alzheimers Dis, doi:10.14283/jpad.2020.21 (2020).
OpenUrl CrossRef

[4] ↵
Sabbagh, M. N. et al. Rationale for Early Diagnosis of Mild Cognitive Impairment (MCI) Supported by Emerging Digital Technologies. J Prev Alzheimers Dis 7, 158–164, doi:10.14283/jpad.2020.19 (2020).
OpenUrl CrossRef

[5] ↵
Gregory G. Brown,
Tricia Z. King,
Kathleen Y. Haaland, &
Bruce Crosson
Bauer, R. M. & Bilder, R. M. in APA Handbook of Neuropsychology Vol. 2: Neuroscience and Neuromethods (eds Gregory G. Brown, Tricia Z. King, Kathleen Y. Haaland, & Bruce Crosson) (American Psychological Association, 2023, in press).

[6] Gregory G. Brown,

[7] Tricia Z. King,

[8] Kathleen Y. Haaland, &

[9] Bruce Crosson

[10] Mackin, R. S. et al. Reliability and Validity of a Home-Based Self-Administered Computerized Test of Learning and Memory Using Speech Recognition. Neuropsychol. Dev. Cogn. B Aging Neuropsychol. Cogn., 1–15, doi:10.1080/13825585.2021.1927961 (2021).
OpenUrl CrossRef

[11] ↵
Marra, D. E., Hamlet, K. M., Bauer, R. M. & Bowers, D. Validity of teleneuropsychology for older adults in response to COVID-19: A systematic and critical review. Clin. Neuropsychol., 1–42, doi:10.1080/13854046.2020.1769192 (2020).
OpenUrl CrossRef

[12] ↵
Cromer, J. A. et al. Comparison of Cognitive Performance on the Cogstate Brief Battery When Taken In-Clinic, In-Group, and Unsupervised. Clin. Neuropsychol. 29, 542–558, doi:10.1080/13854046.2015.1054437 (2015).
OpenUrl CrossRef

[13] Mielke, M. M. et al. Performance of the CogState computerized battery in the Mayo Clinic Study on Aging. Alzheimer’s & Dementia 11, 1367–1376, doi:10.1016/j.jalz.2015.01.008 (2015).
OpenUrl CrossRef PubMed

[14] ↵
Stricker, N. H. et al. Longitudinal Comparison of in Clinic and at Home Administration of the Cogstate Brief Battery and Demonstrated Practice Effects in the Mayo Clinic Study of Aging. The journal of prevention of Alzheimer’s disease 7, 21–28, doi:10.14283/jpad.2019.35 (2020).
OpenUrl CrossRef

[15] ↵
Caselli, R. J. et al. Neuropsychological decline up to 20 years before incident mild cognitive impairment. Alzheimers Dement 16, 512–523, doi:10.1016/j.jalz.2019.09.085 (2020).
OpenUrl CrossRef

[16] ↵
Stricker, J. L. et al. Neural network process simulations support a distributed memory system and aid design of a novel computer adaptive digital memory test for preclinical and prodromal Alzheimer’s disease. Neuropsychology, doi:10.1037/neu0000847 (2022).
OpenUrl CrossRef

[17] ↵
Stricker, N. H. et al. A novel computer adaptive word list memory test optimized for remote assessment: Psychometric properties and associations with neurodegenerative biomarkers in older women without dementia. Alzheimers Dement (Amst) 14, e12299, doi:10.1002/dad2.12299 (2022).
OpenUrl CrossRef

[18] ↵
Lim, Y. Y. et al. Association of deficits in short-term learning and Aβ and hippoampal volume in cognitively normal adults. Neurology, 10.1212/WNL.0000000000010728, doi:10.1212/wnl.0000000000010728 (2020).
OpenUrl CrossRef

[19] ↵
Machulda, M. M. et al. Practice effects and longitudinal cognitive change in clinically normal older adults differ by Alzheimer imaging biomarker status. Clin. Neuropsychol. 31, 99–117, doi:10.1080/13854046.2016.1241303 (2017).
OpenUrl CrossRef

[20] ↵
Duff, K. et al. Short-Term Practice Effects and Amyloid Deposition: Providing Information Above and Beyond Baseline Cognition. J Prev Alzheimers Dis 4, 87–92, doi:10.14283/jpad.2017.9 (2017).
OpenUrl CrossRef

[21] ↵
Bilder, R. M. & Reise, S. P. Neuropsychological tests of the future: How do we get there from here? Clin. Neuropsychol. 33, 220–245, doi:10.1080/13854046.2018.1521993 (2019).
OpenUrl CrossRef

[22] ↵
Wolters, E. E. et al. Clinical validity of increased cortical uptake of [(18)F]flortaucipir on PET as a biomarker for Alzheimer’s disease in the context of a structured 5-phase biomarker development framework. Eur. J. Nucl. Med. Mol. Imaging 48, 2097–2109, doi:10.1007/s00259-020-05118-w (2021).
OpenUrl CrossRef

[23] ↵
Chiotis, K. et al. Clinical validity of increased cortical uptake of amyloid ligands on PET as a biomarker for Alzheimer’s disease in the context of a structured 5-phase development framework. Neurobiol. Aging 52, 214–227, doi:10.1016/j.neurobiolaging.2016.07.012 (2017).
OpenUrl CrossRef

[24] ↵
Jack, C. R., Jr.. et al. NIA-AA Research Framework: Toward a biological definition of Alzheimer’s disease. Alzheimers Dement 14, 535–562, doi:10.1016/j.jalz.2018.02.018 (2018).
OpenUrl CrossRef PubMed

[25] ↵
St Sauver, J. L. et al. Data resource profile: the Rochester Epidemiology Project (REP) medical records-linkage system. Int. J. Epidemiol. 41, 1614–1624, doi:10.1093/ije/dys195 (2012).
OpenUrl CrossRef PubMed Web of Science

[26] ↵
Roberts, R. O. et al. The Mayo Clinic Study of Aging: Design and sampling, participation, baseline measures and sample characteristics. Neuroepidemiology 30, 58–69, doi:10.1159/000115751 (2008).
OpenUrl CrossRef PubMed Web of Science

[27] ↵
Kokmen, E., Smith, G. E., Petersen, R. C., Tangalos, E. & Ivnik, R. C. The short test of mental status: Correlations with standardized psychometric testing. Arch. Neurol. 48, 725–728, doi:10.1001/archneur.1991.00530190071018 (1991).
OpenUrl CrossRef PubMed Web of Science

[28] ↵
Morris, J. C. The Clinical Dementia Rating (CDR): Current version and scoring rules. Neurology 43, 2412–2414, doi:10.1212/WNL.43.11.2412-a (1993).
OpenUrl CrossRef PubMed

[29] ↵
Petersen, R. C. Mild cognitive impairment as a diagnostic entity. J. Intern. Med. 256, 183–194, doi:10.1111/j.1365-2796.2004.01388.x. (2004).
OpenUrl CrossRef PubMed Web of Science

[30] ↵
American Psychiatric Association. Diagnostic and Statistical Manual of Mental Disorders (DSM-IV). 4th edn, (American Psychiatric Association, 1994).

[31] ↵
Jack, C. R., Jr.. et al. 11C PiB and structural MRI provide complementary information in imaging of Alzheimer’s disease and amnestic mild cognitive impairment. Brain 131, 665–680 (2008).
OpenUrl CrossRef PubMed Web of Science

[32] ↵
Jack, C. R., Jr.. et al. Defining imaging biomarker cut points for brain aging and Alzheimer’s disease. Alzheimers Dement 13, 205–216, doi:10.1016/j.jalz.2016.08.005 (2017).
OpenUrl CrossRef PubMed

[33] ↵
Vemuri, P. et al. Tau-PET uptake: Regional variation in average SUVR and impact of amyloid deposition. Alzheimers Dement (Amst) 6, 21–30, doi:10.1016/j.dadm.2016.12.010 (2017).
OpenUrl CrossRef

[34] ↵
Klunk, W. E. et al. The Centiloid Project: standardizing quantitative amyloid plaque estimation by PET. Alzheimers Dement 11, 1-15 e11-14, doi:10.1016/j.jalz.2014.07.003 (2015).
OpenUrl CrossRef PubMed

[35] ↵
Alden, E. C. et al. Diagnostic accuracy of the Cogstate Brief Battery for prevalent MCI and prodromal AD (MCI A(+) T(+)) in a population-based sample. Alzheimers Dement 17, 584–594, doi:10.1002/alz.12219 (2021).
OpenUrl CrossRef

[36] Pudumjee, S. B. et al. A Comparison of Cross-Sectional and Longitudinal Methods of Defining Objective Subtle Cognitive Decline in Preclinical Alzheimer’s Disease Based on Cogstate One Card Learning Accuracy Performance. J. Alzheimers Dis. 83, 861–877, doi:10.3233/JAD-210251 (2021).
OpenUrl CrossRef

[37] ↵
Stricker, N. H. et al. Diagnostic and Prognostic Accuracy of the Cogstate Brief Battery and Auditory Verbal Learning Test in Preclinical Alzheimer’s Disease and Incident Mild Cognitive Impairment: Implications for Defining Subtle Objective Cognitive Impairment J. Alzheimers Dis. 76, 261–274, doi:10.3233/JAD-200087 (2020).
OpenUrl CrossRef

[38] ↵
Stricker, N. H. et al. Mayo Normative Studies: Regression-Based Normative Data for the Auditory Verbal Learning Test for Ages 30–91 Years and the Importance of Adjusting for Sex. J. Int. Neuropsychol. Soc., 1–16, doi:10.1017/S1355617720000752 (2020).
OpenUrl CrossRef

[39] ↵
Ferman, T. J. et al. Mayo’s Older African American Normative Studies: Auditory Verbal Learning Test norms for African American elders. Clin. Neuropsychol. 19, 214–228, doi:10.1080/13854040590945300 (2005).
OpenUrl CrossRef PubMed Web of Science

[40] ↵
Jack, C. R., Jr.. et al. Age, Sex, and APOE epsilon4 Effects on Memory, Brain Structure, and beta-Amyloid Across the Adult Life Span. JAMA Neurol 72, 511–519, doi:10.1001/jamaneurol.2014.4821 (2015).
OpenUrl CrossRef PubMed

[41] ↵
Sliwinski, M. J. et al. Reliability and Validity of Ambulatory Cognitive Assessments. Assessment 25, 14–30, doi:10.1177/1073191116643164 (2016).
OpenUrl CrossRef PubMed

[42] ↵
Ohman, F., Hassenstab, J., Berron, D., Scholl, M. & Papp, K. V. Current advances in digital cognitive assessment for preclinical Alzheimer’s disease. Alzheimers Dement (Amst) 13, e12217, doi:10.1002/dad2.12217 (2021).
OpenUrl CrossRef

[43] ↵
Lohnas, L. J. & Kahana, M. J. Parametric effects of word frequency in memory for mixed frequency lists. J. Exp. Psychol. Learn. Mem. Cogn. 39, 1943–1946, doi:10.1037/a0033669 (2013).
OpenUrl CrossRef PubMed

[44] ↵
Clark, J. M. & Paivio, A. Extensions of the Paivio, Yuille, and Madigan (1968) norms. Behav. Res. Methods Instrum. Comput. 36, 371–383, doi:10.3758/BF03195584 (2004).
OpenUrl CrossRef

[45] ↵
Dolch, E. W. A basic sight vocabulary. The Elementary School Journal 36, 456–460, doi:10.1086/457353 (1936).
OpenUrl CrossRef

[46] ↵
Therneau, T. M. A package for survival analysis in R, < https://CRAN.Rproject.org/package=survival> (2021).

[47] ↵
Morrison, R. L. et al. A computerized, self-administered test of verbal episodic memory in elderly patients with mild cognitive impairment and healthy participants: A randomized, crossover, validation study. Alzheimers Dement (Amst) 10, 647–656, doi:10.1016/j.dadm.2018.08.010 (2018).
OpenUrl CrossRef

[48] ↵
Belleville, S. et al. Neuropsychological Measures that Predict Progression from Mild Cognitive Impairment to Alzheimer’s type dementia in Older Adults: a Systematic Review and Meta-Analysis. Neuropsychol. Rev. 27, 328–353, doi:10.1007/s11065-017-9361-5 (2017).
OpenUrl CrossRef

[49] ↵
Weissberger, G. H. et al. Diagnostic Accuracy of Memory Measures in Alzheimer’s Dementia and Mild Cognitive Impairment: a Systematic Review and Meta-Analysis. Neuropsychol. Rev. 27, 354–388, doi:10.1007/s11065-017-9360-6 (2017).
OpenUrl CrossRef PubMed

[50] ↵
K. W. Spence &
J. T. Spence
Atkinson, R. C. & Shiffrin, R. M. in The psychology of learning and motivation: II. (ed K. W. Spence & J. T. Spence) (Academic Press, 1968).

[51] K. W. Spence &

[52] J. T. Spence

[53] Gavett, B. E. & Horwitz, J. E. Immediate list recall as a measure of short-term episodic memory: insights from the serial position effect and item response theory. Arch. Clin. Neuropsychol. 27, 125–135, doi:10.1093/arclin/acr104 (2012).
OpenUrl CrossRef PubMed

[54] ↵
Greene, J. D., Baddeley, A. D. & Hodges, J. R. Analysis of the episodic memory deficit in early Alzheimer’s disease: evidence from the doors and people test. Neuropsychologia 34, 537–551, doi:10.1016/0028-3932(95)00151-4 (1996).
OpenUrl CrossRef PubMed Web of Science

[55] ↵
Cunha, C., Guerreiro, M., de Mendonca, A., Oliveira, P. E. & Santana, I. Serial position effects in Alzheimer’s disease, mild cognitive impairment, and normal aging: predictive value for conversion to dementia. J. Clin. Exp. Neuropsychol. 34, 841–852, doi:10.1080/13803395.2012.689814 (2012).
OpenUrl CrossRef PubMed

[56] ↵
Grober, E. & Kawas, C. Learning and retention in preclinical and early Alzheimer’s disease. Psychol. Aging 12, 183–188, doi:10.1037//0882-7974.12.1.183 (1997).
OpenUrl CrossRef PubMed Web of Science

[57] ↵
Stamate, A., Logie, R. H., Baddeley, A. D. & Della Sala, S. Forgetting in Alzheimer’s disease: Is it fast? Is it affected by repeated retrieval? Neuropsychologia 138, 107351, doi:10.1016/j.neuropsychologia.2020.107351 (2020).
OpenUrl CrossRef

[58] ↵
Duke Han, S., Nguyen, C. P., Stricker, N. H. & Nation, D. A. Detectable Neuropsychological Differences in Early Preclinical Alzheimer’s Disease: A Meta-Analysis. Neuropsychol. Rev. 27, 305–325, doi:10.1007/s11065-017-9345-5 (2017).
OpenUrl CrossRef

[59] ↵
Baker, J. E. et al. Cognitive impairment and decline in cognitively normal older adults with high amyloid-beta: A meta-analysis. Alzheimers Dement (Amst) 6, 108–121, doi:10.1016/j.dadm.2016.09.002 (2017).
OpenUrl CrossRef

[60] ↵
Boyle, P. A. et al. To what degree is late life cognitive decline driven by age-related neuropathologies? Brain 144, 2166–2175, doi:10.1093/brain/awab092 (2021).
OpenUrl CrossRef

[61] Harrington, K. D. et al. Estimates of age-related memory decline are inflated by unrecognized Alzheimer’s disease. Neurobiol. Aging 70, 170–179, doi:10.1016/j.neurobiolaging.2018.06.005 (2018).
OpenUrl CrossRef

[62] ↵
Bos, I. et al. Amyloid-beta, Tau, and Cognition in Cognitively Normal Older Individuals: Examining the Necessity to Adjust for Biomarker Status in Normative Data. Front. Aging Neurosci. 10, 193, doi:10.3389/fnagi.2018.00193 (2018).
OpenUrl CrossRef

[63] ↵
Alden, E. C. et al. Mayo normative studies: A conditional normative model for longitudinal change on the Auditory Verbal Learning Test and preliminary validation in preclinical Alzheimer’s disease. Alzheimers Dement (Amst) 14, e12325, doi:10.1002/dad2.12325 (2022).
OpenUrl CrossRef

[64] ↵
Madero, E. N. et al. Environmental Distractions during Unsupervised Remote Digital Cognitive Assessment. The Journal of Prevention of Alzheimer’s Disease, 1–4, doi:10.14283/jpad.2021.9 (2021).
OpenUrl CrossRef

[65] ↵
Lee, J. et al. The Overlap Index as a means of evaluating early tau-PET signal reliability. J. Nucl. Med., doi:10.2967/jnumed.121.263136 (2022).
OpenUrl Abstract/FREE Full Text

[66] ↵
Jack, C. R., Jr.. et al. Associations of Amyloid, Tau, and Neurodegeneration Biomarker Profiles With Rates of Memory Decline Among Individuals Without Dementia. JAMA 321, 2316–2325, doi:10.1001/jama.2019.7437 (2019).
OpenUrl CrossRef PubMed

[67] Papp, K. V. et al. The Computerized Cognitive Composite (C3) in an Alzheimer’s Disease Secondary Prevention Trial. J Prev Alzheimers Dis 8, 59–67, doi:10.14283/jpad.2020.38 (2021).
OpenUrl CrossRef

Stricker Learning Span criterion validity: a remote self-administered multi-device compatible digital word list memory measure shows similar ability to differentiate amyloid and tau PET-defined biomarker groups as in-person Auditory Verbal Learning Test

Abstract

1. Introduction

2. Methods

In Vivo Neuroimaging Markers of Amyloid and Tau

Person-administered AVLT completed in clinic

Self-administered SLS completed remotely (not in clinic)

Inclusion Criteria

Statistical methods

3. Results

3.1 Participant Characteristics and Convergent Validity

3.2 The SLS shows similar ability to differentiate PET-defined biomarker groups compared to the AVLT (Aim 1, all participants)

3.2.1 AUROC comparisons

3.2.1a Amyloid Groups

3.2.1b Amyloid and Tau Groups

3.2.2 Descriptive effect sizes from group difference analyses (Table 5)

3.2.2a Amyloid Groups

3.2.2b Amyloid and Tau Groups

3.3 The SLS shows similar ability to differentiate PET-defined biomarker groups compared to the AVLT for preclinical AD (CU participants only; Aim 2)

3.3.1 AUROC comparisons

3.3.1a Amyloid Groups

3.3.1b Amyloid and Tau Groups

3.3.2 Descriptive effect sizes from group difference analyses (Table 5)

3.3.2a Amyloid Groups

3.3.2b Amyloid and Tau Groups

3.4 Learning measures show similar ability as delay memory measures to differentiate A-T- and A+T+ groups for both the SLS and the AVLT

3.4.1 All participants

Data Availability

4. Discussion

Data Availability

Acknowledgment

References

Citation Manager Formats

Subject Area