Comparison of approaches to control for intracranial volume in research on the association of brain volumes with cognitive outcomes =================================================================================================================================== * Jingxuan Wang * Tanisha Hill-Jarrett * Peter Buto * Annie Pederson * Kendra D. Sims * Scott C. Zimmerman * Michelle A DeVost * Erin Ferguson * Benjamin Lacar * Yulin Yang * Minhyuk Choi * Michelle R. Caunca * Renaud La Joie * Ruijia Chen * M. Maria Glymour * Sarah F. Ackley ## Abstract **Background** Consistent methods for evaluating the link between brain structure and cognition are essential for understanding determinants of neurologic outcomes. Studies examining associations between brain volumetric measures and cognition use various statistical approaches to account for variation in intracranial volume (ICV). It is unclear if commonly used approaches yield consistent results. **Methods** Using a brain-wide association approach in the MRI substudy of UK Biobank (N=41,964; mean age=64.5 years), we used regression models to estimate the associations of 58 regional brain volumetric measures with eight cognitive outcomes, comparing no correction and five ICV correction approaches. Approaches evaluated included: no correction; dividing regional volumes by ICV, with and without further adjustment for ICV (proportional approach); including ICV as a covariate in the regression (adjustment approach); and regressing the regional volumes against ICV in different normative samples and using calculated residuals to determine associations (residual approach). We used Spearman-rank correlations and two consistency measures to quantify the extent to which associations were inconsistent across ICV correction approaches for each possible brain region and cognitive outcome pair across 2,784 regression models. **Findings** The adjustment and residual approaches typically produced similar estimates, which were inconsistent with results from the crude and proportional approaches. Inconsistencies across approaches were largest when estimates from the adjustment and residual approaches were further from the null. That is, the approach used was least important when the association between brain volume and cognitive performance was close to null; in this case, all approaches tend to estimate a null association. **Interpretation** Commonly used methods to correct for ICV yield inconsistent results and the proportional method diverges from other methods. Adjustment methods are the simplest to implement while producing biologically plausible associations. ## Introduction Structural magnetic resonance imaging (MRI) is widely used to evaluate the relationship between brain volumetric measures and cognitive measures, including memory, attention, and executive function.1,2 In studies examining associations between brain volumetric measures and cognition in aging populations, correcting volumetric measures for intracranial volume (ICV) is often necessary to account for differences in skull size.3,4 This is because skull size is not an independent predictor of dementia5 and is associated with numerous childhood and adulthood socioeconomic status factors that may confound associatons between neuroimaging measures and outcomes.6 Several different statistical approaches to account for ICV have been adopted in the field. These different approaches may produce inconsistent estimates, but, to date, there is no clear guidance on which approaches are preferable. This presents significant challenges for reproducibility in neuroimaging studies, may account for inconsistent results across studies, and likely contributes to incorrect estimates in some studies. Associations between volumetric measures and treatment or outcomes are used in a wide range of contexts, including drug trials,7 and may be used inform biological understanding of disease, research priorities, and interventions.8 Thus, analysis decisions on ICV correction may ultimately impact individual clinical diagnosis and management.9 The proportional, adjustment, and residual approaches are the three commonly used approaches to correct for ICV (Box 1). For the proportional approach, each brain volume measure is divided by ICV and this scaled quantity is used to determine associations with a cognitive measure, typically using a regression model.4,10,11 For the adjustment approach, ICV is included along with the brain volume measure as an independent variable in a regression model with cognition as the dependent variable.4,11 The residual approach uses two regression models, first regressing each regional volume against ICV, calculating the residuals from this model (i.e., variation in brain volume not predicted by ICV), and using the calculated residuals as the independent variable in a regression with cognitive measures as the predictor. Typically, but not always, the coefficients for the first regression are estimated in a normative sample of healthy controls.4,11,12 Specific implementations of these methods vary, and sometimes crude volumes are used without correcting for ICV.13 Box 1: Summary of commonly used approaches used to correct for intracranial volume (ICV). View this table: [Table1](http://medrxiv.org/content/early/2023/07/16/2023.07.14.23292678/T1) Evidence on how different correction approaches modify the estimated associations between MRI volumetric measures and cognition is limited. Some prior studies examine how other associations with MRI volumetric measures, such as sex, gender, and age,14–16 vary with ICV correction strategy. The small number of prior studies evaluating how different ICV correction strategies affect associations between volumetric measures and cognition have important limitations: they do not evaluate all commonly used ICV correction approaches; they use a single or small number of cognitive measures; are performed in younger samples or small samples; or they are performed in highly select volunteer cohorts (e.g. the Alzheimer’s Disease Neuroimaging Intitiative cohort) such that results may not generalize to the general aging population.4,14–16 In this study, we compared estimated associations between MRI volumetric measures and cognitive measures across commonly used ICV correction approaches in the MRI subsample of the UK Biobank. The UK Biobank is a cohort of middle-aged and older adults participating in the National Health Service. As such, Alzheimer’s disease and vascular dementia would be expected to be the most common causes of brain atrophy.17,18 Inspired by brain-wide association studies (BWAS), which evaluate each pairwise association of brain region and outcome,19 we evaluated the extent to which different ICV correction approaches give inconsistent associations between a brain volume and cognitive measure for the full-factorial combination of ICV correction approach, brain volumetric measure, and cognitive outcome. Inconsistency across ICV correction approaches may have important implications for reproducibility in neuroimaging research. These inconsistencies may account for conflicting findings across studies and produce spurious associations that would not reach statistical significance with alternative approaches. ## Methods ### Study Population The UK Biobank is a large prospective cohort study of 502,490 UK adults aged 40-69 years at recruitment in 2006-2010. At the baseline visit, participants completed social, physical, and medical assessments. Starting in 2014, participants were invited for MRI neuroimaging at four clinics using identical protocols.20 The final target sample for the MRI substudy is 100,000 but at the time of writing, MRI imaging data were available for 41,964 participants. Ethical approval was obtained by the UKB study from the National Health Service National Research Ethics Service with all participants providing written informed consent. Analyses were approved by the University of California, San Francisco Institutional Review Board under UK Biobank Resource project #74748. ### MRI Volumetric Measures All MRI imaging was carried out using Siemens Skyra 3 Tesla scanner with a standard 32-channel head coil. Full details on the image acquisition, processing, and quality control are available at UK Biobank Brain Imaging Documentation.21 All image preprocessing was conducted by the UK Biobank neuroimaging team and included non-brain removal, bias-field correction, and tissue segmentation. The T1-weighted images with a cubic millimeter isotropic resolution were previously analyzed with FMRIB Software Library (FSL). T1-weighted images and T2-weighted fluid attenuation inversion recovery (FLAIR) images were further processed using FreeSurfer to generate subcortical volumes using the automatic subcortical segmentation (ASEG), and cortical volumes based on the Desikan-Killianny-Tourville (DKT) atlas. We included 27 subcortical and 31 cortical regions (**Supplementary Material 1**). ICV was estimated from ASEG. In our primary analysis, we *a priori* selected eight regions of interest (ROIs) previously linked to cognitive outcomes.22–26 Only data from the first MRI visit was used due to limited follow-up. We combined hemispheres to obtain a single measure for each ROI. ### Cognitive Measures We considered eight cognitive measures: fluid intelligence, numeric memory, prospective memory, pairs matching, Trail Making A, Trail Making B, reaction time, and symbol digit substitution. Detailed descriptions of the eight cognitive tests are included in **Supplementary Material 2.** All cognitive tests were administered in English via touch screen interface and designed to be completed without supervision. All cognitive outcomes were measured at the MRI visit. Cognitive scores for pairs matching, Trail Making A, Trail Making B, and reaction time referred to negative one times the score on those cognitive tests; that is, these measures were signed such that higher values indicate better cognition. All continuous cognitive outcomes were *z*-standardized. ### ICV Correction Approaches We considered no correction and five implementations of widely used approaches to correct for ICV in analyses of associations between regional brain volumes and cognition (in all models, the cognitive measure is the outcome variable of interest).10,11,27 The first approach uses crude, uncorrected volumes, while the other five are ICV correction approaches. The approaches are as follows: 1) **Crude approach:** Crude volumes are used without correcting for ICV to determine associations with cognitive measures in a regression model. 2) **Proportional approach:** volumetric measures are divided by ICV and this ratio of regional volume to ICV is used to determine associations in a regression model with cognition as the outcome. 3) **Proportional with adjustment approach:** The regional volume to ICV ratio from the proportional approach is used to determine associations in a regression model that additionally adjusts for ICV as a covariate. 4) **Adjustment approach:** Crude volumes are used to determine associations in a regression model that adjusts for ICV as a covariate. 5) **Full-sample residual approach:** Each volumetric measure is first regressed against ICV in the full sample of participants. This regression model is then used to obtain ICV-corrected volumes in the whole sample using the following: ![Graphic][1], where ![Graphic][2] is the slope from the regression model. ICV-corrected volumes (i.e., residuals from the first regression model) are then used as the independent variable in a regression model to determine associations with cognitive measures. 6) **Normative-subsample residual approach**: This approach is identical to the full-sample residual approach, but the first regression is restricted to a “normative” sample of dementia-free participants younger than 60 years. ### Covariates Covariates included age, sex14,15 (female, male), race (self-identified and categorized as White, Black, Asian, or Mixed/Other), level of education (A-levels or above or less than A-levels), assessment center (Cheadle, Reading, Newcastle, or Bristol), and number of APOE-ε4 alleles. At recruitment, age and sex information were obtained from a central registry and subsequently updated by participants. Participants self-reported their race and education through a touchscreen questionnaire at baseline. The number of APOE-ε4 alleles (0, 1, or 2) was determined by the single nucleotide polymorphisms rs7412 and rs429358.28 ### Statistical Analyses We summarized demographic characteristics of the subsamples with MRI data who completed each of the following cognitive outcomes: fluid intelligence, numeric memory, prospective memory, pairs matching, Trail Making A, Trail Making B, reaction time, and symbol digit substitution. To evaluate the associations between regional volumes and cognitive outcomes, we used linear regression for all cognitive outcomes except for prospective memory, for which we used logistic regression (1 for correct on the first attempt and 0 otherwise). All models were adjusted for age, age squared, sex, race, the number of APOE-ε4 alleles, education, and assessment center as a proxy for geographic location. To facilitate comparison of effect sizes across ICV correction approaches, ROIs, and cognitive outcomes, we *z*-standardized all corrected and crude regional volumes, ICV, and continuous cognitive outcomes by subtracting the sample mean and dividing by the sample standard deviation. The factorial combination of brain volume measure, cognitive outcome, and ICV correction approaches leads to 2,784 distinct estimates. We evaluated the extent to which different ICV correction approaches yield consistent estimates when applied to the same brain volume ROI measure and cognitive outcome by 1) generating pairwise comparisons for correction approaches across regional volumes and cognitive outcomes; 2) summarizing derived statistics from pairwise comparisons; and 3) creating Manhattan plots to show how rates of statistical significance of associations varies across correction approach. Consistency of estimates using different ICV corrections was evaluated using Spearman-rank correlation coefficients for each possible pair of ICV correction methods and across all ROI and cognitive outcome combinations. We additionally generated two measures for the consistency of findings using alternative ICV correction methods: For the first measure, we calculated the proportion of pairs of correction methods without significantly different estimates across all ROIs using a conservative *z-*test.29 For the second measure of consistency, we calculated the proportion of pairs of correction approaches for which associations had consistent signs across all brain regions assessed. Specifically, associations were only considered to have opposite signs if both associations were statistically significantly different from the null. For the residual approach, we additionally evaluated the impact of varying age thresholds to define the normative sample in associations between the eight selected regional volumes linked to dementia and all cognitive outcomes. Specifically, we defined the normative sample using different age cutoffs (<60, <65, <70, <75, <80) and assessed whether the analytical choice of cutoffs affected estimated associations. We conducted two sensitivity analyses: First, to assess whether the estimates of the associations with cognitive outcomes are affected by observations at the extreme ends of the regional volume distributions, we additionally fit models excluding participants with regional volumes beyond extreme percentiles (1st and 99th; **Supplementary Material 3**). Second, we repeated analyses adjusting only for age and age squared to assess whether results are affected by adjustment set chosen. Finally, we additionally examined incident dementia as an outcome (**Supplementary Material 4**). ## Results Sample sizes across cognitive outcomes ranged from 27,147 to 39,349 (**Table 1**) with very similar demographic compositions. Slightly over half the sample members were female, average age ranged from 64.2 to 64.9 years, nearly all participants were White (ranging from 96.9% to 97.1%), and a majority reported A-level or above education (ranging from 80.7% to 81.5%). Very few individuals had a dementia diagnosis at the MRI visit (n=27) or developed incident dementia in follow-up (n=44). View this table: [Table 1:](http://medrxiv.org/content/early/2023/07/16/2023.07.14.23292678/T2) Table 1: Characteristics of analytic sample for each cognitive outcome. Associations of regional brain volumes with cognitive scores are shown in **Figure 1** and **Figure S1**. The associations varied in both sign and magnitude across ICV correction approaches. Crude estimates were typically farthest from the null, and proportional approaches were typically closest to the null, but sometimes with reversed sign compared to other approaches. For example, when using the crude approach, an increase of 0.114 (95% confidence interval [CI] 0.103 to 0.125) in fluid intelligence was associated with each unit increase in hippocampal volume. However, when using the adjustment approach, the increase was 0.055 (95% CI 0.043 to 0.068), while the proportional approach showed a decrease of -0.025 (95% CI -0.035 to -0.014). ![Figure 1:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/07/16/2023.07.14.23292678/F1.medium.gif) [Figure 1:](http://medrxiv.org/content/early/2023/07/16/2023.07.14.23292678/F1) Figure 1: Associations of volumes of the cuneus, entorhinal cortex, parahippocampal gyrus, precuneus, caudate nucleus, hippocampus, amygdala, and total grey matter with each cognitive outcome. Cognitive scores are signed such that higher values indicate better cognition. Specifically, cognitive scores for pairs matching, Trail Making A, Trail Making B, and reaction time refer to negative one times the score on those cognitive tests. Models adjusted for age, age squared, sex, race, the number of APOE-ε4 alleles, education, and assessment center. **Figure 2** shows comparisons across ICV correction approaches for the associations of all brain regions assessed with fluid intelligence, numeric memory, and Trail Making A and B. Specifically, it shows pairwise scatterplots of estimated associations, as well as Spearman-rank correlations and both measures of consistency for the 58 brain regions evaluated. Estimates when ICV corrections are based on the adjustment approach are highly correlated (*ρ* > 0.89) with coefficients from the residual approach. These two approaches both show only weak correlations with the proportional approach (*ρ* = 0.28 with the adjustment approach and *ρ* = 0.25 with the residual approach, both for fluid intelligence). Different ICV correction approaches sometimes produce both qualitatively different results. The proportional approach was most likely to produce an association with a reversed sign that reaches statistical significance. Even when point estimates are not of the opposite side of the null, estimates may be statistically significantly inconsistent. Supplementary **Figure S2** is the same plot for the remaining cognitive outcomes. ![Figure 2:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/07/16/2023.07.14.23292678/F2.medium.gif) [Figure 2:](http://medrxiv.org/content/early/2023/07/16/2023.07.14.23292678/F2) Figure 2: Comparison of coefficients based on alternative ICV corrections for estimated associations of 58 brain region volume measures with A. fluid intelligence, B. numeric memory, C. Trail Making A, and D. Trail Making B. Pairwise comparisons for estimated associations between volumetric measures and fluid intelligence among six total intracranial volume (ICV) correction approaches. Diagonal panels are smoothed density estimates for the distribution of associations across all 58 brain regions assessed under the corresponding ICV correction approach. Lower diagonal panels are scatterplots for the associations under each pair of ICV correction approaches across all 58 brain regions assessed. Upper diagonal panels show the correlation and two consistency measures between each pair. Spearman’s rho (range from -1 to 1) stands for Spearman-rank correlation coefficient; 1 – %sig. different (range from 0 to 1) is one minus the proportion of pairs with significantly different estimates across all ROIs using a conservative Z-test; and 1 – %qual. different (range from 0 to 1) is the one minus proportion of pairs with significantly opposite signs. Color intensity represents the value, with fully saturated colors indicating a 1. Discrepancies between ICV correction approaches were prominent for fluid intelligence, numeric memory, and Trail Making A and B. Numeric memory and Trail Making A and B are often used in clinical settings as indicators of simple attention/working memory and executive function, respectively, are known to be affected in more advanced stages of Alzheimer’s disease and are prominent areas of cognition affected in other forms of dementia (e.g., frontotemporal dementia).30–32 Associations with prospective memory were less appreciably affected by the ICV correction approach, but this may reflect a single limited assessment task in UK Biobank. **Figure 3** shows brain-wide associations in a Manhattan plot33 for all cognitive outcomes and across all brain regions evaluated, comparing ICV correction approaches. Which and how many associations are statistically significant varies with ICV correction approach and cognitive outcome assessed. The crude approach produces the smallest *p*-values, consistent with the premise that failing to correct for ICV inflates associations and statistical significance. ![Figure 3:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/07/16/2023.07.14.23292678/F3.medium.gif) [Figure 3:](http://medrxiv.org/content/early/2023/07/16/2023.07.14.23292678/F3) Figure 3: Manhattan plot for association of cognitive outcomes with 58 regional brain volumes, comparing 6 ICV correction approaches. Dashed lines indicate the Bonferroni-corrected threshold for brain-wide significance (p<1.8×10-5). Number annotations indicate the number of brain-wide significant regional associations across all 58 brain regions assessed. The full-sample and normative-sample (age < 60) residual approaches produced highly correlated associations (**Figure 2**), with comparable *p*-values (**Figure 3**). However, estimated associations can vary, as shown in **Figure 1**, with estimates with the normative-sample residual approach typically attenuated relative to those of approach the full-sample residual approach. **Figure 4** extends this, showing how using an increasingly older, and presumably less healthy sample with more age-associated neurodegeneration, delivers larger estimated associations between hippocampal volume and fluid intelligence. Consistent with results in **Figure 1**, **Figure 4** shows an increasingly younger sample produces an attenuated association between selected regions and fluid intelligence, numeric memory, and Trail Making B. This highlights that definition of normative sample can affect estimated associations. In sensitivity analyses, we find that results are comparable excluding outliers in ICV and with different adjustment sets. See supplemental **Figures S3**-**S6**. In addition, we did not observe similar inconsistency in time-to-event analysis (**Figure S7**). ![Figure 4:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/07/16/2023.07.14.23292678/F4.medium.gif) [Figure 4:](http://medrxiv.org/content/early/2023/07/16/2023.07.14.23292678/F4) Figure 4: Effect of definition of residual sample on associations between volumes of the cuneus, entorhinal cortex, parahippocampal gyrus, precuneus, caudate nucleus, hippocampus, amygdala, and total grey matter and all cognitive outcomes. ## Discussion We evaluated whether estimated associations between regional brain volumetric measures and cognitive outcomes differed when using crude volumes or five alternative approaches to correct for ICV using the large UK Biobank MRI subsample. Although estimates based on the adjustment and residual approaches were similar, estimates based on the proportional and crude approaches were inconsistent. Inconsistencies were largest when estimates from the adjustment and residual approaches were further from the null. The proportional, adjustment, and residual approaches are all commonly used strategies10 to correct for ICV, and crude estimates are frequently presented as well.13 However, we found that the proportional approach frequently produced estimates that were inconsistent with understanding of neurobiology and even have the opposite sign of estimates derived with adjustment and residual approaches. As an example, the hippocampus plays a vital role in encoding and consolidation of new memories.34 Numerous studies have demonstrated an association between hippocampal atrophy and poorer neuropsychological test performance, particularly with regards to memory tasks.35–38 This relationship is further corroborated by existing experimental studies and clinical case studies involving direct damage to the hippocampus 39–41. However, estimates for the association between hippocampal volume and cognitive outcomes were inconsistent across ICV correction approaches: the proportional indicated that larger hippocampal volume was associated with worse cognition, which is contravenes extant understanding of neurobiology and neurodegenerative diseases (see **Figure 1**).37,42 While more often consistent, even adjustment and residual approaches do not always produce consistent estimated associations, particularly since the associations produced with the residual approach depended on the normative sample adopted (see **Figure 4**). Specifically, we find using increasingly younger normative samples attenuated estimated associations between hippocampal volume and fluid intelligence. These results have important implications for reproducibility in neuroimaging studies, our understanding of disease biology, and intervention evaluation. We recognize that these ICV correction methods represent different biological constructs and that method employed may depend on the nature of the research question. For example, in some studies, we may not want to account for ICV to account for perinatal and childhood factors that influence cranium size and regional volumes. However, findings from different studies may diverge merely because of the selected ICV correction approach—ostensibly a minor statistical decision. Estimated associations from crude volumes tended to be further from the null than findings after ICV correction, suggesting that ICV captures confounding by lifetime peak brain size or childhood growth. If atrophy-related neurodegeneration is the construct we intend to capture, ICV correction will often be necessary. Moreover, it is important to note that ICV correction is also crucial for accurately identifying and interpreting volumetric measures that are associated with cognitive functioning or dementia symptoms.5 Our findings suggest the proportional approach may be misleading and, if used, should be interpreted in conjunction with results from other approaches. The adjustment and residual approaches tend to produce comparable estimates and it may be reasonable to favor these two approaches. However, estimated associations with residual approaches can vary with reference sample used (**Figure 4**). Even more concerning, as the age of the normative sample is decreased, biologically plausible effects are increasingly attenuated, approaching the null as the sample becomes younger and healthier. In addition, the residual approach, when not applied to a separate normative sample, ideally would include a standard error correction to account for the fact that the two-stages of estimation are performed on overlapping samples. Thus, if ICV is to be corrected for, we would tend to favor adjustment over the other methods since it is the simplest to implement while producing biologically plausible associations. Our results are consistent with small prior studies indicating that ICV correction approach can affect study results. Previous studies have examined whether associations of sex and age with volumetric measure persisted across ICV correction approaches.14,15,43 Prior work has also examined associations between brain volumetric measures and cognition,27 finding that ICV correction approach altered associations, with flipped direction of association for the proportional method. This study was performed in two smaller and select samples (*N* = 406 and 724) and examined only the first item of word recall from the ADAS-Cog. The small samples have left substantial uncertainty in whether their conclusions hold for larger and less select samples. For example, previous work found that the association between hippocampal volumes and cognition was not statistically significantly different across ICV correction approach, possibly due to imprecise estimates. Our sample, with 25 to 35 times as many participants as the previous study and a much broader range of volumetric measures and cognitive outcomes assessed,27 provides far more conclusive findings on the importance of ICV correction approach. Our study has several strengths in addition to the large sample: these include comprehensive evaluation of ICV correction approaches and a wide range of cognitive outcomes evaluated. Employing a BWAS-inspired approach, we examined a large combination of associations between cognitive tests and regional brain volumes for a total of 2,784 regressions with several measures of consistency to further support the robustness of our findings. Our study has several limitations. First, our results only pertained to cross-sectional evaluation of MRI volumetric measures and cognitive outcomes. Other neuroimaging measures—including cortical thicknesses, diffusion tensor imaging, and longitudinal change in volumetric measures44—may warrant further investigation.3 UK Biobank participants are known to be healthier on average than the general UK population, have higher socioeconomic status, and are predominately White.45 Patterns may differ in more diverse populations.46 Finally, did not evaluate alternative ICV estimation methods,47 and we did not examine less commonly used methods (e.g., weighted ICV matching16). In conclusion, different ICV correction approaches can produce substantively different estimated associations between MRI-derived measures of brain volume and cognitive outcomes. These differences are largest when the associations are large. The proportional approach is most likely to produce estimates that are inconsistent with adjustment or residual control approaches and biologically implausible. Residual and adjustment approaches are more plausible but since they may produce different results, results based on only one of the approaches should be interpreted with caution. ## Supporting information Supplementary Material 1-4 and Supplementary Figure 1-7 [[supplements/292678_file04.docx]](pending:yes) ## Data Availability All data produced in the present study are available upon reasonable request to the authors. * Received July 14, 2023. * Revision received July 14, 2023. * Accepted July 16, 2023. * © 2023, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NoDerivs 4.0 International), CC BY-ND 4.0, as described at [http://creativecommons.org/licenses/by-nd/4.0/](http://creativecommons.org/licenses/by-nd/4.0/) ## References 1. 1.Aggarwal, N. T. et al. The Association of Magnetic Resonance Imaging Measures With Cognitive Function in a Biracial Population Sample. Arch Neurol 67, 475–482 (2010). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/archneurol.2010.42&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20385915&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F16%2F2023.07.14.23292678.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000276559500016&link_type=ISI) 2. 2.Scheltens, P., Fox, N., Barkhof, F. & De Carli, C. Structural magnetic resonance imaging in the practical assessment of dementia: beyond exclusion. The Lancet Neurology 1, 13–21 (2002). 3. 3.Barnes, J. et al. Head size, age and gender adjustment in MRI studies: a necessary nuisance? Neuroimage 53, 1244–1255 (2010). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neuroimage.2010.06.025&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20600995&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F16%2F2023.07.14.23292678.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000282165800007&link_type=ISI) 4. 4.Voevodskaya, O. et al. The effects of intracranial volume adjustment approaches on multiple regional MRI volumes in healthy aging and Alzheimer’s disease. Frontiers in Aging Neuroscience 6, (2014). 5. 5.Edland, S. D. et al. Total intracranial volume: Normative values and lack of association with Alzheimer’s disease. Neurology 59, 272–274 (2002). 6. 6.Hackman, D. A. & Farah, M. J. Socioeconomic status and the developing brain. Trends in Cognitive Sciences 13, 65–73 (2009). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.tics.2008.11.003&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19135405&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F16%2F2023.07.14.23292678.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000263635900004&link_type=ISI) 7. 7.Chertkow, H. & Black, S. Imaging Biomarkers and their Role in Dementia Clinical Trials. Canadian Journal of Neurological Sciences 34, S77–S83 (2007). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17469688&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F16%2F2023.07.14.23292678.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000245632100016&link_type=ISI) 8. 8.Veitch, D. P. et al. Understanding disease progression and improving Alzheimer’s disease clinical trials: Recent highlights from the Alzheimer’s Disease Neuroimaging Initiative. Alzheimer’s & Dementia 15, 106–152 (2019). 9. 9.Opfer, R. et al. Single-subject analysis of regional brain volumetric measures can be strongly influenced by the method for head size adjustment. Neuroradiology 64, 2001–2009 (2022). 10. 10.O’Brien, L. M. et al. Adjustment for Whole Brain and Cranial Size in Volumetric Brain Studies: A Review of Common Adjustment Factors and Statistical Methods. Harvard Review of Psychiatry 14, 141–151 (2006). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1080/10673220600784119&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16787886&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F16%2F2023.07.14.23292678.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000238697400002&link_type=ISI) 11. 11.O’Brien, L. M. et al. Statistical adjustments for brain size in volumetric neuroimaging studies: Some practical implications in methods. Psychiatry Research: Neuroimaging 193, 113–122 (2011). 12. 12.Sanfilipo, M. P., Benedict, R. H. B., Zivadinov, R. & Bakshi, R. Correction for intracranial volume in analysis of whole brain atrophy in multiple sclerosis: the proportion vs. residual method. Neuroimage 22, 1732–1743 (2004). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neuroimage.2004.03.037&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15275929&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F16%2F2023.07.14.23292678.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000223156000030&link_type=ISI) 13. 13.Van Horn, J. D., Bowman, I., Joshi, S. H. & Greer, V. Graphical Neuroimaging Informatics: Application to Alzheimer’s Disease. Brain Imaging Behav 8, 300–310 (2014). 14. 14.Kijonka, M. et al. Whole Brain and Cranial Size Adjustments in Volumetric Brain Analyses of Sex- and Age-Related Trends. Frontiers in Neuroscience 14, (2020). 15. 15.Sanchis-Segura, C., Ibañez-Gual, M. V., Aguirre, N., Cruz-Gómez, Á. J. & Forn, C. Effects of different intracranial volume correction methods on univariate sex differences in grey matter volume and multivariate sex prediction. Sci Rep 10, 12953 (2020). 16. 16.Dhamala, E. et al. Proportional intracranial volume correction differentially biases behavioral predictions across neuroanatomical features and populations. 2022.03.15.483970 Preprint at [https://doi.org/10.1101/2022.03.15.483970](https://doi.org/10.1101/2022.03.15.483970) (2022). 17. 17.Brunnström, H., Gustafson, L., Passant, U. & Englund, E. Prevalence of dementia subtypes: A 30-year retrospective survey of neuropathological reports. Archives of Gerontology and Geriatrics 49, 146–149 (2009). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.archger.2008.06.005&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18692255&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F16%2F2023.07.14.23292678.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000266733300029&link_type=ISI) 18. 18.Goodman, R. A. et al. Prevalence of dementia subtypes in United States Medicare fee-for-service beneficiaries, 2011–2013. Alzheimer’s & Dementia 13, 28–37 (2017). 19. 19.Marek, S. et al. Reproducible brain-wide association studies require thousands of individuals. Nature 603, 654–660 (2022). 20. 20.Littlejohns, T. J. et al. The UK Biobank imaging enhancement of 100,000 participants: rationale, data collection, management and future directions. Nat Commun 11, 2624 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-020-15948-9&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32457287&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F16%2F2023.07.14.23292678.atom) 21. 21.Alfaro-Almagro, F. et al. Image processing and Quality Control for the first 10,000 brain imaging datasets from UK Biobank. NeuroImage 166, 400–424 (2018). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/J.NEUROIMAGE.2017.10.034&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F16%2F2023.07.14.23292678.atom) 22. 22.Henneman, W. J. P. et al. Hippocampal atrophy rates in Alzheimer disease. Neurology 72, 999–1007 (2009). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1212/01.wnl.0000344568.09360.31&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19289740&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F16%2F2023.07.14.23292678.atom) 23. 23.Juottonen, K., Lehtovirta, M., Helisalmi, S., Sr, P. J. R. & Soininen, H. Major decrease in the volume of the entorhinal cortex in patients with Alzheimer’s disease carrying the apolipoprotein E ε4 allele. *Journal of Neurology*, Neurosurgery & Psychiatry 65, 322–327 (1998). 24. 24.Feng, F. et al. Radiomic Features of Hippocampal Subregions in Alzheimer’s Disease and Amnestic Mild Cognitive Impairment. Front Aging Neurosci 10, 290 (2018). 25. 25.Ries, M. L. et al. Magnetic Resonance Imaging Characterization of Brain Structure and Function in Mild Cognitive Impairment: A Review. Journal of the American Geriatrics Society 56, 920–934 (2008). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/j.1532-5415.2008.01684.x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18410325&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F16%2F2023.07.14.23292678.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000255486000021&link_type=ISI) 26. 26.Apostolova, L. G. & Thompson, P. M. Mapping progressive brain structural changes in early Alzheimer’s disease and mild cognitive impairment. Neuropsychologia 46, 1597–1612 (2008). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neuropsychologia.2007.10.026&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18395760&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F16%2F2023.07.14.23292678.atom) 27. 27.Voevodskaya, O. et al. The effects of intracranial volume adjustment approaches on multiple regional MRI volumes in healthy aging and Alzheimer’s disease. Front Aging Neurosci 6, 264 (2014). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3389/fnagi.2014.00264&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25339897&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F16%2F2023.07.14.23292678.atom) 28. 28.Bekris, L. M., Yu, C.-E., Bird, T. D. & Tsuang, D. W. Review Article: Genetics of Alzheimer Disease. J Geriatr Psychiatry Neurol 23, 213–227 (2010). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1177/0891988710383571&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21045163&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F16%2F2023.07.14.23292678.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000283714000002&link_type=ISI) 29. 29.DeGroot, M. H. & Schervish, M. J. Probability and statistics. (Pearson Education, 2012). 30. 30.Castel, A. D., Balota, D. A. & McCabe, D. P. Memory efficiency and the strategic control of attention at encoding: impairments of value-directed remembering in Alzheimer’s disease. Neuropsychology 23, 297–306 (2009). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19413444&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F16%2F2023.07.14.23292678.atom) 31. 31.Peña-Casanova, J., Sánchez-Benavides, G., de Sola, S., Manero-Borrás, R. M. & Casals-Coll, M. Neuropsychology of Alzheimer’s Disease. Archives of Medical Research 43, 686–693 (2012). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23072720&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F16%2F2023.07.14.23292678.atom) 32. 32.Ashendorf, L. et al. Trail Making Test errors in normal aging, mild cognitive impairment, and dementia. Arch Clin Neuropsychol 23, 129–137 (2008). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.acn.2007.11.005&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18178372&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F16%2F2023.07.14.23292678.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000254657400001&link_type=ISI) 33. 33.Ehret, G. B. Genome-Wide Association Studies: Contribution of Genomics to Understanding Blood Pressure and Essential Hypertension. Curr Hypertens Rep 12, 17–25 (2010). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s11906-009-0086-6&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20425154&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F16%2F2023.07.14.23292678.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000274462100004&link_type=ISI) 34. 34.Reber, P. J. The neural basis of implicit learning and memory: A review of neuropsychological and neuroimaging research. Neuropsychologia 51, 2026–2042 (2013). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neuropsychologia.2013.06.019&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23806840&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F16%2F2023.07.14.23292678.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000324085200026&link_type=ISI) 35. 35.Rusinek, H. et al. Regional Brain Atrophy Rate Predicts Future Cognitive Decline: 6-year Longitudinal MR Imaging Study of Normal Aging. Radiology 229, 691–696 (2003). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1148/radiol.2293021299&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=14657306&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F16%2F2023.07.14.23292678.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000186717700012&link_type=ISI) 36. 36.Golomb, J. et al. Hippocampal atrophy in normal aging. An association with recent memory impairment. Arch Neurol 50, 967–973 (1993). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/archneur.1993.00540090066012&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=8363451&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F16%2F2023.07.14.23292678.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1993LV72300012&link_type=ISI) 37. 37.Peng, G.-P. et al. Correlation of Hippocampal Volume and Cognitive Performances in Patients with Either Mild Cognitive Impairment or Alzheimer’s disease. CNS Neuroscience & Therapeutics 21, 15–22 (2015). 38. 38.Schuff, N. et al. MRI of hippocampal volume loss in early Alzheimer’s disease in relation to ApoE genotype and biomarkers. Brain 132, 1067–1077 (2009). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/brain/awp007&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19251758&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F16%2F2023.07.14.23292678.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000265269600029&link_type=ISI) 39. 39.Impaired remote spatial memory after hippocampal lesions despite extensive training beginning early in life - Clark - 2005 - Hippocampus - Wiley Online Library. [https://onlinelibrary.wiley.com/doi/abs/10.1002/hipo.20076?casa\_token=ri9Jsj303o4AAAAA:SBtlEOO3\_8LlO1qOZJuuS1nKJPOSx-XPqaGt5yjDXGW7R01nXT4T28QK5WwhYvbj8cphJGIDzBcf\_VM4](https://onlinelibrary.wiley.com/doi/abs/10.1002/hipo.20076?casa_token=ri9Jsj303o4AAAAA:SBtlEOO3_8LlO1qOZJuuS1nKJPOSx-XPqaGt5yjDXGW7R01nXT4T28QK5WwhYvbj8cphJGIDzBcf_VM4). 40. 40.Critical role of the hippocampus in memory for sequences of events | Nature Neuroscience. [https://www.nature.com/articles/nn834](https://www.nature.com/articles/nn834). 41. 41.Dickerson, B. C. & Eichenbaum, H. The Episodic Memory System: Neurocircuitry and Disorders. Neuropsychopharmacol 35, 86–104 (2010). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/npp.2009.126&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19776728&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F16%2F2023.07.14.23292678.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000272784600006&link_type=ISI) 42. 42.Hardcastle, C. et al. Contributions of Hippocampal Volume to Cognition in Healthy Older Adults. Front Aging Neurosci 12, 593833 (2020). 43. 43.Nordenskjöld, R. et al. Intracranial volume normalization methods: Considerations when investigating gender differences in regional brain volume. Psychiatry Research: Neuroimaging 231, 227–235 (2015). 44. 44.Schwarz, C. G. et al. A large-scale comparison of cortical thickness and volume methods for measuring Alzheimer’s disease severity. NeuroImage: Clinical 11, 802–812 (2016). 45. 45.Fry, A. et al. Comparison of Sociodemographic and Health-Related Characteristics of UK Biobank Participants With Those of the General Population. Am J Epidemiol 186, 1026– 1034 (2017). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/aje/kwx246&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=28641372&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F16%2F2023.07.14.23292678.atom) 46. 46.Mukadam, N., Marston, L., Lewis, G. & Livingston, G. Risk factors, ethnicity and dementia: A UK Biobank prospective cohort study of White, South Asian and Black participants. PLoS One 17, e0275309 (2022). 47. 47.Buckner, R. L. et al. A unified approach for morphometric and functional data analysis in young, old, and demented adults using automated atlas-based head size normalization: reliability and validation against manual measurement of total intracranial volume. Neuroimage 23, 724–738 (2004). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neuroimage.2004.06.018&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15488422&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F07%2F16%2F2023.07.14.23292678.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000224817100029&link_type=ISI) [1]: /embed/inline-graphic-1.gif [2]: /embed/inline-graphic-2.gif