Abstract
Brain growth is affected by a broad range of childhood conditions that affect cognitive development, but definitive growth curves for the brain throughout childhood have not been available. We studied brain volume growth using 1,067 MRI scans from 505 healthy children from birth through age 18. Brain volume peaked at 10–12 years of age. Males exhibited larger age-adjusted total brain volumes than females, and body size normalization procedures did not eliminate this difference. Other significant gender-based differences were found in cerebrospinal fluid (CSF) accumulation, grey and white matter volumes, and lateralization between left and right temporal lobes and hippocampi. A significant correlation between cognitive scores and brain volume was found in the years leading up to the adolescent brain volume peak. The ratio of brain to CSF volume, however, uncovered a universal age-dependent relationship independent of gender or body size. These findings enable the use of normative growth curves in managing a broad range of childhood diseases in which cognitive development and brain growth are impaired.
The study of brain size and growth has a long and contentious history(1–4). In the pre-MRI era, post-mortem studies provided insight into brain volume changes over the lifespan, but such methods suffered from inherent inaccuracies(5). The magnetic resonance imaging (MRI) era enabled detailed in-vivo volumetrics including components of the brain, but there has not been a definitive analysis of normal brain growth throughout the entire pediatric age range(6–9). We sought to create normative growth curves for the human brain in order to enable improved management of a broad range of childhood diseases where cognitive development and brain growth are impaired.
The MRI era has facilitated computational in vivo structural brain quantification, which we leveraged to analyze 1,067 MRI scans from healthy pediatric participants in the NIH Pediatric MRI Repository (https://nda.nih.gov), under an institutional data use agreement between The Pennsylvania State University and the National Institute of Mental Health. This MRI repository was developed based on a scaled-down United States (US) census with rigorous exclusion criteria, with the goal of providing a standard representation of the socioeconomic, gender, and ethnic distribution of healthy normal US children(10). The cross-sequential cohort contains participants in each year of life, ranging from 13 days to 22 years old (Supplemental Figure 1).
Studies defining brain volume growth patterns in the MRI era have suffered from small sample sizes, limited algorithm technology, incomplete coverage of the pediatric age range, retrospective cohorts taken from clinical patients, and an inconsistent array of curve fitting techniques(6–8, 11, 12). The current study addresses these limitations to develop normative growth curves for the US pediatric population. This cross-sequential data includes 505 subjects (259 female), of which the majority have two or three longitudinal MRI scan sets, leading to a total number of 1067 healthy MRI scans (Supplemental Figure 1c).
The T2 MRI images of the neonatal population were processed using the Developing Human Connectome Project (dHCP) Pipeline in order to appropriately segment the rapidly growing and incompletely myelinated brains(13). The T1 images of the older subjects were processed using the Computational Anatomy Toolbox (CAT) within the Statistical Parametric Mapping (SPM12) software(14). The resulting segmentation images (Supplemental Figure 2) were manually inspected to ensure appropriate labelling of these compartments and regions.
The volumes quantified by the dHCP and CAT12 pipelines were then fit using Smoothing Splines ANOVA (SSANOVA) with random effects (to account for the cross-sequential design of the study) to define time periods of significant differences between genders and hemispheres (Figure 1, 2, and Supplemental Figure 3)(15). The volumes were also fit using Generalized Additive Models for Location, Scale, and Shape (GAMLSS) with a Box-Cox power exponential (BCPE) distribution (Supplemental Figures 4 and 5), which is the platform and distribution leveraged by the World Health Organization to develop their standard growth curves for weight, height, and head circumference(16, 17).
Males exhibited larger overall brain volumes than females throughout childhood (Figure 1a). The volume for females peaked at 10.7 years, and the volume for males peaked at 11.2 years, followed by a slow, but consistent decrease. Although this early adolescent peak has been noted(11), data from early childhood (< 4 years of age) was not previously incorporated. Cerebrospinal fluid (CSF) increased throughout childhood, with male fluid accumulation significantly larger than female after the third year of life (Figure 1b). Grey matter (Figure 1c) peaked at 7.5 years for males and 7.4 years for females, while white matter (Figure 1d) continued to progressively increase into early adulthood(18). The ratio of grey/white matter (Figure 1e) showed an initial increase, peaking before 2 years of age and followed by a progressive decrease thereafter. Female grey/white matter ratios were significantly larger than male ratios between ages 9 and 11 although the difference was small (Figure 1e).
We performed body size normalization to assess if gender differences in brain volume persisted. Normalizing the brain by body size is not a new concept; allometry, or differential growth, of the brain with respect to body size was discussed in depth by D’Arcy Wentworth Thompson in 1917 in On Growth and Form(19). Gould, in The Mismeasure of Man, attempted and failed to eliminate gender differences through body size normalization procedures(1). Nevertheless, much of the volumetric brain study in the MRI era has not accounted for anthropometric normalization. Figures 1f and 1g show brain volume normalized by height-for-age and weight-for-height, respectively, which did not eliminate the gender-based differences in volume. Muscle mass content, greater in males, has been correlated with larger brain volumes although not with higher cognitive capability(20). However, the ratio of brain to CSF volume demonstrated no significant gender differences at any age, without applying anthropometric normalization (Figure 1h).
Figure 2a and 2b show that there was no lateralizing difference in size between right and left hemispheres for males and females, nor for cerebellum, frontal, parietal, or occipital lobes (Supplemental Figure 2). Figure 2c and 2d show that the left temporal lobe was significantly larger than the right for both males and females, which has been controversial(21). In addition, the hippocampi were significantly larger on the right than the left side for both genders (Figure 2e and 2f). Age-adjusted hippocampal and temporal lobe volume assessment may be of value in diagnosing and treating medically refractory epilepsy in childhood(21).
Cognitive scores showed a small but significant correlation with brain volume in the four years leading up to the peak in volume (Figure 3). The Mental Development Index (MDI) scores for infants from birth to age three were not significantly predicted by brain volume (Figure 3b), but the Wechsler Abbreviated Scale of Intelligence (WASI) scores for ages 6–18 years were significantly correlated with brain volume z-scores (Figure 3a), as described previously(22). Using a sliding window across age, we found that the correlation between cognitive score and brain volume was significant for raw brain volume, age-adjusted brain volume z-score, and weight-for-height normalized volumes in the years immediately preceding the brain volume peak (Figure 3c-3e). This correlation is not maintained when separating into smaller cohorts by gender (Supplemental Figure 6).
In 1987, Roche et al created head circumference growth curves from studying 888 healthy US children(23), and such normative head circumference curves from US(24) and World Health Organization(16, 25) cohorts are now in routine clinical practice as indirect metrics of brain growth. Figure 4 illustrates the analogous GAMLSS pediatric brain volume growth curves for males and females (Figure 4a and 4b), with early brain volume growth and CSF volume insets included, as well as the brain/CSF ratio (Figure 4c and 4d) for a more comprehensive presentation of childhood normative brain growth suitable for clinical settings. The apparent universal nature of the age-dependent brain/CSF ratio, regardless of gender or body size, suggests that the role of this ratio warrants clinical investigation(26).
Brain volume measurement became a field of study of biological determinism pioneered by Samuel Morton in the mid-1800s(1, 27). Morton filled over 1000 cranial vaults with mustard seed and lead shot to determine brain volume, which he then compared between races and genders. A century and a half later Gould used Morton as a case study in scientific bias(1). Decades after the publishing of these analyses in The Mismeasure of Man, arguments over biases and flaws continue in the assessment of the volume of the normal human brain(2–4).
Our findings demonstrate that for a broad spectrum of human disease affecting neurocognitive development and brain growth – ranging from neonatal infection(28) to malnutrition(29) and hydrocephalus(30) – measuring brain growth with respect to normative values is now feasible. The small association observed for brain size within the normal range with cognitive performance within the normal range will likely be magnified in children with disease early in life that substantially impacts brain growth. A major challenge for children’s medicine is now how to construct the frameworks needed to improve cognitive outcomes by optimizing brain growth based upon interventions(31). We anticipate that these findings will enable more personalized optimization of treatment and care for a broad range of debilitating childhood conditions.
Data Availability
All data and code will be made available upon final peer reviewed publication of the article.
Author Contributions
Conception and design: SJS, MRP. Acquisition of data: MRP, VC. Analysis and interpretation of data: MRP, VC, VM, JNP, AK, BCW, SJS. Drafting the article: MRP, SJS. Critically revising the article: all authors. Reviewed and approved submitted version of manuscript: all authors.
Competing Interests
The authors declare no competing interests.
Supplementary Materials
Cohort Characteristics
The MRI scans used in this study were taken from the NIH Pediatric MRI Repository (https://nda.nih.gov) under an institutional data use agreement (214908) between The Pennsylvania State University and the National Institute of Mental Health approved on January 14, 2019, and a determination (STUDY00010883) by the Penn State Institutional Review Board that this activity does not meet the definition of human subject research and does not require IRB review and approval. This repository was developed using a scaled-down United States census (in order to appropriately represent the demographic characteristics of the entire US pediatric population) and included rigorous exclusion criteria to ensure healthy participants with normal brain development. The repository aimed to achieve two-year longitudinal follow-up scans for individual participants, and was able to accomplish this for 378 of the subjects, making this a cross-sequential study(10). The cross-sequential format is ideal for the development of growth curves(7, 32). The number of subjects in this study was 505 (259 female), with a total of 1067 MRI scans due to the longitudinal nature of the cohort. The minimum age was 13 days, and the maximum age was 22 years, but only scans from subjects up to 18 years old were included to develop growth curves representative of the pediatric age range. Scans existed for participants in each year of life throughout the pediatric age range, as seen in Supplemental Figure 1.
Segmentation Algorithms
Currently, no single algorithm exists that can reliably segment both young infants and older subjects(33, 34). This is particularly due to the myelination changes that do not resolve until approximately two years of age(35). These myelination changes lead to difficulty in establishing intensity thresholds between grey matter, white matter, and CSF(35). In order to maximize thresholding intensities, neonatal segmentation techniques most commonly rely on T2 weighted MRI scans, rather than the T1 weighted scans that are the predominant scan type used in older cohort segmentation algorithms. For these reasons, two different algorithms were used in this study. The neonates were assessed using the Developing Human Connectome Project (dHCP) pipeline, which required T2 images and was run through a virtual Docker container to access a Linux computer system(13). The older subjects were assessed using the Computational Anatomy Toolbox 12 (CAT12) within the Statistical Parametric Mapping (SPM) platform using Matlab 2019b, which relies on T1 images(14). Each of the resulting scan sets was manually curated to ensure that appropriate skull-stripping and segmentation were accomplished. Upon establishing the volumes determined by each segmentation procedure, the accompanying atlases were used to compile volumes for the desired regions from smaller sections of the brain(36, 37).
Smoothing Splines ANOVA
The compartment differences between males and females were explored within the R platform using non-parametric Smoothing Splines ANOVA models with a random effects component added to account for the cross-sequential aspect of the data(15). The model included age and gender or hemisphere as main factors, as well as an interaction term. Time periods with significant gender or hemispheric differences were defined as regions where there was no overlap between the Bayesian 95% confidence intervals calculated for the gender and hemispheric factors. These regions of significant difference were highlighted on the plotted models, and the time period of significance was documented as well.
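The non-overlap criterion described above can be illustrated with a short sketch. This is not the authors' R code: it assumes the fitted curves have already been evaluated on a common age grid, and the function name and argument names are hypothetical. It only shows how periods of significant difference could be read off two sets of interval bounds.

```python
def significant_periods(ages, lower_a, upper_a, lower_b, upper_b):
    """Return (start_age, end_age) spans where two interval bands do not overlap.

    ages: increasing grid of ages at which both fitted curves were evaluated.
    lower_a/upper_a: interval bounds for group A (e.g. males) on that grid.
    lower_b/upper_b: interval bounds for group B (e.g. females) on that grid.
    All names are illustrative assumptions, not taken from the study's code.
    """
    periods = []
    start = None      # age at which the current separated span began
    prev_age = None   # last grid age visited
    for age, la, ua, lb, ub in zip(ages, lower_a, upper_a, lower_b, upper_b):
        # Bands are separated when one lies entirely above the other.
        separated = la > ub or lb > ua
        if separated and start is None:
            start = age
        elif not separated and start is not None:
            periods.append((start, prev_age))
            start = None
        prev_age = age
    if start is not None:
        # The separated span runs to the end of the grid.
        periods.append((start, prev_age))
    return periods
```

The resolution of the reported time periods in such a scheme is limited by the spacing of the age grid on which the intervals are evaluated.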
Generalized Additive Models for Location, Scale, and Shape (GAMLSS)
The smooth growth curves used to fit the volumes and other growth metrics included in this study were developed using the Generalized Additive Models for Location, Scale, and Shape software implemented in R(17). The Box-Cox power exponential (BCPE) distribution, which was chosen by the World Health Organization (WHO) for their standard growth curves, was used to model the volumes in this study(38). This distribution models the median for a nonparametric assessment, and appropriately accounts for kurtosis and skewness within the data. The growth curves were fitted using the default RS algorithm and were smoothed using fractional polynomials of the third order(17). A random effects component was added to the curve modeling procedure in order to account for the longitudinal aspect of the data. For the total brain tissue growth curves, we utilized data for subjects between 18–22 years of age to set the 18 year old volume intercepts, and the perinatal volume data from Huppi et al to set the volume intercepts at birth(39).
In order to determine the peaks of the brain tissue and grey matter curves, cftool within Matlab 2019b was used to fit differentiable rational polynomial functions (which were applied as the smoothing function in the GAMLSS curves)(40). The significant gender and hemispheric differences were found using the Mann-Whitney U-test within Matlab 2019b. A Bonferroni correction was applied so that significance was established with p<0.000066.
The weight for height and height for age normalizations were accomplished by fitting a GAMLSS curve to the weight for height and height for age data from the NIH repository for each gender. Based on these fits, percentiles were calculated for each subject, and the 50th percentile was set at 1, with percentiles above and below ranging from 0.5 to 1.5. The corresponding brain volume for each subject was then divided by this percentile value to achieve normalized brain volumes.
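The normalization step can be sketched as follows. The text specifies only that the 50th percentile maps to 1 and that the factors range from 0.5 to 1.5; the linear mapping between those points is an assumption made here for illustration, and the function names are hypothetical.

```python
def percentile_to_factor(percentile):
    """Map a body-size percentile (0-100) to a scaling factor in [0.5, 1.5].

    Assumption: the mapping is linear, so the 50th percentile gives 1.0,
    the 0th gives 0.5, and the 100th gives 1.5.
    """
    if not 0.0 <= percentile <= 100.0:
        raise ValueError("percentile must lie between 0 and 100")
    return 0.5 + percentile / 100.0


def normalize_volume(volume, percentile):
    """Divide a brain volume by the body-size factor for that subject."""
    return volume / percentile_to_factor(percentile)
```

Under this assumed mapping, a subject at the 50th percentile keeps the raw volume, while a subject at a higher percentile has the volume scaled down, and one at a lower percentile has it scaled up.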
Cognitive Score Correlations
Cognitive scoring was included in the NIH Pediatric MRI Repository study, with Wechsler Abbreviated Scale of Intelligence (WASI) Tests undertaken on participants ranging from 6 years of age to 18 years of age. The infants (from birth to 3 years of age) were assessed using the Bayley Scales of Infant Development, Second Edition (BSID-II) Mental Development Index (MDI). We fit a linear mixed effects model (with subject identification as the random effects component) to the appropriate cognitive score using the brain volume z-score values. While the linear fits showed positive slopes for each metric, only the WASI scores showed a significant fixed effect for the brain volume z-score. Plots of the windowed correlation with a subpopulation value of 120 subjects and an overlap value of 20 subjects were developed for raw brain volume, brain volume z-score, and weight-for-height normalized volume z-score.
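A minimal sketch of the windowed correlation is given below. It assumes that observations are ordered by age, that each window holds a fixed number of observations, and that consecutive windows share the stated overlap (so the window advances by window minus overlap). These choices, the use of Pearson correlation, and all function names are assumptions for illustration; the sketch does not compute significance and does not reproduce the mixed effects model.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (NaN if undefined)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return float("nan")
    return sxy / math.sqrt(sxx * syy)


def windowed_correlation(ages, volumes, scores, window=120, overlap=20):
    """Return a list of (mean_age, r) pairs for age-ordered sliding windows.

    window: number of observations per window (120 in the text).
    overlap: number of observations shared by adjacent windows (20 in the text).
    """
    if not 0 <= overlap < window:
        raise ValueError("overlap must be non-negative and smaller than window")
    order = sorted(range(len(ages)), key=lambda i: ages[i])
    step = window - overlap
    results = []
    for start in range(0, len(order) - window + 1, step):
        idx = order[start:start + window]
        mean_age = sum(ages[i] for i in idx) / window
        r = pearson([volumes[i] for i in idx], [scores[i] for i in idx])
        results.append((mean_age, r))
    return results
```

Plotting the returned correlation against the mean window age gives a curve of the kind described for raw volume, volume z-score, and weight-for-height normalized volume.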
Limitations
While the growth curves developed in this study provide a standard representation of the United States pediatric population, they can only act as a reference and not as a standard for other geographic regions. Other growth curves specific to particular regions should be developed using healthy cohorts derived from those regions in order to provide appropriately representative global standards.
Data Availability
The volumes used to create the SSANOVA and GAMLSS growth curves presented here are provided in a spreadsheet supplied in the online supplemental material (Supplemental Extended Data).
Figure Legends
Supplemental
Expanded Data
Supplemental Excel File: Supplemental_Master_File.xlsm
Acknowledgements
We are grateful to T. Sauer and S. Sinnar for helpful discussion, and to Y. Wang and J. Chai for technical help in compiling data. This research was supported by the Penn State and National Science Foundation Center for Healthcare Organization Transformation (CHOT) collaboration, US National Institutes of Health grants R01HD085853 (VC, AK, BCW, VM, SJS), and the NIH Director’s Transformative Award R01AI145057 (MRP, JNP, SJS).