Detecting mild traumatic brain injury with MEG, normative modelling and machine learning ======================================================================================== * Veera Itälinna * Hanna Kaltiainen * Nina Forss * Mia Liljeström * Lauri Parkkonen ## Abstract Diagnosis of mild traumatic brain injury (mTBI) is challenging, as the symptoms are diverse and nonspecific. Electrophysiological studies have discovered several promising indicators of mTBI that could serve as objective markers of brain injury, but we are still lacking a diagnostic tool that could translate these findings into a real clinical application. Here, we used a multivariate machine-learning approach to detect mTBI from resting-state magnetoencephalography (MEG) measurements. To address the heterogeneity of the condition, we employed a normative modeling approach and modeled MEG signal features of individual mTBI patients as deviations with respect to the normal variation. To this end, a normative dataset comprising 621 healthy participants was used to determine the variation in power spectra across the cortex. In addition, we constructed normative datasets based on age-matched subsets of the full normative data. To discriminate patients from healthy control subjects, we trained support vector machine classifiers on the quantitative deviation maps for 25 mTBI patients and 20 controls not included in the normative dataset. The best performing classifier made use of the full normative data across the entire age range. This classifier was able to distinguish patients from controls with an accuracy of 79%, which is high enough to substantially contribute to clinical decision making. Inspection of the trained model revealed that low-frequency activity in the theta frequency band (4–8 Hz) is a significant indicator of mTBI, consistent with earlier studies. The method holds promise to advance diagnosis of mTBI and identify patients for treatment and rehabilitation. **Significance statement** Mild traumatic brain injury is extremely common, but no definite diagnostic method is yet available. Objective markers for detecting brain injury are needed to direct care to those who would best benefit from it. We present a new approach based on MEG recordings that first explicitly addresses the variability in brain dynamics within the population through normative modeling, and then applies supervised machine-learning to detect pathological deviations related to mTBI. The approach can easily be adapted to other brain disorders as well and could thus provide a basis for an automated tool for analysis of MEG/EEG towards disease-specific biomarkers. Keywords * traumatic brain injury * magnetoencephalography * machine learning * resting-state * normative modeling ## Introduction Due to its high prevalence and potential long-term adverse health effects, accurate diagnosis of mild traumatic brain injury (mTBI) is of high importance. Objective diagnosis of mTBI remains a challenge, however, as structural imaging methods such as magnetic resonance imaging (MRI) as well as neuropsychological testing often fail to detect clinically significant abnormalities (Bigler et al., 2016; Dikmen et al., 2017). Diagnosis of mTBI after the acute phase is further complicated by posttraumatic symptoms that are nonspecific to mTBI and highly variable across patients (Wäljas et al., 2015). Studies employing noninvasive functional neuroimaging techniques such as functional magnetic resonance imaging (fMRI), electroencephalography (EEG) or magnetoencephalography (MEG) have provided group-level evidence of changes in brain activity following mTBI, even several months after the trauma and in the absence of clinical symptoms (Huang et al., 2014; Lewine et al., 2007; Pontifex et al., 2009; Rogers et al., 2015). New objective measures based on functional brain imaging might prove essential for improving the accuracy and reliability of the diagnosis, and for identifying patients who are at risk of chronic symptoms and would benefit from intervention. Electrophysiological recordings of brain activity, such as MEG and EEG, provide a range of measures (e.g. amount of low-frequency activity, posterior alpha frequency and power, alterations in functional connectivity) that reflect the altered functional state of brain regions and networks after mTBI (Castellanos et al., 2010; Dimitriadis et al., 2015; Haneef et al., 2013; Lewine et al., 2007). However, the ability to determine the clinical status of individual mTBI patients based on single – or univariate – measures is extremely limited (Lewine et al., 2019), and it is not clear which MEG/EEG measures are most informative of disease pathology. Thus, compound – or multivariate – analysis that jointly exploits multiple measures could potentially increase our ability to accurately detect pathology related to mTBI. Extraction of objective measures from functional imaging data in mTBI is particularly challenging since the mechanism, location and nature of the head insult, as well as the clinical symptoms are largely heterogeneous. Moreover, individual variation in brain activity is large even in the healthy population and the majority of mTBI patients experience a prompt recovery. Therefore, regarding patients and control subjects as clearly delineated and distinct groups, may not properly reflect the nature of this disorder (Kapur et al., 2012; Marquand et al., 2016). This variability can partially be addressed by normative modeling (Marquand et al., 2016), where the aim is to map the full range of normal variation within the population and quantify statistical deviations of individual patients. Different normative modeling approaches have recently been applied to neuroimaging data to study disorders such as schizophrenia (Kia and Marquand, 2019; Pinaya et al., 2019; Wolfers et al., 2018), dementia (Pinaya et al., 2021; Ziegler et al., 2014) and autism (Pinaya et al., 2019; Zabihi et al., 2019). The methods have shown significant promise in providing predictions of disease states at the level of individual subjects. In this study, we compare source-level power spectra computed from resting-state MEG recordings of mTBI patients and their healthy controls to a large normative reference dataset for the purpose of modeling the pathological features of individual mTBI patients as extreme values or deviations with respect to the normal variation. To discriminate the group of mTBI patients from healthy control subjects, we train a support vector machine (SVM) classifier (Vapnik, 1995) on the resulting quantitative deviation maps. A key question in normative modeling is the choice of the reference cohort, which should capture a wide range of variation in the population (Marquand et al., 2019). An important consideration is therefore the matching of the demographics of the normative reference data to the subject. As there is significant neurophysiological variation across demographic groups, interesting disease-related effects may be diluted if the applied normative data represents the whole population. Here, we explore this question by comparing the results obtained with age-matched and non-matched normative data. ## Materials and methods ### Datasets We employed a dataset originally measured by Kaltiainen and colleagues (Kaltiainen et al., 2019, 2018), comprising resting-state MEG recordings from 25 mild traumatic injury patients and 20 healthy controls. In addition, we employed a large, separate dataset, utilizing MEG recordings from a total of 621 healthy participants (Taylor et al., 2017) as normative data. ### mTBI patients and healthy controls The patient group consisted of 25 mild traumatic brain injury patients (11 females, 14 males) with a mean age of 42 years (range 20–59 years). The control group comprised 20 healthy subjects (8 females, 12 males) with a mean age of 39 years (range 19–58 years). All patients and controls were without neuropsychological disorders, medication affecting the central nervous system, substance abuse or earlier history of TBI. The study was approved by the Ethics Committee of Helsinki and Uusimaa Hospital District. All participants’ consent was obtained in accordance with the Declaration of Helsinki. The patients’ level of consciousness was assessed with the Glasgow Coma Scale (GCS) (Teasdale and Jennett, 1974) shortly after the injury. GCS addresses the level of consciousness, ranging from three (deep unconsciousness) to 15 (alert and awake). The GCS scores varied between 14 and 15, thus fulfilling the criteria for mTBI. All patients maintained TBI symptoms at their first MEG measurement. At their MEG measurement sessions, the patients filled in the Rivermead Post-Concussion Symptoms Questionnaire (RPQ) (King et al., 1995), which measures the severity of post-concussive symptoms after TBI with a five-step scale, compared with the situation before the accident. The maximum score is 64, but answering “no more of a problem as before the accident” yields one point. The scores of the questionnaire varied from 3 to 36 with an average of 17.2. The demographics of the patient group, as well as their GCS and RPQ scores, are presented in Table 1. All patients fulfilled the criteria for mTBI according to the American Congress of Rehabilitation Medicine (ACRM) criteria (Kay et al., 1993) with loss of consciousness of less than 30 minutes at the time of the accident, GCS varying between 13–15 at 30 min after the accident and the duration of post-traumatic amnesia less than 24h. View this table: [Table 1.](http://medrxiv.org/content/early/2022/09/30/2022.09.29.22280521/T1) Table 1. Demographics of the mTBI patients All patients underwent an MEG measurement within 6 months (26 weeks) after the trauma. The recordings of 12 patients were performed at the subacute stage within 2 months of the injury. The MEG measurements were performed at Aalto Neuroimaging MEG Core, Aalto University School of Science, Espoo, Finland, using a 306-channel whole-head MEG device (Elekta Neuromag; MEGIN Oy, Helsinki, Finland). During the recordings, data were filtered to 0.03–330 Hz and sampled at 1000 Hz. Electrocardiogram and horizontal and vertical electro-oculograms were measured for managing artifacts caused by heartbeat and eye movement, respectively. Here, from those recordings, we use one MEG session where the subjects rested with eyes closed for 10 minutes. The subjects were instructed to sit relaxed and avoid movement. The measurement was briefly paused twice to confirm that the subjects remained awake and alert. Anatomical MRI images (Signa HDX 1.5 T, General Electric, Milwaukee, WI, USA) were acquired from all subjects. MRIs from patients were acquired within one week to 16 months after the injury. Trauma lesions were detected in 15 of the 25 (60%) patients (see Table 1). ### Normative dataset A large open neuroimaging dataset by the Cambridge Centre for Ageing and Neuroscience (Cam-CAN) (Taylor et al., 2017), containing MEG and MRI measurements of nearly 700 healthy participants aged 18 to 87, was used for creating the normative reference data. The curated, cross-sectional Cam-CAN dataset contains measurements from approximately 50 men and 50 women in each age decade (18–27, 28–37, 38–47, 48–57, 58–67, 68–77 and 78–87 years). In this study, only the resting-state, eyes-closed MEG and anatomical MRI were used. Further details of the dataset are presented by Taylor and colleagues (Taylor et al., 2017). Subjects with missing or incomplete resting-state MEG measurements or T1-weighted MRI images were excluded from the analyses, resulting in a set of 621 subjects. The number of subjects in each age group was 56, 92, 104, 93, 95, 102 and 79, from the youngest to oldest. ## Data preprocessing ### Artifact removal and data augmentation The temporal extension of the signal space separation (tSSS) (Taulu and Simola, 2006) method implemented in the MaxFilter software package (MEGIN Oy) was used for reducing external artifacts in the MEG data. To suppress artifacts caused by cardiac activity and eye movement, independent component analysis (ICA) (Hyvärinen and Oja, 2000) was used for identifying components most prominently related to the aforementioned sources. In most cases, 1–2 components were removed, but for some subjects three or even four components were removed based on manual inspection of the spatial patterns and time courses of the components. The FastICA algorithm available in MNE-Python software (Gramfort et al., 2013) was used for the ICA processing. Data augmentation was applied on the samples by means of a sliding window with a length of 200 seconds and stride of 50 seconds. This resulted in seven time series for each subject and a total sample size of 315 data points. ### Source modeling An automated source-modeling pipeline was applied on the measurements to compute power spectral densities (PSDs) at each cortical location using the MNE-Python software (Gramfort et al., 2014). Reconstruction of each subject’s cortical surface was performed using the FreeSurfer software (Dale et al., 1999; Fischl et al., 1999) from T1-weighted anatomical MRI images. For the forward computation, a surface-based source space with the ico-4 decimation was created, resulting in a set of 5124 cortical locations at which the amplitudes of the current dipoles were estimated. A single-compartment BEM head model was formed based on brain surface tessellations obtained by the FreeSurfer watershed algorithm (Ségonne et al., 2004). The coregistration of the MEG and MRI coordinate frames was performed automatically using MNE-Python and fiducial points calculated using the FieldTrip (Oostenveld et al., 2011) toolbox in MATLAB. For the calculation of the inverse operator, noise covariance matrices were computed from recordings without a subject (“empty-room recording”) performed during the same measurement session. The ICA solutions computed for each subject’s recording were applied also to these empty-room recordings. The noise covariance matrix and the forward solution were used to compute the dSPM inverse solution, which was applied to the complex-valued 8192-point Fourier transform of the Hann-windowed (50% overlap) raw data over a frequency range of 1–40 Hz. A source-level PSD was obtained by taking the magnitude of the estimate at each source point, yielding a matrix *X* of size 5124 *(number of source locations)* × 319 *(number of frequency values)*. Finally, the subject-specific cortical PSDs were morphed to a reference brain (the “fsaverage” brain provided by FreeSurfer) to enable comparison of the power spectra across subjects (see Figure 1A). ![Figure 1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2022/09/30/2022.09.29.22280521/F1.medium.gif) [Figure 1.](http://medrxiv.org/content/early/2022/09/30/2022.09.29.22280521/F1) Figure 1. Analysis pipeline. (**A**) **Data preprocessing**. Source-level power spectra were first calculated for each cortical region and morphed to a common brain template. **(B) Construction of normative models**. The mean *μ* and standard deviation *σ* were calculated across subjects within the reference dataset to obtain normative data for each location and frequency. Three different types of normative models were constructed: a full normative model containing all participants from the reference dataset across the entire age range (depicted in blue, mean values across all frequencies and cortical locations are shown), an age-matched model containing a subset of participants within the same age range as the patient/control (depicted in green), and for comparison a random model with a subset of reference data of random ages (shown in red). (**C**) **Classification procedure**. The normative models were used for converting the power spectra of the patients and controls into deviation scores (z-scores). The deviation scores, binned into 448 cortical parcels and six frequency bands, were then entered into the classification procedure. ### Feature engineering To obtain normative models, the mean *μ* and standard deviation *σ* of the power spectra were calculated across the subjects from the normative dataset (Figure 1B). These statistics were then used for converting the power spectrum matrices for the patients and their healthy controls, into *deviations maps* of Z-scores over the entire cortex, calculated as ![Formula][1] where *X* ∈ **R**5124×319 is the power spectrum of an individual subject. In addition, data binning was performed both over the spatial and the frequency dimensions to reduce dimensionality. The brain sources were automatically grouped together according to an anatomical parcellation scheme with 448 cortical regions (Khan et al., 2018), where the activity of a cortical region was represented by the mean power of the source points within that region (see Figure 1C). In the frequency dimension, six frequency bands were defined: delta (1–4 Hz), theta (4–8 Hz), alpha (8–13 Hz), low beta (13–17 Hz), high beta (17–30 Hz), and gamma (30–40 Hz). The average was taken over the power values corresponding to each frequency interval (not shown in Figure 1). Binning the data into cortical parcels and into canonical frequency bands significantly reduced the dimensionality of the data: the resulting power spectra of size 448×6 were only 0.16% the size of the original data. Finally, the deviation matrices were flattened into feature vectors containing the six frequency features for each cortical region, resulting in 2688 features per subject. ## Model training and validation A support vector machine with a radial basis-function kernel was selected for classifying the measurements of mTBI patients and controls due to its ability to perform well in high-dimensional settings, even when the size of the dataset is smaller than the number of features (Nayak et al., 2015). A nested cross-validation strategy was selected for evaluating model performance. In the inner 5-fold cross-validation loop, the best values for the regularization hyperparameters C and γ were chosen based on the best average accuracy across the folds. In the outer 7-fold loop, the model was re-trained using the chosen hyperparameters and evaluated using the independent validation set of the fold. To reduce possible bias of a single cross-validation split, the nested procedure was repeated 5 times with a different split each round, resulting in a total of 35 folds in the outer loop. The splits were stratified by the target labels. The hyperparameter values tested in the inner cross-validation loop were 1, 5 and 10 for C and 0.1, 0.01 and 0.001 for γ. The penalty parameter C was weighted inversely proportional to class frequencies to avoid bias towards the positive class (patients) which had a small majority. The data were centered by subtracting the median and scaled to the range between the 1st and 3rd quartile of the data. The median and the interquartile range were calculated from the training data at each cross-validation fold and applied to both training and testing data before fitting and evaluating the model. The employed data augmentation approach resulted in multiple samples corresponding to the same subject, which leads to the samples being dependent. To avoid leaking information, the cross-validation splits were constructed so that the samples of any single subject were included only in the training or only in the validation set, but never both. The predicted class label for each subject, positive (patient) or negative (control), was determined by the label given to the majority of samples from that subject. To test the effect of using normative data from a specific age group, the model was trained and evaluated on an age-matched subset of the normative data, i.e., only the power spectra belonging to the same age decade as the subject were used. The age groups were defined according to the normative dataset as 18–27, 28–37, 38–47, 48–57 and 58–67 years. To ensure that the possible difference in the results was not due to the smaller size of the normative dataset, the model was also evaluated on a dataset where the normative data used for each subject were randomly subsampled so that the size of the normative dataset was equal to the number of normative samples in the subject’s age group. The results of the randomized procedure were averaged over three repetitions to reduce the effect of a single random sampling. In addition to the above three ways of employing the large normative dataset, we trained and evaluated the model on a dataset where the features were created from only the power spectra of the mTBI patients and their healthy controls, applying the same binning scheme as described earlier but without computing the Z-scores using normative data. This was done to assess the performance of the proposed normative modeling approach compared to classifying the power spectra as such. The statistical significance of the results was explored using permutation tests, where the model was cross-validated 1000 times with randomly permuted group labels for subjects. A *p*-value < 0.05 was considered significant. ## Estimating feature significance After verifying the predictive capabilities of the model, permutation feature importance (Breiman, 2001) was used for estimating how much the model relies on each individual feature for aiding the classification. The method can be used to interpret even nonlinear classifiers and is defined as the decrease in prediction performance (in this case, accuracy) when the values of the feature are randomly permuted while keeping other features intact. We expected that many of the features within the dataset would be highly correlated: for example, the alpha-band power values in two neighbouring regions are probably very similar. To reduce multicollinearity of the features, the number of features was reduced with a hierarchical clustering approach, where the Spearman rank correlations between features were clustered using Ward’s method. A threshold of 2 was manually selected to form the clusters, the first feature of each cluster was picked, and the resulting set of features was used to train the model with the cross-validation approach described before. The selection of features to be removed was performed only on the training data of each fold to avoid leaking information to the test set. The permutation importance was calculated on the test data of each fold and finally averaged across folds. ## Correlation of patient demographics and classification The effects of each mTBI patient’s age, timing of the MEG recording and RPQ score on the classifier’s performance (Table 1) were inspected by calculating the Spearman rank correlation coefficients. The fraction of times each patient was predicted correctly and the average decision function values of each patient were calculated over the five repeats of the 7-fold cross validation. The decision function values are proportional to the distance of the samples from the hyperplane separating the classes, and so they are indicators of the classifier’s confidence regarding a particular sample. The predicted class corresponds to the sign of the decision function output. ## Data availability The data that supports the findings of the study are available upon reasonable request from the authors. The dataset is not publicly available as it contains information that could compromise the privacy of the research participants. The Cam-CAN dataset is publicly available upon request at [https://camcan-archive.mrc-cbu.cam.ac.uk/dataaccess/](https://camcan-archive.mrc-cbu.cam.ac.uk/dataaccess/). ## Results ### Deviation scores Figure 2A shows the average deviation scores for the patient and control groups after binning the data in the spatial dimension. The patient group shows higher average activation compared to the normative dataset, mainly around 10 and 20 Hz as well as at frequencies over 30 Hz. The deviation values for the control subjects are overall slightly lower compared to the normative dataset, which might be an effect of different measurement sites. ![Figure 2.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2022/09/30/2022.09.29.22280521/F2.medium.gif) [Figure 2.](http://medrxiv.org/content/early/2022/09/30/2022.09.29.22280521/F2) Figure 2. (A) Group-average, relative power spectra of mTBI patients and healthy controls. Horizontal axis is frequency (Hz), vertical axis the cortical location (indices of the 448 cortical regions ordered alphabetically), and the color indicates the Z-score with respect to the normative data at that frequency and cortical location. **(B) Relative power spectra associated with different classification results**. The average Z-score maps for patients (first row) and controls (second row) by classification output, from left to right: correctly classified samples, incorrectly classified samples and the difference between the correctly and incorrectly classified samples. ### Classification performance The mean and standard deviation of the accuracy, sensitivity, and specificity across the cross-validation folds are reported in Table 2 for the three different approaches that were used for selecting the normative data (using the whole normative dataset, an age-group-matched subset, or a random subset of the same size as the age-matched set), as well as for the results obtained without using normative data. View this table: [Table 2.](http://medrxiv.org/content/early/2022/09/30/2022.09.29.22280521/T2) Table 2. Classification of MEG data from mTBI patients and healthy controls The largest accuracy (0.790) was achieved by using all available normative data. Using age-matched normative data yielded slightly lower results: the accuracy was 0.761 with feature selection by clustering. When comparing the results using age-matched normative data to a random sample of normative data of the same size, the randomly selected normative data yielded a notably lower accuracy of 0.711. Classification without the use of normative data yielded an accuracy of 0.786, which is only marginally lower than the highest value obtained. Permutation tests indicated that the accuracy of the classifier was significantly higher than chance level at *p* < 0.05 for all classification tasks. In all cases the classifier had a high sensitivity, with the largest value (0.914) obtained for the random normative dataset. The specificity of the classifier was notably lower, at most 0.638 with full normative data. Using age-matched normative data yielded a decrease in sensitivity and an increase in specificity when compared to a random selection of normative data. ### Model interpretation To gain insight to the decision function of the classifier, averaged spectral Z-score maps of both patients and controls were plotted for correctly and incorrectly classified subjects together with their difference; see Figure 2B. In this analysis, the correct vs. incorrect classification was based on the best performing model, which used all available normative data without age-group matching. Similarly to the observations from Figure 2A, the correctly classified patients seem to be characterized by larger Z-scores around the alpha (∼10 Hz) and beta (∼20 Hz) frequencies and also in the high end of the spectrum – the gamma band. Higher activation can also be seen in the slower waves of the theta band. The mTBI patients incorrectly classified as controls appear to be lacking these features at least at the group level, which is likely the reason for their misclassification. For the control group, the most notable difference between the correct and incorrect classifications is in the 10-Hz frequency band, which shows higher values for the false positives. The difference plots show that the values of the correctly classified patients are overall slightly higher than those of the incorrectly classified patients, while the inverse is true for the controls. Figure 3A shows the permutation importance of the features selected by the hierarchical clustering method. Only 30 features with the largest mean importance are shown. A clear majority of the most significant features correspond to the theta frequency band. The list also includes a few features from the alpha, delta and low beta frequency bands. Cortical areas prominently present among the most important features are mostly located in the parietal lobe, such as the supramarginal gyrus, postcentral gyrus, superior and inferior parietal lobule and precuneus. Other featured areas include the precentral gyrus in the posterior frontal lobe and the middle and inferior temporal cortex in the temporal lobe. ![Figure 3.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2022/09/30/2022.09.29.22280521/F3.medium.gif) [Figure 3.](http://medrxiv.org/content/early/2022/09/30/2022.09.29.22280521/F3) Figure 3. (A) Feature importance. The feature importance (horizontal axis) is defined as the reduction in accuracy when the feature is randomly permuted. The labels of the features indicate the cortical region, the index of the subarea within the subdivided region, and the hemisphere (L for left, R for right) that the feature corresponds to. Only the 30 features with the largest mean importance are shown. **(B) Cortical sources contributing to the classification of patients and controls at two frequency bands**. The spatial distribution of the average feature importance, shown for the theta and alpha frequency bands. The values were calculated as the permutation feature importance. The mean values of the estimated feature importance for the theta and alpha bands are visualized superimposed on the cortical surface in Figure 3B. In line with the results in Figure 3A, the most significant features are concentrated in the parietal lobe while there are also some in the temporal and occipital lobes and in parts of the frontal lobe. To further assess the reliability of the results, the Z-score map of each patient was visually compared to the findings of Kaltiainen and colleagues (Kaltiainen et al., 2018). In that study, theta band activity exceeding two standard deviations from the healthy subjects’ average was found in seven of the 26 mTBI patients, 25 of which were also analyzed in this study. The Z-maps revealed such aberrant low-frequency activity in eight patients – of which five were the same as the ones identified in that earlier study – when the scores were computed using the full normative dataset. With age-matched normative data, the abnormality was found in one additional patient. Representative examples of the Z-score maps of patients with abnormal theta activity are shown in Figure 4. As seen in the figure, the locations of this abnormal low-frequency activity are highly variable. ![Figure 4.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2022/09/30/2022.09.29.22280521/F4.medium.gif) [Figure 4.](http://medrxiv.org/content/early/2022/09/30/2022.09.29.22280521/F4) Figure 4. Deviation score maps for theta-band power in four patients. The color indicates the Z-score with respect to the level of 4–8-Hz activity in the full normative data. The eight patients with abnormal theta activity (or nine in the age-matched case) were classified by the model with an accuracy of 0.950 (age-matched: 0.911). On the other hand, the same phenomenon within the theta frequency band was observed in three out of the 20 healthy controls with non-matched data and in four with age-matched data. These subjects were classified incorrectly without exception. To test whether the classification results were affected by age, delay between time of injury and the MEG recording, or the severity of the symptoms, we calculated correlations between these demographics and the classification results. We obtained the following correlations coefficients: age and fraction of correct predictions −0.11…0.21, timing of MEG and fraction of correct predictions −0.29…0.01, RPQ and fraction of correct predictions −0.16…0.00, age and decision function output 0.00…0.24, timing of MEG and decision function output – 0.14…0.26, RPQ and decision function output −0.13…0.20. None of the correlations were statistically significant (*p*-value of the highest correlation was 0.16), indicating that the ability of the classifier to correctly detect mTBI was not significantly affected by the subject’s age, the delay of the MEG measurements after the injury, or the severity of the post-concussive symptoms on this dataset. ## Discussion ### Classification accuracy and significant features Finding objective diagnostic biomarkers for mTBI is challenging due to the high variability and nonspecificity of posttraumatic symptoms. Combining measurements of brain electric activity with machine learning could aid in identification of mTBI, and thus help clinical decision making. Here we show that mTBI patients can be separated from healthy controls with 79.0% accuracy using quantified deviations from normative power spectra combined with supervised machine learning. Low-frequency activity in the theta frequency band (4–8 Hz) provided the most significant discriminative features for determining the classification results. Increased neural oscillatory activity below 8 Hz, previously associated with axonal injury (Huang et al., 2009)38, is the most frequent finding in mTBI patients even in the chronic stage of the injury (Dunkley et al., 2015; Haneef et al., 2013; Huang et al., 2014, 2012, 2009; Kaltiainen et al., 2018; Lewine et al., 2007; Robb Swan et al., 2015). The obtained results are thus in line with previous literature. Compared to earlier studies utilizing machine learning and neuroimaging data to predict mTBI at the subacute or chronic stage, the accuracy of the methods presented in this paper slightly exceed the accuracy of Cao *et al* (Cao et al., 2008), where 61 subjects were classified with 77.1% accuracy using task-related EEG measurements, and Lewine *et al*. (Lewine et al., 2019), where a 75% accuracy was achieved for classifying 153 subjects using five global features calculated from resting-state EEG data. It is notable that we reach a comparable accuracy with only 45 subjects compared to 61 and 153 subjects in the earlier studies, respectively. In addition, using features that quantify the deviation from a normative sample yields a more interpretable result. Considering that 10 out of 25 mTBI patients did not have any structural lesions visible in MRI scans and that more than half of the MEG recordings were conducted over a month post-injury, the achieved accuracy can be regarded as satisfactory. Furthermore, it is likely that the patient group includes subjects whose brain activity and function could be considered normal by all relevant metrics despite earlier trauma, so achieving an accuracy close to 100% may be an unrealistic goal. We used permutation feature importance for estimating the significance of each individual feature in the classification. Interestingly, features from the alpha, beta and gamma frequency bands were nearly absent from the list of the most significant features, even though these bands showed visible differences between patients and controls at group level. Previously, Zhang and colleagues have detected reduced beta power in frontotemporal regions (Zhang et al., 2020), but for changes in alpha and gamma oscillatory power after mTBI the reports of alterations in neural oscillatory power are contradictory (Antonakakis et al., 2016; Dunkley et al., 2015; Huang et al., 2020; Popescu et al., 2016). A possible explanation for the lack of significant features is that there is large physiological variability in these frequency bands between and even within individuals e.g. according to their vigilance and attention. Another explanation might be the outlier values in these bands that affect the average but do not contain much valuable information for the training of the SVM classifier, as the decision boundary of the SVM is robust to outliers. The most significant features of the theta frequency band were found to be located in the parietal, temporal and occipital regions of the brain. The paucity of frontal features, however, is notable, since frontobasal areas are among the most frequent lesion sites after mTBI (Shin et al., 2017; Yuh et al., 2014). The locations generating mTBI-related low-frequency activity are typically highly variable (Huang et al., 2012), and likely to be influenced by the location of the impact to the head. This spatial variance was confirmed with visual inspection of the individual Z-score maps presented here. This heterogeneity highlights the need to focus on individual-level abnormalities rather than a “typical mTBI patient”, as in a traditional case-control paradigm. ### Normative modeling: advantages and considerations Interpretability of machine-learning models in a medical context is important due to safety and ethical concerns: clinicians should be able to identify possible errors in the model’s predictions (Watson et al., 2019). Normative modeling, an intuitive approach familiar from children’s growth charts, together with a supervised classifier has the potential to help place confidence in the model’s predictions and detect possible errors. In addition to the prediction of the model, the decisions can be aided with visualizations of the patients’ individual cortical Z-score maps, possibly limited to the theta frequency band. Many of the earlier studies utilizing a normative modeling approach have relied on Gaussian process regression (Williams and Rasmussen, 1996) for modeling the healthy variation (Marquand et al., 2016; Wolfers et al., 2018; Zabihi et al., 2019), which has a benefit of quantifying the uncertainty of the model. The method could be explored in the context of our proposed approach in future research. In this study, a relatively simple and straightforward calculation of Z-scores was adapted, as robust statistical inference was not of concern: the normative modeling provided features for supervised classification rather than being directly used for discriminating patients from controls. A larger normative dataset of thousands of measurements might also enable the use of state-of-the-art deep learning methods for building the normative model, such as deep autoencoders as in Pinaya and colleagues (Pinaya et al., 2021, 2019), which have performed well in complex tasks but require a large number of samples to learn effectively. As neurophysiological patterns vary significantly with age, patients should be compared to their own age group in normative comparisons (Nuwer et al., 2005). In this study, selecting the normative data by each subject’s own age group for calculating the Z-score maps yielded better results compared to using an equally sized random sample of normative data. This suggests that true abnormalities are indeed more reliably identified if a subject is compared with their own age group: selecting the normative data by age may lead to better detection of phenomena normal for some age groups but possibly pathological for others. However, the highest accuracy was achieved by using all available normative data, which suggests that a sufficiently large normative dataset is needed to capture enough individual variation. A normative database integrated with a future clinical application should thus ideally aggregate thousands of M/EEG measurements of healthy subjects across different studies, sites and demographic variables to enable selecting a sufficiently sized subset of normative data with suitable properties for each task. Large functional imaging datasets from clinical populations are rarely available, and drawing reliable conclusions about classification results is often hindered by the small size of the datasets. In this study, the effect of the size of the dataset was most clearly seen in the large standard deviations of the accuracy, sensitivity and specificity scores, which ranged from 0.15 to 0.32. Larger clinical datasets would increase the robustness of the predictions and of the identification of the most significant features, but such large-scale patient data collection remains a challenge. We employed an SVM classifier, as they typically perform well on data with high dimensionality (Nayak et al., 2015). The nonlinearity of the SVM classifier does, however, introduce some limitations to the interpretability of the results, as the weights of the individual features are not directly interpretable. Importantly, the results of such methods should not be thought of as revealing exactly the location and frequency of the most discriminative features, but as giving a general sense about brain activation patterns associated with mTBI. ## Conclusions We introduced a normative modeling and machine learning approach capable of discriminating the MEG power spectra of mild traumatic brain injury patients from healthy controls with up to 79.0% accuracy, which is high enough to substantially contribute to clinical decision making. Most of the features that were significant for the classification corresponded to the theta frequency band, the activity of which has been associated with pathological phenomena, including mTBI, in earlier studies. The approach could be used to help differentiate mTBI-related symptoms in patients with confounding factors who exhibit prolonged symptoms suggestive of TBI and/or problems with vocational performance, as well as for detecting patients in need for neuropsychological intervention. We demonstrated how the normative modeling approach provides a means to intuitively interpret the predictions of the classifier even at the level of individual patients, a framework that could be incorporated in future clinical applications. This would enable building a system where, for example, a brain scan of an individual patient could be automatically checked for different pathological patterns against a large normative database. The present study acts as an example use case for such a system, with a preprocessing and classification pipeline that can be made fully automatic from the raw measurement data to the final results. ## Data Availability The data that supports the findings of the study are available upon reasonable request from the authors. ## Funding This research was supported by European Research Council grant #658578 to LP, Neurocenter Finland (Functional Brain Imaging Biobank pilot project) and European Commission Regional Development Fund REACT-EU (#309487). Support was awarded to ML by the Swedish Cultural Foundation in Finland, the Finnish Cultural Foundation, and the Sohlberg Foundation. HK was supported by The Finnish Medical Foundation, Paulo Foundation, and Helsinki University Hospital Research Fund. None of the funding sources had a role in conducting the research. ## Acknowledgements The authors thank Paju Rantala for automated pipelines for calculating cortical power spectra for large-scale MEG data. ## Footnotes * **Conflict of interest** The authors declare no competing financial interests. * Received September 29, 2022. * Revision received September 29, 2022. * Accepted September 30, 2022. * © 2022, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/) ## References 1. Antonakakis, M., Dimitriadis, S.I., Zervakis, M., Micheloyannis, S., Rezaie, R., Babajani-Feremi, A., Zouridakis, G., Papanicolaou, A.C., 2016. Altered cross-frequency coupling in resting-state MEG after mild traumatic brain injury. International Journal of Psychophysiology 102, 1–11. [https://doi.org/10.1016/j.ijpsycho.2016.02.002](https://doi.org/10.1016/j.ijpsycho.2016.02.002) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ijpsycho.2016.02.002&link_type=DOI) 2. Bigler, E.D., Abildskov, T.J., Goodrich-Hunsaker, N.J., Black, G., Christensen, Z.P., Huff, T., Wood, D.M.G., Hesselink, J.R., Wilde, E.A., Max, J.E., 2016. Structural neuroimaging findings in mild traumatic brain injury. Sports Medicine and Arthroscopy Review 24, e42–e52. [https://doi.org/10.1097/JSA.0000000000000119](https://doi.org/10.1097/JSA.0000000000000119) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/JSA.0000000000000119&link_type=DOI) 3. Breiman, L., 2001. Random forests. Machine Learning 45, 5–32. [https://doi.org/10.1023/A:1010933404324](https://doi.org/10.1023/A:1010933404324) 4. Cao, C., Tutwiler, R.L., Slobounov, S., 2008. Automatic classification of athletes with residual functional deficits following concussion by means of EEG signal using support vector machine. IEEE Transactions on Neural Systems and Rehabilitation Engineering 16, 327–335. [https://doi.org/10.1109/TNSRE.2008.918422](https://doi.org/10.1109/TNSRE.2008.918422) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1109/TNSRE.2008.918422&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18701381&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F09%2F30%2F2022.09.29.22280521.atom) 5. Castellanos, N.P., Paúl, N., Ordóñ Ez, V.E., Demuynck, O., Bajo, R., Campo, P., Bilbao, A., Ortiz, T., Del-Pozo, F., Maestú, F., 2010. Reorganization of functional connectivity as a correlate of cognitive recovery in acquired brain injury. Brain 133, 2365–2381. [https://doi.org/10.1093/brain/awq174](https://doi.org/10.1093/brain/awq174) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/brain/awq174&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20826433&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F09%2F30%2F2022.09.29.22280521.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000280982700017&link_type=ISI) 6. Dale, A.M., Fischl, B., Sereno, M.I., 1999. Cortical surface-based analysis: I. Segmentation and surface reconstruction. NeuroImage 9, 179–194. [https://doi.org/10.1006/nimg.1998.0395](https://doi.org/10.1006/nimg.1998.0395) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1006/nimg.1998.0395&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=9931268&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F09%2F30%2F2022.09.29.22280521.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000078608900001&link_type=ISI) 7. Dikmen, S., Machamer, J., Temkin, N., 2017. Mild traumatic brain injury: Longitudinal study of cognition, functional status, and post-traumatic symptoms. Journal of Neurotrauma 34, 1524–1530. [https://doi.org/10.1089/neu.2016.4618](https://doi.org/10.1089/neu.2016.4618) 8. Dimitriadis, S.I., Zouridakis, G., Rezaie, R., Babajani-Feremi, A., Papanicolaou, A.C., 2015. Functional connectivity changes detected with magnetoencephalography after mild traumatic brain injury. NeuroImage: Clinical 9, 519–531. [https://doi.org/10.1016/j.nicl.2015.09.011](https://doi.org/10.1016/j.nicl.2015.09.011) 9. Dunkley, B.T., Da Costa, L., Bethune, A., Jetly, R., Pang, E.W., Taylor, M.J., Doesburg, S.M., 2015. Low-frequency connectivity is associated with mild traumatic brain injury. Neuroimage Clin 7, 611–621. [https://doi.org/10.1016/j.nicl.2015.02.020](https://doi.org/10.1016/j.nicl.2015.02.020) 10. Fischl, B., Sereno, M.I., Dale, A.M., 1999. Cortical surface-based analysis: II. Inflation, flattening, and a surface-based coordinate system. NeuroImage 9, 195–207. [https://doi.org/10.1006/nimg.1998.0396](https://doi.org/10.1006/nimg.1998.0396) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1006/nimg.1998.0396&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=9931269&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F09%2F30%2F2022.09.29.22280521.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000078608900002&link_type=ISI) 11. Gramfort, A., Luessi, M., Larson, E., Engemann, D., Strohmeier, D., Brodbeck, C., Parkkonen, L., Hämäläinen, M., 2014. MNE software for processing MEG and EEG data. NeuroImage 23, 1–7. [https://doi.org/10.1016/j.neuroimage.2013.10.027.MNE](https://doi.org/10.1016/j.neuroimage.2013.10.027.MNE) 12. Gramfort, A., Luessi, M., Larson, E., Engemann, D.A., Strohmeier, D., Brodbeck, C., Goj, R., Jas, M., Brooks, T., Parkkonen, L., Hämäläinen, M., 2013. MEG and EEG data analysis with MNE-Python. Frontiers in Neuroscience 7, 267. [https://doi.org/10.3389/fnins.2013.00267](https://doi.org/10.3389/fnins.2013.00267) 13. Haneef, Z., Levin, H.S., Frost, J.D., Mizrahi, E.M., 2013. Electroencephalography and quantitative electroencephalography in mild traumatic brain injury. Journal of Neurotrauma 30, 653–656. [https://doi.org/10.1089/neu.2012.2585](https://doi.org/10.1089/neu.2012.2585) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1089/neu.2012.2585&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23249295&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F09%2F30%2F2022.09.29.22280521.atom) 14. Huang, M.X., Huang, C.W., Harrington, D.L., Nichols, S., Robb-Swan, A., Angeles-Quinto, A., Le, L., Rimmele, C., Drake, A., Song, T., Huang, J.W., Clifford, R., Ji, Z., Cheng, C.K., Lerman, I., Yurgil, K.A., Lee, R.R., Baker, D.G., 2020. Marked increases in resting-state MEG gamma-band activity in combat-related mild traumatic brain injury. Cerebral Cortex 30, 283–295. [https://doi.org/10.1093/cercor/bhz087](https://doi.org/10.1093/cercor/bhz087) 15. Huang, M.X., Nichols, S., Baker, D.G., Robb, A., Angeles, A., Yurgil, K.A., Drake, A., Levy, M., Song, T., McLay, R., Theilmann, R.J., Diwakar, M., Risbrough, V.B., Ji, Z., Huang, C.W., Chang, D.G., Harrington, D.L., Muzzatti, L., Canive, J.M., Christopher Edgar, J., Chen, Y.H., Lee, R.R., 2014. Single-subject-based whole-brain MEG slow-wave imaging approach for detecting abnormality in patients with mild traumatic brain injury. NeuroImage: Clinical 5, 109–119. [https://doi.org/10.1016/j.nicl.2014.06.004](https://doi.org/10.1016/j.nicl.2014.06.004) 16. Huang, M.X., Nichols, S., Robb, A., Angeles, A., Drake, A., Holland, M., Asmussen, S., D’Andrea, J., Chun, W., Levy, M., Cui, L., Song, T., Baker, D.G., Hammer, P., McLay, R., Theilmann, R.J., Coimbra, R., Diwakar, M., Boyd, C., Neff, J., Liu, T.T., Webb-Murphy, J., Farinpour, R., Cheung, C., Harrington, D.L., Heister, D., Lee, R.R., 2012. An automatic MEG low-frequency source imaging approach for detecting injuries in mild and moderate TBI patients with blast and non-blast causes. NeuroImage 61, 1067–1082. [https://doi.org/10.1016/j.neuroimage.2012.04.029](https://doi.org/10.1016/j.neuroimage.2012.04.029) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neuroimage.2012.04.029&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22542638&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F09%2F30%2F2022.09.29.22280521.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000305920600034&link_type=ISI) 17. Huang, M.X., Theilmann, R.J., Robb, A., Angeles, A., Nichols, S., Drake, A., D’Andrea, J., Levy, M., Holland, M., Song, T., Ge, S., Hwang, E., Yoo, K., Cui, L., Baker, D.G., Trauner, D., Coimbra, R., Lee, R.R., 2009. Integrated imaging approach with MEG and DTI to detect mild traumatic brain injury in military and civilian patients. Journal of Neurotrauma 26, 1213–1226. [https://doi.org/10.1089/neu.2008.0672](https://doi.org/10.1089/neu.2008.0672) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1089/neu.2008.0672&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19385722&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F09%2F30%2F2022.09.29.22280521.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000268629900005&link_type=ISI) 18. Hyvärinen, A., Oja, E., 2000. Independent component analysis: Algorithms and applications. Neural Networks 13, 411–430. [https://doi.org/10.1016/S0893-6080(00)00026-5](https://doi.org/10.1016/S0893-6080(00)00026-5) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0893-6080(00)00026-5&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=10946390&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F09%2F30%2F2022.09.29.22280521.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000088567000002&link_type=ISI) 19. Kaltiainen, H., Helle, L., Liljeström, M., Renvall, H., Forss, N., 2018. Theta-band oscillations as an indicator of mild traumatic brain injury. Brain Topography 31, 1037–1046. [https://doi.org/10.1007/s10548-018-0667-2](https://doi.org/10.1007/s10548-018-0667-2) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s10548-018-0667-2&link_type=DOI) 20. Kaltiainen, H., Liljeström, M., Helle, L., Salo, A., Hietanen, M., Renvall, H., Forss, N., 2019. Mild Traumatic Brain Injury Affects Cognitive Processing and Modifies Oscillatory Brain Activity during Attentional Tasks. Journal of Neurotrauma 36, 2222–2232. [https://doi.org/10.1089/neu.2018.6306](https://doi.org/10.1089/neu.2018.6306) 21. Kapur, S., Phillips, A.G., Insel, T.R., 2012. Why has it taken so long for biological psychiatry to develop clinical tests and what to do about it. Molecular Psychiatry 17, 1174–1179. [https://doi.org/10.1038/mp.2012.105](https://doi.org/10.1038/mp.2012.105) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/mp.2012.105&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22869033&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F09%2F30%2F2022.09.29.22280521.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000311425600010&link_type=ISI) 22. Kay, T., Harrington, D., Adams, Anderson, Berrol, Cicerone, Dahlberg, Gerber, Goka, Harley, Hilt, Horn, Lehmkuhl, Malec, 1993. Definition of mild traumatic brain injury. The Journal of Head Trauma Rehabilitation 8. 23. Khan, S., Hashmi, J.A., Mamashli, F., Michmizos, K., Kitzbichler, M.G., Bharadwaj, H., Bekhti, Y., Ganesan, S., Garel, K.L.A., Whitfield-Gabrieli, S., Gollub, R.L., Kong, J., Vaina, L.M., Rana, K.D., Stufflebeam, S.M., Hämäläinen, M.S., Kenet, T., 2018. Maturation trajectories of cortical resting-state networks depend on the mediating frequency band. NeuroImage 174, 57–68. [https://doi.org/10.1016/j.neuroimage.2018.02.018](https://doi.org/10.1016/j.neuroimage.2018.02.018) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neuroimage.2018.02.018&link_type=DOI) 24. 1. Cardoso, M.J., 2. Feragen, A., 3. Glocker, B., 4. Konukoglu, E., 5. Oguz, I., 6. Unal, G., 7. Vercauteren, T. Kia, S.M., Marquand, A.F., 2019. Neural Processes Mixed-Effect Models for Deep Normative Modeling of Clinical Neuroimaging Data, in: Cardoso, M.J., Feragen, A., Glocker, B., Konukoglu, E., Oguz, I., Unal, G., Vercauteren, T. (Eds.), Proceedings of The 2nd International Conference on Medical Imaging with Deep Learning, Proceedings of Machine Learning Research. PMLR, pp. 297–314. 25. King, N.S., Crawford, S., Wenden, F.J., Moss, N.E.G., Wade, D.T., 1995. The Rivermead Post Concussion Symptoms Questionnaire: a measure of symptoms commonly experienced after head injury and its reliability. Journal of Neurology 242, 587–592. [https://doi.org/10.1007/BF00868811](https://doi.org/10.1007/BF00868811) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/BF00868811&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=8551320&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F09%2F30%2F2022.09.29.22280521.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1995RW08200006&link_type=ISI) 26. Lewine, J.D., Davis, J.T., Bigler, E.D., Thoma, R., Hill, D., Funke, M., Sloan, J.H., Hall, S., Orrison, W.W., 2007. Objective documentation of traumatic brain injury subsequent to mild head trauma: Multimodal brain imaging with MEG, SPECT, and MRI. Journal of Head Trauma Rehabilitation 22, 141–155. [https://doi.org/10.1097/01.HTR.0000271115.29954.27](https://doi.org/10.1097/01.HTR.0000271115.29954.27) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/01.HTR.0000271115.29954.27&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17510590&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F09%2F30%2F2022.09.29.22280521.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000246951100001&link_type=ISI) 27. Lewine, J.D., Plis, S., Ulloa, A., Williams, C., Spitz, M., Foley, J., Paulson, K., Davis, J., Bangera, N., Snyder, T., Weaver, L., 2019. Quantitative EEG biomarkers for mild traumatic brain injury. Journal of Clinical Neurophysiology 36, 298–305. [https://doi.org/10.1097/WNP.0000000000000588](https://doi.org/10.1097/WNP.0000000000000588) 28. Marquand, A.F., Kia, S.M., Zabihi, M., Wolfers, T., Buitelaar, J.K., Beckmann, C.F., 2019. Conceptualizing mental disorders as deviations from normative functioning. Molecular Psychiatry 24, 1415–1424. [https://doi.org/10.1038/s41380-019-0441-1](https://doi.org/10.1038/s41380-019-0441-1) 29. Marquand, A.F., Rezek, I., Buitelaar, J., Beckmann, C.F., 2016. Understanding heterogeneity in clinical cohorts using normative models: beyond case-control studies. Biological Psychiatry 80, 552–561. [https://doi.org/10.1016/j.biopsych.2015.12.023](https://doi.org/10.1016/j.biopsych.2015.12.023) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.biopsych.2015.12.023&link_type=DOI) 30. Nayak, J., Naik, B., Behera, H.S., 2015. A comprehensive survey on support vector machine in data mining tasks: applications & challenges. International Journal of Database Theory and Application 8, 169–186. [https://doi.org/10.14257/ijdta.2015.8.1.18](https://doi.org/10.14257/ijdta.2015.8.1.18) 31. Nuwer, M.R., Hovda, D.A., Schrader, L.M., Vespa, P.M., 2005. Routine and quantitative EEG in mild traumatic brain injury. Clinical Neurophysiology 116, 2001–2025. [https://doi.org/10.1016/j.clinph.2005.05.008](https://doi.org/10.1016/j.clinph.2005.05.008) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.clinph.2005.05.008&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16029958&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F09%2F30%2F2022.09.29.22280521.atom) 32. Oostenveld, R., Fries, P., Maris, E., Schoffelen, J.M., 2011. FieldTrip: Open source software for advanced analysis of MEG, EEG, and invasive electrophysiological data. Computational Intelligence and Neuroscience 2011. [https://doi.org/10.1155/2011/156869](https://doi.org/10.1155/2011/156869) 33. Pinaya, W.H.L., Mechelli, A., Sato, J.R., 2019. Using deep autoencoders to identify abnormal brain structural patterns in neuropsychiatric disorders: A large-scale multi-sample study. Human Brain Mapping 40, 944–954. [https://doi.org/10.1002/hbm.24423](https://doi.org/10.1002/hbm.24423) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/hbm.24423&link_type=DOI) 34. Pinaya, W.H.L., Scarpazza, C., Garcia-Dias, R., Vieira, S., Baecker, L., F da Costa, P., Redolfi, A., Frisoni, G.B., Pievani, M., Calhoun, V.D., Sato, J.R., Mechelli, A., 2021. Using normative modelling to detect disease progression in mild cognitive impairment and Alzheimer’s disease in a cross-sectional multi-cohort study. Scientific Reports 11, 15746. [https://doi.org/10.1038/s41598-021-95098-0](https://doi.org/10.1038/s41598-021-95098-0) 35. Pontifex, M.B., O’Connor, P.M., Broglio, S.P., Hillman, C.H., 2009. The association between mild traumatic brain injury history and cognitive control. Neuropsychologia 47, 3210–3216. [https://doi.org/10.1016/j.neuropsychologia.2009.07.021](https://doi.org/10.1016/j.neuropsychologia.2009.07.021) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neuropsychologia.2009.07.021&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19664646&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F09%2F30%2F2022.09.29.22280521.atom) 36. Popescu, M., Hughes, J.D., Popescu, E.A., Riedy, G., DeGraba, T.J., 2016. Reduced prefrontal MEG alpha-band power in mild traumatic brain injury with associated posttraumatic stress disorder symptoms. Clinical Neurophysiology 127, 3075–3085. [https://doi.org/10.1016/j.clinph.2016.06.004](https://doi.org/10.1016/j.clinph.2016.06.004) 37. Robb Swan, A., Nichols, S., Drake, A., Angeles, A.M., Diwakar, M., Song, T., Lee, R.R., Huang, M.X., 2015. Magnetoencephalography slow-wave detection in patients with mild traumatic brain injury and ongoing symptoms correlated with long-term neuropsychological outcome. Journal of Neurotrauma 32, 1510–1521. [https://doi.org/10.1089/neu.2014.3654](https://doi.org/10.1089/neu.2014.3654) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1089/neu.2014.3654&link_type=DOI) 38. Rogers, J.M., Fox, A.M., Donnelly, J., 2015. Impaired practice effects following mild traumatic brain injury: An event-related potential investigation. Brain Injury 29, 343–351. [https://doi.org/10.3109/02699052.2014.976273](https://doi.org/10.3109/02699052.2014.976273) 39. Ségonne, F., Dale, A.M., Busa, E., Glessner, M., Salat, D., Hahn, H.K., Fischl, B., 2004. A hybrid approach to the skull stripping problem in MRI. NeuroImage 22, 1060–1075. [https://doi.org/10.1016/j.neuroimage.2004.03.032](https://doi.org/10.1016/j.neuroimage.2004.03.032) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neuroimage.2004.03.032&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15219578&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F09%2F30%2F2022.09.29.22280521.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000222423200004&link_type=ISI) 40. Shin, S.S., Bales, J.W., Edward Dixon, C., Hwang, M., 2017. Structural imaging of mild traumatic brain injury may not be enough: overview of functional and metabolic imaging of mild traumatic brain injury. Brain Imaging and Behavior 11, 591–610. [https://doi.org/10.1007/s11682-017-9684-0](https://doi.org/10.1007/s11682-017-9684-0) 41. Taulu, S., Simola, J., 2006. Spatiotemporal signal space separation method for rejecting nearby interference in MEG measurements. Physics in Medicine and Biology 51, 1759–1768. [https://doi.org/10.1088/0031-9155/51/7/008](https://doi.org/10.1088/0031-9155/51/7/008) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1088/0031-9155/51/7/008&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16552102&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F09%2F30%2F2022.09.29.22280521.atom) 42. Taylor, J.R., Williams, N., Cusack, R., Auer, T., Shafto, M.A., Dixon, M., Tyler, L.K., Cam-CAN, Henson, R.N., 2017. The Cambridge Centre for Ageing and Neuroscience (Cam-CAN) data repository: Structural and functional MRI, MEG, and cognitive data from a cross-sectional adult lifespan sample. NeuroImage 144, 262–269. [https://doi.org/10.1016/j.neuroimage.2015.09.018](https://doi.org/10.1016/j.neuroimage.2015.09.018) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neuroimage.2015.09.018&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26375206&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F09%2F30%2F2022.09.29.22280521.atom) 43. Teasdale, G., Jennett, B., 1974. Assessment of coma and impaired consciousness. A practical scale. The Lancet 304, 81–84. [https://doi.org/10.1016/S0140-6736(74)91639-0](https://doi.org/10.1016/S0140-6736(74)91639-0) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/s0140-6736(74)91639-0&link_type=DOI) 44. Vapnik, V.N., 1995. The Nature of Statistical Learning Theory. Springer, New York. [https://doi.org/10.1007/978-1-4757-2440-0](https://doi.org/10.1007/978-1-4757-2440-0) 45. Wäljas, M., Iverson, G.L., Lange, R.T., Hakulinen, U., Dastidar, P., Huhtala, H., Liimatainen, S., Hartikainen, K., Öhman, J., 2015. A prospective biopsychosocial study of the persistent post-concussion symptoms following mild traumatic brain injury. Journal of Neurotrauma 32, 534–547. [https://doi.org/10.1089/neu.2014.3339](https://doi.org/10.1089/neu.2014.3339) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1089/neu.2014.3339&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25363626&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F09%2F30%2F2022.09.29.22280521.atom) 46. Watson, D.S., Krutzinna, J., Bruce, I.N., Griffiths, C.E.M., McInnes, I.B., Barnes, M.R., Floridi, L., 2019. Clinical applications of machine learning algorithms: Beyond the black box. BMJ (Online) 364, 1–5. [https://doi.org/10.1136/bmj.l886](https://doi.org/10.1136/bmj.l886) 47. Williams, C.K.I., Rasmussen, C.E., 1996. Gaussian processes for regression. Advances in Neural Information Processing Systems 8, 514–520. 48. Wolfers, T., Doan, N.T., Kaufmann, T., Alnæs, D., Moberget, T., Agartz, I., Buitelaar, J.K., Ueland, T., Melle, I., Franke, B., Andreassen, O.A., Beckmann, C.F., Westlye, L.T., Marquand, A.F., 2018. Mapping the heterogeneous phenotype of schizophrenia and bipolar disorder using normative models. JAMA Psychiatry 75, 1146–1155. [https://doi.org/10.1001/jamapsychiatry.2018.2467](https://doi.org/10.1001/jamapsychiatry.2018.2467) 49. Yuh, E.L., Hawryluk, G.W.J., Manley, G.T., 2014. Imaging concussion: A review. Neurosurgery 75, S50–S63. [https://doi.org/10.1227/NEU.0000000000000491](https://doi.org/10.1227/NEU.0000000000000491) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1227/NEU.0000000000000491&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25232884&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F09%2F30%2F2022.09.29.22280521.atom) 50. Zabihi, M., Oldehinkel, M., Wolfers, T., Frouin, V., Goyard, D., Loth, E., Charman, T., Tillmann, J., Banaschewski, T., Dumas, G., Holt, R., Baron-Cohen, S., Durston, S., Bölte, S., Murphy, D., Ecker, C., Buitelaar, J.K., Beckmann, C.F., Marquand, A.F., 2019. Dissecting the heterogeneous cortical anatomy of autism spectrum disorder using normative models. Biological Psychiatry: Cognitive Neuroscience and Neuroimaging 4, 567–578. [https://doi.org/10.1016/j.bpsc.2018.11.013](https://doi.org/10.1016/j.bpsc.2018.11.013) 51. Zhang, J., Safar, K., Emami, Z., Ibrahim, G.M., Scratch, S.E., da Costa, L., Dunkley, B.T., 2020. Local and large-scale beta oscillatory dysfunction in males with mild traumatic brain injury. J Neurophysiol 124, 1948–1958. [https://doi.org/10.1152/jn.00333.2020](https://doi.org/10.1152/jn.00333.2020) 52. Ziegler, G., Ridgway, G.R., Dahnke, R., Gaser, C., 2014. Individualized Gaussian process-based prediction and detection of local and global gray matter abnormalities in elderly subjects. NeuroImage 97, 333–348. [https://doi.org/10.1016/j.neuroimage.2014.04.018](https://doi.org/10.1016/j.neuroimage.2014.04.018) [1]: /embed/graphic-3.gif