Detecting a History of Repetitive Head Impacts from a Short Voice Recording =========================================================================== * Michael G. Tauro * Mirco Ravanelli * Cristian A. Droppelmann ## ABSTRACT Repetitive head impacts (RHI) are associated with an increased risk of developing various neurodegenerative disorders, such as Alzheimer’s disease (AD), Parkinson’s disease (PD), and most notably, chronic traumatic encephalopathy (CTE). While the clinical presentation of AD and PD is well established, CTE can only be diagnosed post-mortem. Therefore, a distinction can be made between the pathologically defined CTE and RHI-related functional or structural brain changes (RHI-BC) which may result in CTE. Unfortunately, there are currently no accepted biomarkers of CTE nor RHI-BC, a major hurdle to achieving clinical diagnoses. Interestingly, speech has shown promise as a potential biomarker of both AD and PD, being used to accurately classify individuals with AD and PD from those without. Given the overlapping symptoms between CTE, RHI-BC, PD and AD, we aimed to determine if speech could be used to identify individuals with a history of RHI from those without. We therefore created the Verus dataset, consisting of 13 second voice recordings from 605 professional fighters (RHI group) and 605 professional athletes in non-contact sports (control group) for a total of 1210 recordings. Using a deep learning approach, we achieved 85% accuracy in detecting individuals with a history of RHI from those without. We then used our model trained on the Verus dataset to fine-tune on publicly available AD and PD speech datasets and achieved new state-of-the-art accuracies of 84.99% on the AD dataset and 89% on the PD dataset. Finding a biomarker of CTE and RHI-BC that presents early in disease progression is critical to improve risk management and patient outcome. Our study is the first we are aware of to investigate speech as such a candidate biomarker of RHI-BC. Keywords * Chronic traumatic encephalopathy * Repetitive head impacts * Machine learning * Speech * Deep learning * Neurodegeneration * Transfer learning ## 1. Introduction Concussion is defined as a form of mild traumatic brain injury (mTBI) that disrupts regular brain function [71]. Concussion can be caused by blunt force trauma to the head or rapid acceleration of the head, as occurs in whiplash [77]. While the acute symptoms of concussion generally resolve within a week, long term effects can occur from multiple or even a single concussion [52]. However, focusing on the long-term risks associated exclusively with concussion may miss key information. Athletes for example tend to underreport the number of concussions they have had, making analyses between health outcomes and number of concussions difficult. Further, there is no objective threshold one can use to differentiate a concussion from a subconcussive blow to the head [52]. Finally, recent literature suggests that the behavioural symptoms of concussion result from physical disruption to brain function, while subconcussive impacts have the potential to cause the long-term effects often attributed to concussion [81]. Therefore, we will use the term repetitive head impacts (RHI) in reference to blows to the head which may or may not have resulted in a clinically diagnosed concussion. Correspondingly, a history of RHI is associated with an increased risk of developing Alzheimer’s disease (AD), Parkinson’s disease (PD), amyotrophic lateral sclerosis (ALS), chronic traumatic encephalopathy (CTE) and other disabilities [43, 34, 76, 13, 42]. CTE is a neurodegenerative disorder linked to a history of RHI [55]. Originally documented in professional boxers, CTE has now been identified in former professional football, hockey, soccer and rugby players, as well as combat veterans and others [69, 16, 61, 50]. While the symptoms of CTE are quite heterogeneous, [55] has proposed a staged-based characterisation of the disease, with increasing stages relating to increased neurodegeneration and symptom severity. Stage 1 may be asymptomatic, with memory problems occurring rarely. Stage 2 symptoms include severe depressive episodes as well as behavioural changes and outbursts. Stage 3 is characterized by the first signs of cognitive deficits, such as executive dysfunction, memory loss, attentional and concentration difficulties and explosive outbursts. Stage 4 symptoms may include more severe cognitive deficits, including increased deficits in memory, attention, executive functioning, language, as well as severe depression, suicidal tendencies, paranoia, gait and visuospatial difficulties, dysarthria and parkinsonism. CTE is pathologically characterized, with diagnoses only possible post-mortem [54]. Indeed, the clinical presentation of CTE is not universally agreed upon, nor is the number or severity of RHI needed to precipitate CTE [82, 43, 30]. Therefore, in this paper a distinction is made between CTE, and RHI-related functional or structural brain changes (RHI-BC). Unfortunately, there are no accepted biomarkers of CTE or RHI-BC [9]. As CTE is a degenerative disorder, early detection through the use of biomarkers could be critical in improving patient outcome and slowing disease progression. The use of speech as a biomarker of neurodegenerative disorders has recently come under investigation with promising results [68]. Using voice recordings, [59] achieved 90% accuracy in classifying those with Parkinson’s disease (PD) from those without. [85] achieved 91.6% accuracy in classifying those with in classifying ALS patients with bulbar-onset from those without. While the previous two studies relied solely on the acoustic features of speech, [29] used a combination of acoustic, semantic, lexical, as well as other linguistic features to achieve 81.9% accuracy in classifying those with Alzheimer’s disease (AD) from those without. This warrants the investigation of speech as a biomarker of RHI-BC. In fact, not only does exposure to RHI increase the risk of developing PD, AD and ALS, but CTE also has overlapping symptoms with each of the aforementioned neurodegenerative disorders [55, 32]. Further, speech problems have long been considered a symptom of CTE. Indeed, [51] commented that many boxers examined had “hesitancy of speech”, “indistinct”, slow, and “thick, muffled, and hard to understand” speech, along with other issues. [23] described case reports of 11 boxers, with 6 listed as having motor speech problems. More recently, [46] characterised the speech of 102 active and retired professional fighters in comparison to 27 healthy controls. Professional fighters were reported to have a significantly slower articulation rate, with 88% of fighters having a slower articulation rate than the mean of the control group. Fighters also exhibited increased interruptions in speech, such as pauses, stuttering, and other dysfluencies. The authors noted that the speech characteristics of the fighters are also observed in several types of dysarthria, PD and parkinsonism. Yet, as speech problems are associated with RHI, and speech has been used to successfully detect neurodegenerative disorders that present similarly to CTE, speech might also serve as a biomarker of RHI-BC. The investigation of speech as a biomarker of various diseases has historically been influenced by other popular speech classification tasks such as speaker recognition [68]. Traditionally, the best results in speaker recognition were achieved using some form or combination of a Gaussian Mixture Model (GMM) [31]. More specifically, a GMM-Universal Background Model (UBM) was used by [3] to predict PD severity in a longitudinal study. Yet the current trend has now shifted to the use of deep neural networks (DNN) [6]. Indeed, many recent performance advancements in speaker recognition and verification tasks are achieved through the use of x-vectors and other similar embedding approaches [73, 10]. x-vectors are fixed-length embedding vectors of variable length voice recordings. x-vectors have been used to achieve state of the art accuracy in detecting gender, language and PD. They have also recently been used to detect AD [59, 72, 66, 63]. Notably, x-vectors result in text-independent speaker identification, meaning x-vectors capture the acoustic properties of speech, but not the individual words or meaning. This was considered advantageous as motor-speech disorders occur earlier than language impairments in PD and ALS, thus we reasoned this could be the case in individuals exposed to RHI [36, 41, 83]. Thus, as our speaker recognition model, we decided to use the x-vector scheme with an Emphasized Channel Attention, Propagation and Aggregation-Time Delay Neural Network (ECAPA-TDNN) architecture. The ECAPA-TDNN model was developed by [27] and improved upon the previous TDNN-x-vector architecture to achieve state of the art performance in speaker recognition tasks. The prevalence of CTE within populations exposed to RHI is unknown, with estimates ranging from 5-99% [18, 11, 57]. As the incidence rate and clinical presentation of CTE remain ambiguous, we aimed to determine if state of the art speech classification algorithms based on modern deep learning techniques could differentiate individuals with RHI exposure from those without. To do so, we created the Verus corpus, composed of 1210 individuals, corresponding to one 13 second voice recording each. Of these 1210 individuals, 605 were retired and active professional fighters and 605 were retired and active professional athletes in non-contact sports. The fighters were composed of boxers, mixed martial artists (MMA), kickboxers and Muay Thai fighters. These fighters were considered the group with RHI exposure, or the RHI group. The athletes in non-contact sports, considered the control group, were composed of soccer, basketball and baseball players as well as golfers, swimmers and track and field athletes. The two groups (RHI and control) were matched for age, ethnicity and gender. The dataset will be further characterized in the methods section. At various stages of CTE, clinical symptoms may present similarly to other neurodegenerative disorders such as AD, PD and ALS [86]. However, it is unclear to what degree similarities exist between these disorders along the dimension of speech. Concordantly, we were interested in determining if our model trained on the Verus dataset could be re-purposed to detect AD and PD at or above the current state-of-the-art accuracy on certain AD and PD corpora. Successful transfer learning from the Verus dataset to the aforementioned neurodegenerative disorders would suggest that our model was identifying vocal features in the Verus dataset that are related to brain damage. This would in turn support the utility of speech as a biomarker of RHI-BC. ## 2. Proposed Approach While deep learning approaches can produce state-of-the-art results in speaker recognition and related tasks, they may require significantly more data to be trained on compared to non-deep learning algorithms. Unfortunately, in the field of medicine, domain-specific data is often difficult to obtain, and algorithms must be trained on datasets of sub-optimal size. Therefore, standard practice is to pretrain a deep learning model on a domain non-specific task before fine-tuning on the task of interest [47, 70]. This allows for the learning of lower-level features which may be domain-unspecific in nature [8]. Building on this standard, we adopted a multi-step transfer learning approach, where we first pretrained our ECAPA-TDNN model on a large, domain non-specific dataset and then subsequently fine-tuned on smaller, domain-specific datasets. This approach outlined below can also be visualized in Figure 1. ![Figure 1:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/09/29/2021.09.20.21263753/F1.medium.gif) [Figure 1:](http://medrxiv.org/content/early/2021/09/29/2021.09.20.21263753/F1) Figure 1: Depiction of the training process used in this study. a) An ECAPA-TDNN model is trained to classify speaker identities with the Voxceleb1+2 dataset. b) The pretrained ECAPA-TDNN model from “a)” is fine-tuned on the Verus dataset to differentiate the RHI from control group. c) Finally, the pre-trained ECAPA-TDNN model from “b)” is fine-tuned on either the DementiaBank or Gita dataset in order to detect AD or PD, respectively. Stage 1 of our approach entailed training a speaker recognition model. In Stage 2, we used the ECAPA-TDNN model that was pretrained for speaker recognition and fine-tuned it to classify individuals with a history of RHI from those without. In Stage 3, we utilized the ECAPA-TDNN model from Stage 2 and fine-tuned it to perform either AD or PD detection. Using this multi-step transfer learning approach, we start with the broad task of speaker recognition and then successively transfer to the more specific tasks of RHI detection followed by AD and PD detection. This order of operations utilized the larger datasets first, with the intention that the smaller datasets will be used primarily to learn domain-specific features. As some symptoms described in people with a history of RHI present similarly to both PD and AD, it is possible that some of the RHI-specific features learned in Stage 2 will be preserved in Stage 3 [8]. This proposed progression of feature learning is shown in Figure 2. ![Figure 2:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/09/29/2021.09.20.21263753/F2.medium.gif) [Figure 2:](http://medrxiv.org/content/early/2021/09/29/2021.09.20.21263753/F2) Figure 2: Depiction of the proposed feature learning resulting from the multi-step transfer learning approach. a) Stage 1 results in the learning of low-level features pertinent to speaker recognition. b) Stage 2 involves the transfer of the low-level features from Stage 1 and the learning of RHI-related features. c) Stage 3 involves the transfer of features from Stage 1 and 2 as well as the learning of AD or PD-specific features. The feature extraction, training and evaluation of the model were mediated through a modified SpeechBrain recipe. Speech-Brain is an opensource toolkit that provides end-to-end methods for various audio processing tasks [67]. The recipe adapted for our project converts raw audio sampled at 16 kHz into short-time Fourier transformed (STFT) filter banks (FBANKS) and augments the data with noise using the Room Impulse Response and Noise (RIRS) Database, while also randomly dropping a specified number of frequencies [45]. FBANKS are widely used features in speech and speaker recognition, especially when using deep learning approaches [24]. The SpeechBrain recipe employs the ECAPA-TDNN model as the embedding extractor. Fundamentally, the ECAPA-TDNN converts variable length utterances into fixed-length embeddings (here termed BC-vectors) which can then be used to characterize speech, as shown in Figure 3. The comparison of BC-vectors determines which speaker (or group) an utterance belongs to. The ECAPA-TDNN model differs from previous TDNN models in the inclusion of ResNet-like layers, termed SE-Res2Blocks, that are interjected with skip connections within the TDNN layers. This allows for greater “… emphasis on channel attention, propagation and aggregation”, which has led to state-of-the-art accuracy in speaker recognition tasks [27, 25]. Once the BC-vectors are extracted, they are output to a classifier which computes the cosine similarity with additive angular margin (AAM) loss [26]. ![Figure 3:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/09/29/2021.09.20.21263753/F3.medium.gif) [Figure 3:](http://medrxiv.org/content/early/2021/09/29/2021.09.20.21263753/F3) Figure 3: Simplified depiction of the ECAPA-TDNN model used in this study. ## 3. Materials ### 3.1. VoxCeleb Datasets The VoxCeleb datasets, termed VoxCeleb 1 and VoxCeleb 2 are speech corpora composed of over 2000 hours of speech from over 7000 speakers, downloaded from publicly available videos on YouTube [60]. These datasets are commonly used to benchmark state-of-the-art performance in speaker recognition tasks [19]. ### 3.2. Verus Dataset The Verus dataset is composed of 1210, 13-second recordings, each corresponding to a distinct individual. The recordings were downloaded from publicly available interviews on YouTube and edited using the Audacity application [5]. Each recording is composed of 13 seconds of uninterrupted spontaneous speech. The corpus can be further divided into the RHI and control group, each having 605 recordings. The RHI group is composed of active or retired professional boxers, MMA, Kickboxers and Muay Thai fighters. To be considered a professional, they must have currently or previously competed in a professionally sanctioned league, such as the UFC, Glory, One Championship, Bellator, or other similar leagues. Their affiliation to these leagues was verified by consulting [boxrec.com](http://boxrec.com) for boxers, [glorykickboxing.com](http://glorykickboxing.com) for kickboxers, [sherdog.com](http://sherdog.com) for MMA, and [onefc.com](http://onefc.com) for Muay Thai fighters. Professional fighters were selected for the RHI group as data indicate they suffer significantly more RHI and concussions than players of contact sports like football or hockey [84, 65, 17]. The control group is composed of 605 recordings of active or retired professional athletes from non-contact sports such as basketball, baseball, soccer, golf, tennis, cricket, track and field and rowing. Professional athletes were chosen as the control group as a means to control for differences that might exist between professional fighters and non-athletes, such as differences in education, socioeconomic status or hormone levels [2, 33]. The RHI and control groups were controlled for age, gender and race/ethnicity. We also calculated the mean fundamental frequency (F0) and mean energy of all the recordings of fighters and athletes, as displayed in Table 3. The F0 is commonly used as a proxy of the average pitch of a recording [37]. We computed the F0 through Librosa’s use of the YIN algorithm [53]. The mean energy is used to indicate the average intensity or loudness of a recording [44]. The mean energy was also computed using Librosa. These metrics were calculated to ensure that simple, uninformative differences such as pitch or loudness did not exist between the two groups. Tables 1 and 2 provide a more thorough description of the demographic data pertaining to the Verus dataset while Table 3 contains the F0 and mean energy scores. This dataset was used to train the RHI-detection model in Stage 2. Before training, the Verus dataset was split into training and validation sets using stratified k-10 cross validation (CV). The final accuracy will be the average accuracy across the 10 validation folds, with each fold composed of a distinct subset of the data [78]. This CV approach was also applied to the DementiaBank and Gita datasets described below. View this table: [Table 1](http://medrxiv.org/content/early/2021/09/29/2021.09.20.21263753/T1) Table 1 Demographic data of the Verus dataset. * Denotes overlap between “Oceania” and “White”. STD = standard deviation, N = number of subjects. View this table: [Table 2](http://medrxiv.org/content/early/2021/09/29/2021.09.20.21263753/T2) Table 2 RHI group of the RHI Control dataset and their corresponding RHI-related history. View this table: [Table 3](http://medrxiv.org/content/early/2021/09/29/2021.09.20.21263753/T3) Table 3 Demographic data within the control and RHI groups of the Verus dataset. * Denotes overlap between “Oceania” and “White”. STD = standard deviation, N = number of subjects. ### 3.3. RHI Control Dataset The RHI control dataset consists of 10, 13 second voice recordings. The recordings were downloaded from publicly available interviews on YouTube and edited using the Audacity application [5]. The dataset is divided into a “RHI” and “control” group. The RHI group consists of 5 individuals who have a history of either CTE, probable CTE, or RHI-BC. The control group consists of three chess players and two golfers. The relevant medical history of the RHI group is shown in Table 2. ### 3.4. DementiaBank Dataset The DementiaBank dataset was created by [7] as a means to better characterize the behavioural neurological factors that can aide in diagnosing AD. The section of this dataset used in our study was the “Cookie Theft” description task, where a subject was asked to describe a scene shown as a picture. We included recordings of participants who were listed as either “probable AD” or “possible AD”, resulting in 259 recordings from 167 speakers. We also included 236 recordings from 97 healthy controls. Each recording was edited to include 13 seconds of uninterrupted spontaneous speech, using Audacity [5]. In total, 60 recordings did not contain 13 seconds of uninterrupted speech and were therefore edited to include the maximum amount of uninterrupted speech under 13 seconds. ### 3.5. Gita Dataset The Gita dataset is a speech corpus consisting of 50 subjects with PD and 50 healthy controls matched for age and gender [62]. The corpus included in this study is that of a monologue, where participants were asked to describe their daily routines. We included 100, 13 second recordings of uninterrupted spontaneous speech of each individual listed in the study. This resulted in 50 recordings of subjects with PD and 50 recordings of healthy controls. Each recording was edited using Audacity [5]. ## 4. Results and Discussion ### 4.1. RHI Detection The average accuracy over 10 folds of the Verus dataset was 85%. The accuracy of each fold, as well as the average accuracy and standard deviation are shown in Table 4. The youngest age of individuals correctly classified to the RHI group was 21, all four of which were boxers. This is the first time we are aware of that speech has been used to accurately differentiate individuals with a history of significant RHI from those without. Speech problems have long been associated with RHI, yet how they relate to RHI-BC or CTE severity has not been sufficiently explored. Literature pertaining to boxers indicates that motor speech disorders occur relatively frequently [51, 23, 46]. However, the prevalence of motor speech disorders in the clinical description of CTE as depicted by [55] is less clear. Dysarthria is sometimes listed as a symptom of CTE, while language impairments are commonly listed as symptoms of advanced stages of CTE [58, 4]. Motor speech disorders differ from language impairments in both their symptoms and underlying pathophysiology. Motor speech disorders (such as dysarthria) are characterized by difficulties in producing or pronouncing words or sounds [28]. Language impairments refer to difficulty in understanding or creating language, which may occur without any difficulties in the mechanical production of sounds [12]. Motor speech disorders are generally caused by damage to regions in the brain responsible for the motor control and coordination of the muscles used to create speech. Regarding professional fighters, brain damage has been found in the form of cerebellar scarring, depigmentation of the substantia nigra, pyramidal and extrapyramidal dysfunction and cerebral atrophy [17, 21, 58]. Cerebellar scarring can lead to ataxic dysarthria, a condition which presents with irregular rate of speech and stress patterns, scanning speech and improper articulation [74]. Extrapyramidal dysfunction and depigmentation of the substantia nigra can result in parkinsonism, a condition where speech may be affected by changes in speed, decreased volume and pitch variability, pronunciation abnormalities and decreased intelligibility [88, 35]. Indeed, parkinsonism, ataxic dysarthria and other forms of dysarthria have all been documented in professional fighters, while parkinsonism is also listed as a symptom of advanced CTE [55, 17, 21, 58]. However, language impairments can be caused by damage to cortical regions of the brain [12]. Their occurrence in the later stages of CTE is likely due to the pathological progression of CTE, where cortical regions tend to be affected later in the disease [1]. Further, [9] found that although significant differences in regional brain volumes existed between professional fighters and controls, cognitive tests were largely indistinguishable. These data seem to indicate that higher order processes like language and cognition are not obviously affected in the early stages of CTE or RHI-BC. This has been indicated in other neurodegenerative disorders [36, 41, 83]. However, motor speech disorders, which are caused by damage to subcortical or hindbrain regions, might manifest significantly earlier. This would be consistent with our finding that four boxers at just 21 years of age were correctly classified to the RHI group. View this table: [Table 4](http://medrxiv.org/content/early/2021/09/29/2021.09.20.21263753/T4) Table 4 Average F0 and mean energy scores calculated using the Verus dataset. STD = standard deviation. ### 4.2. RHI Control A limitation of the relatively large Verus dataset is that we could not gather relevant medical history of every individual. As not every professional fighter will develop a neurodegenerative disorder, and some athletes in non-contact sports can indeed develop neurodegenerative disorders [48], we reasoned that the accuracy of our model on the Verus dataset might be lower than in a clinically controlled setting. Thus, we created a small dataset in which the RHI group was composed of individuals who were either diagnosed with CTE, suspected of having CTE, or had received brain imaging results suggestive of trauma-induced damage. Further, the control group was composed of three chess players (i.e. non-athletes) and 2 golfers. The relevant information of these individuals is shown in Table 5. Our model trained on the Verus dataset achieved 100% accuracy in predicting whether an individual belonged to the RHI or control group. While this is a small sample size, there are some points that merit consideration. Our model accurately classified an individual who was diagnosed post-mortem as having CTE, but was a professional wrestler, not a fighter. Further, our model also accurately classified the three chess players as belonging to the control group. This suggests the model can generalize to individuals who were not either professional fighters nor athletes. These findings also suggest that our model may perform at over 85% accuracy when the medical history of the subjects is already known. View this table: [Table 5](http://medrxiv.org/content/early/2021/09/29/2021.09.20.21263753/T5) Table 5 K-Fold Cross Validation Results of the Verus Dataset. STD = standard deviation ### 4.3. Alzheimer’s Disease Detection The average accuracy over 10 folds of the DementiaBank dataset was 84.99%, as shown in Table 6. This is the highest accuracy we are aware of achieved on this dataset. [29] achieved 81.9% accuracy using logistic regression and what was described as a mixture of 4 latent variables, generally corresponding to syntax/fluency, semantics, acoustics, and other factors. In a more recent publication, [90] achieved F1-micro and F1-macro scores of 0.799. They again used a combination of acoustic and linguistic features, with “Consensus Networks” (CN), deep neural networks that contain a “discriminator” which functions similarly to a generative adversarial network (GAN). Further, [89] achieved 88% accuracy on a subset of the DementiaBank dataset, while [40] achieved 91% accuracy. However, ([89] allowed for “spillover”, meaning data from the same patients were used in both the training and validation set. Further, [40] included transcripts from at least the “Cookie Theft” and “Recall” tasks and did not indicate if they also included transcripts from the “Sentence” and “Fluency” tasks. They also did not indicate whether they included patients diagnosed with a disorder other than AD in the AD group. Nonetheless, the literature indicates that attempts at using machine learning to detect AD rely more on linguistic than acoustic features [49]. This trend might exist due to theoretical knowledge pertaining to AD. Specifically, the symptoms of AD are often attributed to the accumulation of misfolded proteins in the cerebral cortex. It is thought that memory and cognition problems are amongst the first symptoms in AD, with motor problems not occurring until later stages when the misfolded proteins have significantly accumulated in subcortical structures [87]. However, recent findings have shown that motor problems are prevalent in AD and might present preclinically [15, 79, 14]. This aligns with our findings, as we achieved state-of-the-art performance on the DementiaBank dataset using a model which is text independent. To clarify, this means our model was not able to utilize or learn any linguistic features, but instead was able to differentiate subjects based on speech characteristics like syllable rate, pitch and others. Our findings provide motivation to further investigate the motor changes evident in speech which could serve as an early biomarker of AD. View this table: [Table 6](http://medrxiv.org/content/early/2021/09/29/2021.09.20.21263753/T6) Table 6 K-Fold Cross Validation Results of the DementiaBank dataset. STD = standard deviation ### 4.4. Parkinson’s Disease Detection The average accuracy over 10 folds of the Gita dataset was 89% with a standard deviation of 7.3, as shown in Table 7. The accuracy reported in [62] on this section of the dataset was 81% with a standard deviation of 7. To determine if the difference between these two scores was statistically significant, we performed a two-tailed t-test assuming unequal variance and an alpha value of 0.05. As the t-test indicated a p-value of 0.022, we can assume our higher accuracy is statistically significant from that achieved by [62]. View this table: [Table 7](http://medrxiv.org/content/early/2021/09/29/2021.09.20.21263753/T7) Table 7 K-Fold Cross Validation Results of the Gita Dataset. STD = standard deviation ## 5. Limitations As previously stated, the current trend in speaker recognition and related tasks is to use deep learning approaches which learn features, as opposed to the previously standard approach of engineering features to be fed to algorithms which do not use deep neural networks [39]. This has lead to a limitation in our study in the form of a trade-off. Namely, our deep learning approach results in higher accuracy at the expense of interpretability. However, as the relationship between speech and RHI-BC has not been sufficiently characterized, we reasoned feature engineering would be inferior to our deep learning approach as it is not known which features are relevant to the task. Another limitation to our study is that the medical history of the subjects used was not known. We have attempted to address this limitation, yet further studies in a clinically controlled setting are needed. Another limitation was that we did not control for bilingualism. Recent data indicate that bilingualism may act as a form of “cognitive reserve”, thus delaying onset of symptoms in neurodegenerative disorders like AD [56, 22]. It should be noted that bilingualism appears to delay deficits in executive functioning and has not been linked to motor-speech disorders, yet it is still possible that our classification accuracies would be higher if we excluded individuals who were bilingual. ## 6. Conclusion The long-term consequences of RHI are still being investigated, with indications that a history of RHI can increase the risk of developing CTE, AD, ALS, PD and other disorders [13, 34, 43, 42, 76]. Unfortunately, there are no accepted biomarkers of RHI-BC or CTE, thus tracking their progression and mapping the level of RHI exposure to disease severity has proven difficult [9]. In this study, we found that a 13 second voice recording can be used to differentiate those with a history of RHI from this without at 85% accuracy. We then used the ECAPA-TDNN model trained on RHI detection to achieve state-of-the-art results in detecting AD and PD on the DementiaBank and Gita datasets, respectively. Machine learning techniques have often focused heavily on language impairments when detecting AD [29, 40, 49, 63]. Our study is the first we are aware of to focus exclusively on acoustic, text-independent features of speech. This might indicate that motor speech problems can serve as a biomarker of AD. Further, our success in transfer learning is evidence that the features learned by the ECAPA-TDNN model trained on the Verus dataset were related to speech changes associated with brain damage. Finding a biomarker of RHI-BC or CTE that presents early in disease progression could help with risk management and improve patient outcome. Indeed, our model accurately classified 4 boxers of just 21 years of age as belonging to the RHI group, suggesting speech might serve as a such a biomarker. Future research should aim to characterize the relationship between speech disorders and progression of RHI-related neurodegenerative disorders. ## Data Availability All code is available upon request. Externally owned data may be available through the provided URLs. The Verus dataset will be available upon request and following copyright registration. [https://dementia.talkbank.org/](https://dementia.talkbank.org/) [https://www.robots.ox.ac.uk/~vgg/data/voxceleb/](https://www.robots.ox.ac.uk/~vgg/data/voxceleb/) ## 8. Declaration of Interest M Ravanelli and CA Droppelmann declare no competing interests. MG Tauro has received no commercial support for this work yet may commercialise some results from this work in the future. ## 7. Acknowledgements The authors would like to thank Dr. Mariano Avino for his valuable support and assistance. * Received September 20, 2021. * Revision received September 20, 2021. * Accepted September 29, 2021. * © 2021, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/) ## References 1. [1].Alosco, M.L., Cherry, J.D., Huber, B.R., Tripodis, Y., Baucom, Z., Kowall, N.W., Saltiel, N., Goldstein, L.E., Katz, D.I., Dwyer, B., et al., 2020. Characterizing tau deposition in chronic traumatic encephalopathy (cte): utility of the mckee cte staging scheme. Acta Neuropathologica 140, 495–512. 2. [2].Aquilina, D., 2013. A study of the relationship between elite athletes’ educational development and sporting performance. The International Journal of the History of Sport 30, 374–392. 3. [3].Arias-Vergara, T., Vásquez-Correa, J.C., Orozco-Arroyave, J.R., Vargas-Bonilla, J.F., Nöth, E., 2016. Parkinson’s disease progression assessment from speech using gmm-ubm., in: Interspeech, pp. 1933–1937. 4. [4].Asken, B.M., Sullan, M.J., Snyder, A.R., Houck, Z.M., Bryant, V.E., Hizel, L.P., McLaren, M.E., Dede, D.E., Jaffee, M.S., DeKosky, S.T., et al., 2016. Factors influencing clinical correlates of chronic traumatic encephalopathy (cte): a review. Neuropsychology review 26, 340–363. 5. [5].Audacity, T., 2013. Audacity. 6. [6].Bai, Z., Zhang, X.L., 2021. Speaker recognition based on deep learning: An overview. Neural Networks. 7. [7].Becker, J.T., Boiler, F., Lopez, O.L., Saxton, J., McGonigle, K.L., 1994. The natural history of alzheimer’s disease: description of study cohort and accuracy of diagnosis. Archives of Neurology 51, 585–594. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/archneur.1994.00540180063015&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=8198470&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F29%2F2021.09.20.21263753.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1994NQ14600009&link_type=ISI) 8. [8].Bengio, Y., 2012. Deep learning of representations for unsupervised and transfer learning, in: Proceedings of ICML workshop on unsupervised and transfer learning, JMLR Workshop and Conference Proceedings. pp. 17–36. 9. [9].Bernick, C., Shan, G., Zetterberg, H., Banks, S., Mishra, V.R., Bekris, L., Leverenz, J.B., Blennow, K., 2020. Longitudinal change in regional brain volumes with exposure to repetitive head impacts. Neurology 94, e232–e240. 10. [10].Bhattacharya, G., Alam, M.J., Kenny, P., 2017. Deep speaker embeddings for short-duration speaker verification., in: Interspeech, pp. 1517–1521. 11. [11].Bieniek, K.F., Blessing, M.M., Heckman, M.G., Diehl, N.N., Serie, A.M., Paolini, M.A., Boeve, B.F., Savica, R., Reichard, R.R., Dickson, D.W., 2020. Association between contact sports participation and chronic traumatic encephalopathy: a retrospective cohort study. Brain Pathology 30, 63–74. 12. [12].Bishop, D.V., 1992. The underlying nature of specific language impairment. Journal of child psychology and psychiatry 33, 3–66. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/j.1469-7610.1992.tb00858.x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=1737831&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F29%2F2021.09.20.21263753.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1992HE46300002&link_type=ISI) 13. [13].Blecher, R., Elliott, M.A., Yilmaz, E., Dettori, J.R., Oskouian, R.J., Patel, A., Clarke, A., Hutton, M., McGuire, R., Dunn, R., et al., 2019. Contact sports as a risk factor for amyotrophic lateral sclerosis: A systematic review. Global spine journal 9, 104–118. 14. [14].Bologna, M., Guerra, A., Colella, D., Cioffi, E., Paparella, G., Di Vita, A., D’Antonio, F., Trebbastoni, A., Berardelli, A., 2020. Bradykinesia in alzheimer’s disease and its neurophysiological substrates. Clinical Neurophysiology 131, 850–858. 15. [15].Buchman, A.S., Bennett, D.A., 2011. Loss of motor function in preclinical alzheimer’s disease. Expert review of neurotherapeutics 11, 665–676. 16. [16].Buckland, M.E., Sy, J., Szentmariay, I., Kullen, A., Lee, M., Harding, A., Halliday, G., Suter, C.M., 2019. Chronic traumatic encephalopathy in two former australian national rugby league players. Acta neuropathologica communications 7, 1–4. 17. [17].Casson, I.R., Viano, D.C., 2019. Long-term neurological consequences related to boxing and american football: a review of the literature. Journal of Alzheimer’s disease 69, 935–952. 18. [18].Casson, I.R., Viano, D.C., Haacke, E.M., Kou, Z., LeStrange, D.G., 2014. Is there chronic brain damage in retired nfl players? neuroradiology, neuropsychology, and neurology examinations of 45 retired players. Sports health 6, 384–395. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1177/1941738114540270&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25177413&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F29%2F2021.09.20.21263753.atom) 19. [19].Chung, J.S., Nagrani, A., Coto, E., Xie, W., McLaren, M., Reynolds, D.A., Zisserman, A., 2019. Voxsrc 2019: The first voxceleb speaker recognition challenge. arXiv preprint arxiv:1912.02522. 20. [20].Cohan, A.,. Micky ward’s toughest fight. Boston Herald URL: [https://www.bostonherald.com/2020/02/03/micky-wards-toughest-fight/](https://www.bostonherald.com/2020/02/03/micky-wards-toughest-fight/). 21. [21].Corsellis, J., Bruton, C., Freeman-Browne, D., 1973. The aftermath of boxing1. Psychological medicine 3, 270–303. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1017/S0033291700049588&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=4729191&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F29%2F2021.09.20.21263753.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1973Q562400003&link_type=ISI) 22. [22].Costumero, V., Marin-Marin, L., Calabria, M., Belloch, V., Escudero, J., Baquero, M., Hernandez, M., de Miras, J.R., Costa, A., Parcet, M.A., et al., 2020. A cross-sectional and longitudinal study on the protective effect of bilingualism against dementia using brain atrophy and cognitive measures. Alzheimer’s research & therapy 12, 1–10. 23. [23].Critchley, M., 1957. Medical aspects of boxing, particularly from a neurological standpoint. British medical journal 1, 357. [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6MzoiUERGIjtzOjExOiJqb3VybmFsQ29kZSI7czozOiJibWoiO3M6NToicmVzaWQiO3M6MTA6IjEvNTAxNS8zNTciO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMS8wOS8yOS8yMDIxLjA5LjIwLjIxMjYzNzUzLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 24. [24].Cyrta, P., Trzciriski, T., Stokowiec, W., 2017. Speaker diarization using deep recurrent convolutional neural networks for speaker embeddings, in: International Conference on Information Systems Architecture and Technology, Springer. pp. 107–117. 25. [25].Dawalatabad, N., Ravanelli, M., Grondin, F., Thienpondt, J., Desplanques, B., Na, H., 2021. Ecapa-tdnn embeddings for speaker diarization. arXiv preprint arxiv:2104.01466. 26. [26].Deng, J., Guo, J., Xue, N., Zafeiriou, S., 2019. Arcface: Additive angular margin loss for deep face recognition, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4690–4699. 27. [27].Desplanques, B., Thienpondt, J., Demuynck, K., 2020. Ecapa-tdnn: Emphasized channel attention, propagation and aggregation in tdnn based speaker verification. arXiv preprint arxiv:2005.07143. 28. [28].Duffy, J.R., 2013. Motor Speech disorders-E-Book: Substrates, differential diagnosis, and management. Elsevier Health Sciences. 29. [29].Fraser, K.C., Meltzer, J.A., Rudzicz, F., 2016. Linguistic features identify alzheimer’s disease in narrative speech. Journal of Alzheimer’s Disease 49, 407–422. 30. [30].Gaetz, M., 2017. The multi-factorial origins of chronic traumatic encephalopathy (cte) symptomology in post-career athletes: the athlete post-career adjustment (ap-ca) model. Medical hypotheses 102, 130–143. 31. [31].Gales, M., Young, S., 2008. The application of hidden markov models in speech recognition. 32. [32].Gardner, R.C., Yaffe, K., 2015. Epidemiology of mild traumatic brain injury and neurodegenerative disease. Molecular and Cellular Neuroscience 66, 75–80. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.mcn.2015.03.001&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25748121&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F29%2F2021.09.20.21263753.atom) 33. [33].Godfrey, R.J., Madgwick, Z., Whyte, G.P., 2003. The exerciseinduced growth hormone response in athletes. Sports medicine 33, 599–613. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2165/00007256-200333080-00005&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=12797841&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F29%2F2021.09.20.21263753.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000184014300005&link_type=ISI) 34. [34].Goldman, S.M., Tanner, C.M., Oakes, D., Bhudhikanok, G.S., Gupta, A., Langston, J.W., 2006. Head injury and parkinson’s disease risk in twins. Annals of neurology 60, 65–72. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/ana.20882&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16718702&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F29%2F2021.09.20.21263753.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000238825500012&link_type=ISI) 35. [35].Heisters, D., 2011. Parkinson’s: symptoms, treatments and research. British Journal of Nursing 20, 548–554. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21647015&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F29%2F2021.09.20.21263753.atom) 36. [36].Ho, A.K., Iansek, R., Marigliani, C., Bradshaw, J.L., Gates, S., 1998. Speech impairment in a large sample of patients with parkinson’s disease. Behavioural neurology 11, 131–137. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11568413&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F29%2F2021.09.20.21263753.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000084045000002&link_type=ISI) 37. [37].Honorof, D.N., Whalen, D.H., 2005. Perception of pitch location within a speaker’s f0 range. The Journal of the Acoustical Society of America 117, 2193–2200. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15898660&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F29%2F2021.09.20.21263753.atom) 38. [38].Institute, S.L.,. Wrestler chris benoit brain’s forensic exam consistent with numerous brain injuries. ScienceDaily URL: [https://www.sciencedaily.com/releases/2007/09/070905224343.htm](https://www.sciencedaily.com/releases/2007/09/070905224343.htm). 39. [39].Karan, B., Sahu, S.S., Orozco-Arroyave, J.R., Mahto, K., 2021. Nonnegative matrix factorization-based time-frequency feature extraction of voice signal for parkinson’s disease prediction. Computer Speech & Language 69, 101216. 40. [40].Karlekar, S., Niu, T., Bansal, M., 2018. Detecting linguistic characteristics of alzheimer’s dementia by interpreting neural models. arXiv preprint arxiv:1804.06440. 41. [41].Kehagia, A.A., Barker, R.A., Robbins, T.W., 2010. Neuropsychological and clinical heterogeneity of cognitive impairment and dementia in patients with parkinson’s disease. The Lancet Neurology 9, 1200–1213. 42. [42].Kerr, Z.Y., Marshall, S.W., Harding Jr, H.P., Guskiewicz, K.M., 2012. Nine-year risk of depression diagnosis increases with increasing selfreported concussions in retired professional football players. The American journal of sports medicine 40, 2206–2212. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1177/0363546512456193&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22922518&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F29%2F2021.09.20.21263753.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000309351000007&link_type=ISI) 43. [43].Kiraly, M.A., Kiraly, S.J., 2007. Traumatic brain injury and delayed sequelae: a review-traumatic brain injury and mild traumatic brain injury (concussion) are precursors to later-onset brain disorders, including early-onset dementia. TheScientificWorldJournal 7, 1768–1776. 44. [44].Knees, P., Schedl, M., 2016. Basic methods of audio signal processing, in: Music similarity and retrieval. Springer, pp. 33–50. 45. [45].Ko, T., Peddinti, V., Povey, D., Seltzer, M.L., Khudanpur, S., 2017. A study on data augmentation of reverberant speech for robust speech recognition, in: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE. pp. 5220–5224. 46. [46].Krasilshchikova, S., Neel Ph D, A., Richardson Ph D, J.D., Arenas Ph D, R., Bennett Ph D, L., Banks Ph D, S., Bernick Ph D, C., 2020. Speech characteristics of professional fighters. 47. [47].LeCun, Y., Bengio, Y., Hinton, G., 2015. Deep learning. nature 521, 436–444. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/nature14539&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26017442&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F29%2F2021.09.20.21263753.atom) 48. [48].Mackay, D.F., Russell, E.R., Stewart, K., MacLean, J.A., Pell, J.P., Stewart, W., 2019. Neurodegenerative disease mortality among former professional soccer players. New England Journal of Medicine 381, 1801–1808. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1056/NEJMoa1908483&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F29%2F2021.09.20.21263753.atom) 49. [49].Mahajan, P., Baths, V., 2021. Acoustic and language based deep learning approaches for alzheimer’s dementia detection from spontaneous speech. Frontiers in Aging Neuroscience 13. 50. [50].Maroon, J.C., Winkelman, R., Bost, J., Amos, A., Mathyssek, C., Miele, V., 2015. Chronic traumatic encephalopathy in contact sports: a systematic review of all reported pathological cases. PloS one 10, e0117338. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0117338&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25671598&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F29%2F2021.09.20.21263753.atom) 51. [51].Martland, H.S., 1928. Punch drunk. Journal of the American Medical Association 91, 1103–1107. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/jama.1928.02700150029009&link_type=DOI) 52. [52].McCrory, P., Meeuwisse, W.H., Echemendia, R.J., Iverson, G.L., Dvořák, J., Kutcher, J.S., 2013. What is the lowest threshold to make a diagnosis of concussion? British Journal of Sports Medicine 47, 268–271. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiYmpzcG9ydHMiO3M6NToicmVzaWQiO3M6ODoiNDcvNS8yNjgiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMS8wOS8yOS8yMDIxLjA5LjIwLjIxMjYzNzUzLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 53. [53].McFee, B., Lostanlen, V., McVicar, M., Metsai, A., Balke, S., Thomé, C., Raffel, C., Malek, A., Lee, D., Zalkow, F., Lee, K., Nieto, O., Mason, J., Ellis, D., Yamamoto, R., Seyfarth, S., Battenberg, E.,,, Bittner, R., Choi, K., Moore, J., Wei, Z., Hidaka, S., nullmightybofo, Friesch P., Stöter, F.R., Hereñú, D., Kim, T., Vollrath, M., Weiss, A., 2020. librosa/librosa: 0.7.2. URL: [https://doi.org/10.5281/zenodo.3606573](https://doi.org/10.5281/zenodo.3606573), doi:10.5281/zenodo.3606573. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.5281/zenodo.3606573&link_type=DOI) 54. [54].McKee, A.C., Cairns, N.J., Dickson, D.W., Folkerth, R.D., Keene, C.D., Litvan, I., Perl, D.P., Stein, T.D., Vonsattel, J.P., Stewart, W., et al., 2016. The first ninds/nibib consensus meeting to define neuropathological criteria for the diagnosis of chronic traumatic encephalopathy. Acta neuropathologica 131, 75–86. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s00401-015-1515-z&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26667418&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F29%2F2021.09.20.21263753.atom) 55. [55].McKee, A.C., Stein, T.D., Nowinski, C.J., Stern, R.A., Daneshvar, D.H., Alvarez, V.E., Lee, H.S., Hall, G., Wojtowicz, S.M., Baugh, C.M., et al., 2013. The spectrum of disease in chronic traumatic encephalopathy. Brain 136, 43–64. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/brain/aws307&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23208308&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F29%2F2021.09.20.21263753.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000314909900006&link_type=ISI) 56. [56].Mendez, M.F., 2019. Bilingualism and dementia: cognitive reserve to linguistic competency. Journal of Alzheimer’s Disease 71, 377–388. 57. [57].Mez, J., Daneshvar, D.H., Kiernan, P.T., Abdolmohammadi, B., Alvarez, V.E., Huber, B.R., Alosco, M.L., Solomon, T.M., Nowinski, C.J., McHale, L., et al., 2017. Clinicopathological evaluation of chronic traumatic encephalopathy in players of american football. Jama 318, 360–370. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/jama.2017.8334&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=28742910&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F29%2F2021.09.20.21263753.atom) 58. [58].Montenigro, P.H., Bernick, C., Cantu, R.C., 2015. Clinical features of repetitive traumatic brain injury and chronic traumatic encephalopathy. Brain pathology 25, 304–317. 59. [59].Moro-Velazquez, L., Villalba, J., Dehak, N., 2020. Using x-vectors to automatically detect parkinson’s disease from speech, in: ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE. pp. 1155–1159. 60. [60].Nagrani, A., Chung, J.S., Zisserman, A., 2017. Voxceleb: a largescale speaker identification dataset. arXiv preprint arxiv:1706.08612. 61. [61].Omalu, B., Hammers, J.L., Bailes, J., Hamilton, R.L., Kamboh, M.I., Webster, G., Fitzsimmons, R.P., 2011. Chronic traumatic encephalopathy in an iraqi war veteran with posttraumatic stress disorder who committed suicide. Neurosurgical focus 31, E3. 62. [62].Orozco-Arroyave, J.R., Arias-Londoño, J.D., Vargas-Bonilla, J.F., Gonzalez-Rativa, M.C., Nöth, E., 2014. New spanish speech corpus database for the analysis of people suffering from parkinson’s disease., in: LREC, pp. 342–347. 63. [63].Pappagari, R., Cho, J., Moro-Velazquez, L., Dehak, N., 2020. Using state of the art speaker recognition and natural language processing technologies to detect alzheimer’s disease and assess its severity. Proc. Interspeech 2020, 2177–2181. 64. [64].Press, A.,. Board resists norris’ return to ring. ESPN URL: [http://a.espncdn.com/boxing/news/2000/0216/360596.html](http://a.espncdn.com/boxing/news/2000/0216/360596.html). 65. [65].Prien, A., Grafe, A., Rössler, R., Junge, A., Verhagen, E., 2018. Epidemiology of head injuries focusing on concussions in team contact sports: a systematic review. Sports medicine 48, 953–969. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F29%2F2021.09.20.21263753.atom) 66. [66].Raj, D., Snyder, D., Povey, D., Khudanpur, S., 2019. Probing the information encoded in x-vectors, in: 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), IEEE. pp. 726–733. 67. [67].Ravanelli, M., Parcollet, T., Plantinga, P., Rouhe, A., Cornell, S., Lugosch, L., Subakan, C., Dawalatabad, N., Heba, A., Zhong, J., Chou, J.C., Yeh, S.L., Fu, S.W., Liao, C.F., Rastorgueva, E., Grondin, F., Aris, W., Na, H., Gao, Y., Mori, R.D., Bengio, Y., 2021. SpeechBrain: A general-purpose speech toolkit. arxiv:2106.04624. 2106.04624. 68. [68].Robin, J., Harrison, J.E., Kaufman, L.D., Rudzicz, F., Simpson, W., Yancheva, M., 2020. Evaluation of speech-based digital biomarkers: Review and recommendations. Digital Biomarkers 4, 99–108. 69. [69].Schwab, N., Wennberg, R., Grenier, K., Tartaglia, C., Tator, C., Hazrati, L.N., 2021. Association of position played and career duration and chronic traumatic encephalopathy at autopsy in elite football and hockey players. Neurology 96, e1835–e1843. 70. [70].Shaikhina, T., Khovanova, N.A., 2017. Handling limited datasets with neural networks in medical applications: A small-data approach. Artificial intelligence in medicine 75, 51–63. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.artmed.2016.12.003&link_type=DOI) 71. [71].Shaw, N.A., 2002. The neurophysiology of concussion. Progress in neurobiology 67, 281–344. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0301-0082(02)00018-7&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=12207973&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F29%2F2021.09.20.21263753.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000178438500002&link_type=ISI) 72. [72].Snyder, D., Garcia-Romero, D., McCree, A., Sell, G., Povey, D., Khudanpur, S., 2018a. Spoken language recognition using x-vectors., in: Odyssey, pp. 105–111. 73. [73].Snyder, D., Garcia-Romero, D., Sell, G., Povey, D., Khudanpur, S., 2018b. X-vectors: Robust dnn embeddings for speaker recognition, in: 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE. pp. 5329–5333. 74. [74].Spencer, K.A., Slocomb, D.L., 2007. The neural basis of ataxic dysarthria. The Cerebellum 6, 58–65. 75. [75].Stanley, B.,. The last punch: How concussions forced kiwi ufc pioneer james te huna from the octagon. Vice URL: [https://www.vice.com/en/article/aex774/the-last-punch-how-concussions-forced-kiwi-ufc-pioneer-james-te-huna-from-the-octagon](https://www.vice.com/en/article/aex774/the-last-punch-how-concussions-forced-kiwi-ufc-pioneer-james-te-huna-from-the-octagon). 76. [76].Stein, T.D., Alvarez, V.E., McKee, A.C., 2015. Concussion in chronic traumatic encephalopathy. Current pain and headache reports 19, 1–6. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s11916-014-0474-8&link_type=DOI) 77. [77].Stemper, B.D., Pintar, F.A., 2014. Biomechanics of concussion. Concussion 28, 14–27. 78. [78].Stone, M., 1974. Cross-validatory choice and assessment of statistical predictions. Journal of the Royal Statistical Society: Series B (Methodological) 36, 111–133. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2307/2984809&link_type=DOI) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1974U703600001&link_type=ISI) 79. [79].Suzumura, S., Osawa, A., Nagahama, T., Kondo, I., Sano, Y., Kandori, A., 2016. Assessment of finger motor skills in individuals with mild cognitive impairment and patients with alzheimer’s disease: Relationship between finger-to-thumb tapping and cognitive function. Japanese Journal of Comprehensive Rehabilitation Science 7, 19–28. 80. [80].Swick, M.,. After an mri showed a condition related to cte, he’s done in mma i mike swick podcast. URL: [https://www.youtube.com/watch?v=oGB\_tpgo9rQ](https://www.youtube.com/watch?v=oGB_tpgo9rQ). 81. [81].Tagge, C.A., Fisher, A.M., Minaeva, O.V., Gaudreau-Balderrama, A., Moncaster, J.A., Zhang, X.L., Wojnarowicz, M.W., Casey, N., Lu, H., Kokiko-Cochran, O.N., et al., 2018. Concussion, microvascular injury, and early tauopathy in young athletes after impact head injury and an impact concussion mouse model. Brain 141, 422–458. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/brain/awx350&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29360998&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F29%2F2021.09.20.21263753.atom) 82. [82].Tartaglia, M.C., Hazrati, L.N., Davis, K.D., Green, R.E., Wennberg, R., Mikulis, D., Ezerins, L.J., Keightley, M., Tator, C., 2014. Chronic traumatic encephalopathy and other neurodegenerative proteinopathies. Frontiers in human neuroscience 8, 30. 83. [83].Taylor, L.J., Brown, R.G., Tsermentseli, S., Al-Chalabi, A., Shaw, C.E., Ellis, C.M., Leigh, P.N., Goldstein, L.H., 2013. Is language impairment more common than executive dysfunction in amyotrophic lateral sclerosis? Journal of Neurology, Neurosurgery & Psychiatry 84, 494–498. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoiam5ucCI7czo1OiJyZXNpZCI7czo4OiI4NC81LzQ5NCI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIxLzA5LzI5LzIwMjEuMDkuMjAuMjEyNjM3NTMuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 84. [84].Tommasone, B.A., McLeod, T.C.V., 2006. Contact sport concussion incidence. Journal of athletic training 41, 470. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17273475&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F29%2F2021.09.20.21263753.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000243385800020&link_type=ISI) 85. [85].Vashkevich, M., Petrovsky, A., Rushkevich, Y., 2019. Bulbar als detection based on analysis of voice perturbation and vibrato, in: 2019 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA), IEEE. pp. 267–272. 86. [86].Washington, P.M., Villapol, S., Burns, M.P., 2016. Polypathology and dementia after brain trauma: does brain injury trigger distinct neurodegenerative diseases, or should they be classified together as traumatic encephalopathy? Experimental neurology 275, 381–388. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.expneurol.2015.06.015&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26091850&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F29%2F2021.09.20.21263753.atom) 87. [87].Wenk, G.L., 2003. Neuropathologic changes in alzheimer’s disease. The Journal of clinical psychiatry 64, 7–10. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=12934968&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F09%2F29%2F2021.09.20.21263753.atom) 88. [88].Wichmann, T., 2019. Changing views of the pathophysiology of parkinsonism. Movement Disorders 34, 1130–1143. 89. [89].Ye, Z., Hu, S., Li, J., Xie, X., Geng, M., Yu, J., Xu, J., Xue, B., Liu, S., Liu, X., et al., 2021. Development of the cuhk elderly speech recognition system for neurocognitive disorder detection using the dementiabank corpus, in: ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE. pp. 6433–6437. 90. [90].Zhu, Z., Novikova, J., Rudzicz, F., 2018. Detecting cognitive impairments by agreeing on interpretations of linguistic features. arXiv preprint arxiv:1808.06570.