Abstract
Objective In apraxia of speech (AOS), we observed impaired perceptual timing abilities, which lead us to propose a shared mechanism of impaired perceptual timing underlying impaired rhythm discrimination (perceptual processing) and AOS (motor speech output). Given that considerable white matter damage is observed in these patients, we investigate whether white matter changes are related to impaired rhythm processing as one possible mechanism underlying AOS.
Methods We applied deformation-based morphometry (DBM) and diffusion tensor imaging (DTI) in 12 patients with the nonfluent variant (NFV) of Primary Progressive Aphasia (PPA) with AOS, as well as 11 patients with the semantic variant and 24 cognitively intact mature controls.
Results Seventy-five percent of the patients with NFV displayed impaired rhythm discrimination. Severity of the rhythm discrimination impairment correlated with the patients’ speech rhythm abnormality measured from connected speech samples. Moreover, left frontal white matter volume loss adjacent to the supplementary motor area (SMA) correlated with impaired rhythm processing. In addition, we obtained tract-based metrics of the left Aslant tract, which is typically damaged in the NFV. The structural integrity of the left Aslant tract also correlated with rhythmic discrimination abilities in the NFV.
Conclusions A colocalized and perhaps shared white matter substrate adjacent to the SMA is associated with impaired rhythm discrimination and motor speech impairments. This indicates that impaired perceptual timing may be one of the neurocomputational mechanisms underlying AOS. Our observation that regional variations in left frontal lobe atrophy are linked to the phenotypical heterogeneity in NFV may facilitate earlier diagnosis.
Introduction
The nonfluent variant of Primary Progressive Aphasia (NFV) is characterized by apraxia of speech (AOS) or agrammatism1. NFV is phenotypically heterogeneous: impaired AOS and agrammatism can arise in combination or isolation and hence give rise to three subtypes: primary progressive apraxia of speech (ppAOS), progressive agrammatic aphasia or mixed agrammatism and AOS2–4. This heterogeneity suggests a diversity of underlying neuroanatomical substrates. Our study focuses on the in-depth characterization of the mechanism and substrates underpinning AOS. Grey matter atrophy is consistently found in NFV in the left opercular part (BA44) of the inferior frontal gyrus (IFG), insula, premotor and the supplementary motor areas (SMA)1,5. Involvement of the SMA and the left lateral superior premotor cortex is linked to ppAOS2,6. In ppAOS, neurodegeneration of the (pre)motor cortex is more focal compared to the widespread atrophy that extends to the frontotemporal regions in progressive agrammatic aphasia3,4.
Neuropathologically, NFV is somewhat heterogeneous, with a dominance of tau pathology in up to 88% of patients (Progressive Supranuclear Palsy (PSP), Corticobasal degeneration (CBD) or Pick’s disease). Sporadically, TDP43-proteinopathy or Alzheimer’s disease is found in NFV7–9. Frontal white matter changes are more common in NFV caused by tauopathies10, especially in CBD11. In-depth knowledge about the relation between phenotype, structural changes and neuropathology is lacking, whereas a better understanding of this relationship is necessary for targeted development of pharmacological and rehabilitation therapy. So far, neuroimaging studies have mainly focused on grey matter changes. White matter changes may also contribute to impaired speech production in NFV12,13. Here, we specifically test whether white matter changes in NFV relate to impaired non-linguistic processing using non-verbal rhythmic auditory stimuli that do not convey meaning. This is motivated by our finding that rhythm discrimination is significantly impaired in NFV with AOS14. Because impaired rhythm discrimination co-occurred with AOS, we hypothesized that both deficits are related through a common impairment in a “temporal scaffolding mechanism”, which structures input and output in time14,15. Our perceptual timing tasks untangle the processing of the lower-order (local) versus higher-order (global) temporal structure of acoustic stimuli. Clinically, patients with AOS do not have consistent problems producing individual phonemes, but have difficulties with the suprasegmental timing of their speech16. We postulate that the tasks indexing higher-order processing will be most relevant to AOS.
We have expanded our previously published dataset14 to include 12 NFV patients who participated in volumetric imaging. We test the correlation between perceptual timing deficits and tensor-based deformations of the brain (deformation-based morphometry, DBM) and diffusion tensor imaging (DTI) to elucidate the neuroanatomical correlate of the hypothesized temporal scaffolding mechanism. We opted for DBM rather than voxel-based morphometry (VBM) because automated segmentation in regions of abnormal grey and white matter might be unreliable and because DBM allows visualization of changes in subcortical structures containing grey and white matter17. DBM is also easier to interpret than VBM since it reflects atrophy without inference from other pathological white matter changes. Furthermore, we complement DBM with DTI. DTI is sensitive to white matter damage caused by tau pathology18,19 and DTI metrics may able to discriminate between tauopathy and TDP43-proteinopathy20. We focused on the left frontal Aslant tract, which connects BA44 to medial frontal areas including the SMA21. Damage of the left Aslant tract is considered specific for the NFV phenotype13,22. Since it connects IFG with SMA, which has been identified as a gray matter correlate of temporal regularity processing in NFV23,24, the left Aslant tract might also play a role in temporal scaffolding. Our approach aims to clarify the relationship between AOS, rhythm discrimination, and their potentially overlapping anatomical substrate to elucidate the phenotypical heterogeneity in NFV.
Methods
Participants
The study was approved by the Ethics Committee, University Hospitals Leuven. All participants provided written informed consent in accordance with the Declaration of Helsinki. PPA patients were recruited via the memory clinic University Hospitals Leuven. A consecutive series of 38 patients who fulfilled the international consensus criteria for PPA1 enrolled for the experiment (2011-2019). The first 23 patients were described in 14, and the same case numbers are used. Seven patients were excluded due to: hearing loss (n = 3); lack of ability to perform the experimental tasks due to disease severity (n = 2); lack of cooperation (n = 1); unique phenotype (foreign accent syndrome, n = 1). The remaining 31 patients were able to undergo the extensive testing and produce reliable data. Before enrollment, each patient was classified according to the 2011 recommendations1. The classification relied on the clinical evaluation by an experienced neurologist (R.V.), in combination with neurolinguistics assessment and clinical MRI, as well as, where available, [18F]fluorodeoxyglucose PET ([18F]-FDG PET), CSF biomarkers for Alzheimer’s Disease and [11C]-Pittsburgh compound B amyloid PET. Twelve cases were classified as NFV (Table 1), 11 as the semantic variant (SV), and 8 as the logopenic variant (LV). The LV group will not be discussed because of the smaller sample size compared to the NFV and SV groups. All NFV cases exhibited AOS and 5 patients (case 20-23 and 31) also displayed single-word comprehension deficits upon testing and would also fit the more recently described criteria for the “mixed variant”25,26. All patients received a volumetric MRI scan and 7 NFV and 7 SV DTI imaging. Twenty-nine healthy controls (15 male, age range 51-76, education range 9-22 years) performed the perceptual timing tasks, 24 received volumetric MRI and 20 DTI imaging. Hearing sensitivity was measured in all participants using a clinical Bekesy-type audiometer for frequencies of 0.25, 0.5, 1, 2, 4 and 8kHz, on the left and right ear. Impaired pure-tone perception has been observed in NFV27, but here we included only participants able to detect stimuli of up to 1000 Hz below a hearing level of 30 dB on at least one side (Fig. 1). Controls, NFV and SV were matched in terms of age, gender, education or better-ear mean score (one-way ANOVA all P>0.136).
Behavioral testing
Confrontation naming was tested using the Boston Naming test (Dutch norms). Non-verbal executive functioning was evaluated using Raven’s Coloured Progressive Matrices. Speech repetition was assessed using the Akense Afasie test. To assess AOS, the Diagnostisch Instrument voor Apraxie van de Spraak (DIAS) was added when it became available (for this reason it was not performed in 4/12 cases). The DIAS consists of vowel and consonant repetition (15 trials each) and diadochokinesis testing. During the latter task, the examiner first reads three successive alternating syllables aloud, e.g., “pa ta ka” and asks the patient to repeat these. If successful, he/she was asked to repeat it as many times as possible during a period of 8 s. The diadochokinesis severity score is the sum of correctly repeated syllables across trials. Grammaticality was assessed using the auditory sentence comprehension test of the Werkwoorden en Zinnen Test (WEZT), consisting of 40 sentence-picture matching trials with active or passive sentences containing possible role reversal (e.g. “the horse was kicked by the cow”).
Connected speech analysis
To obtain a measure of speech rhythm, we determined the normalized pairwise variability index (PVI) in connected speech samples using Praat 6.1.02. The samples consisted of a 2-minute “Cookie Theft Scene” description (20 controls, 11 NFV, 9 SV). For every participant, the median PVI28 was determined for polysyllabic words with a strong-weak stress pattern (e.g. COO-kie) and for words with a weak-strong stress pattern (e.g. out-DA-ted). PVI was calculated following the procedure outlined in 28, equaling 100 x (d1 - d2)/ [(d1 + d2)/2], where d1 and d2 are the durations of the first and second vowel. Normalization corrects for a difference in speech rates. PVI is a marker of the suprasegmental timing of speech. PVI values closer to zero are consistent with relatively equal stress between the first two vowels of a word (“low contrastiveness”)28.
Perceptual timing tasks
Testing comprised four pre-existing tasks of perceptual timing (r1-r4)14(Fig 2A). The tasks followed a two-alternative forced-choice algorithm. Participants responded verbally or by pointing to a graphical scheme. Instructions, verbally and graphically, were repeated until the participant understood the task. Five practice trials were repeated until five consecutive correct responses were recorded, and if needed, instructions were repeated and the nature of the errors was explained. If the participant indicated during the test phase that they had forgotten the instructions, then they were repeated, the practice trials run again and the test phase then restarted.
All tasks used 500Hz 100 ms pure tones and consisted of 50 trials. Outcome measures were the thresholds obtained by adaptively adjusting the difference between reference and target. The difference was varied as a relative proportion of the duration or tempo of the reference. The ‘Single time-interval duration discrimination’ task (r1) required participants to indicate which of two tone pairs comprised the ‘longer gap’. Initially, the target was longer by 90% of the reference inter-onset-interval (depending on the trial, between 300 and 600 ms), and adaptively adjusted in steps of 12% and 6%. In the ‘Isochrony deviation detection’ task (r2), participants were required to indicate which of two otherwise isochronous five-tone sequences contained a lengthening or ‘extra gap’. The reference sequence had an isochronous inter-onset-interval ranging from 300 to 600 ms. The target had one lengthened inter-onset-interval between the third and fourth tone. The initial default value of the lengthening was 60% of the inter-onset-interval, adaptively adjusted in steps of 6% and 2%. In both tasks (r1,r2), a local deviation is introduced to generate the target. As such, these tasks test the detection of lower-order differences in timing between consecutive tones. In the ‘Metrical pattern discrimination’ tasks (r3, r4), participants had to decide which of three rhythmic sequences (the second or the third) of seven tones sounded “different”, based on a distortion within the rhythm. The reference sequence had a strongly (r3) or a weakly (r4) metrical beat of four evoked by the temporal spacing of the tones over 16 time units. In the strongly metrical sequence, accented tones occurred every four units, in the weakly metrical sequence, two of those were silent29. The default initial distortion in pattern (a change in the long compared to the short intervals) was 65%, adaptively adjusted in steps of 12% and 6%. Metrical pattern discrimination (r3,r4) requires processing of the higher-order temporal structure of the stimuli, since global deviations distributed across the sequence need to be detected. Typical syllable rates in Dutch (the native language of the participants) are 4-5 syllables/s (period 200 - 250ms) which is close to the tempi used in our tasks.
Statistical analysis
The analysis of the perceptual timing tasks was identical to 14. Outcome measures were log-transformed to allow for parametric analysis at the group level. At the individual level, each patient’s performance was analyzed in comparison to the group by using a modified Crawford t-test30. For the comparison between each patient and the controls, to facilitate comparison between tasks and to enable Bonferroni correction, the exact P values (estimated percentiles) calculated according to Crawford and Garthwaite were transformed into normalized Z-scores using the standard normal cumulative distribution function. The significance threshold was set to Z = 2.24 equaling a one-tailed significance level of P<0.05, Bonferroni-corrected for the number of tests (n = 4). We compared the thresholds between NFV and SV using a Student’s t-test (one-tailed significance level of P<0.05, effect size: Cohen’s d with Hedges correction for small samples, R package effsize). For NFV, SV and controls, we correlated PVI for strong-weak and weak-strong words to the perceptual timing tasks to test the link between impaired rhythm discrimination and speech rhythm (one-tailed significance level of P<0.05). We report the coefficient of determination (R2) as well.
Acquisition of MRI data
Twenty-three patients (12 NFV, 11 SV), and 24 controls received a high resolution T1-weighted structural MRI. All controls and 13 patients were scanned on a 3T Philips Intera system equipped with an 8-channel receive-only head coil (SENSitivity Encoding head coil). Ten patients were scanned on a 3T Philips Achieva dstream scanner equipped with a 32-channel head volume coil. An identical 3D turbo field echo sequence was used on both systems (coronal inversion recovery prepared 3D gradient-echo images, inversion time (TI) 900 ms, shot interval = 3000 ms, echo time (TE) = 4.6 ms, flip angle 8°, 182 slices, voxel size 0.98×0.98×1.2 mm3). The diffusion weighted images consisted of 45 directions of diffusion weighting with a b = 800 as well as 1 non-diffusion weighted image (B0), acquired in the axial plane, with isotropic voxel size of 2.2 mm, TR 9900 ms, TE 90 ms, flip angle 90°, fold over direction AP, fat shift direction A (anterior), in-plane parallel image acceleration (SENSE) factor 2.5.
Deformation-based morphometry
DBM was performed using the CAT12 toolbox (http://www.neuro.uni-jena.de/cat), an extension of SPM12 (http://www.fil.ion.ucl.ac.uk/spm). Segmentation was performed in CAT12 using a default tissue probability map. Local adaptive segmentation was used at default strength (medium) and Diffeomorphic Anatomical Registration Through Exponentiated Lie Algebra (DARTEL) was used for registration to the default template (IXI555_MNI152). Voxel size for normalized images was set at 1.5 mm (isotropic) after internal resampling at 1mm. Local deformations were estimated using the Jacobian determinant, while ignoring the affine part of the deformation field. Thus, additional correction for total intracranial volume is not required31. Images were smoothed using a 8 × 8 × 8 mm3 Gaussian kernel. Deformation fields of controls and both PPA groups were compared using a one-way between-subject ANOVA. Multiple linear regression was used to correlate tests (r1-r4) at the individual level to the deformation fields within each PPA subtype. Scanner type and age were introduced as nuisance variables in all analyses. Threshold of significance was set at voxel-level uncorrected P<0.001 and cluster-level FWE-corrected P<0.0514.
Diffusion Tensor Imaging
Diffusion images were preprocessed and analyzed with MRTRIX3. The preprocessing pipeline included the following steps: first, the data were converted to MIF using mrconvert. Using dwidenoise, diffusion data were denoised; subject motion, and eddy current artefacts were also corrected for using dwidenoise (which relies on FSL eddy); following these two steps, the preprocessed diffusion data were bias-corrected with dwibiascorrect. The diffusion data were rigidly aligned to the subject’s T1-weighted volume space using Advanced Normalization Tools (ANTs) and tensor reorientation was performed. Fractional anisotropy (FA) and mean diffusivity (MD) were calculated in subject-space and normalized to MNI space. The calculated tensors were then used to perform a whole brain tractography using the probabilistic Tensor (Tensor_Prob), combined with anatomically constrained tractography with seeding along the grey/white matter interface, and 2 million streamlines to be selected32. The whole brain tractogram was then segmented using volumes of interest (VOIs) acquired from the Freesurfer aparc+aseg parcellation33,34. These VOIs were pars opercularis of the IFG and the superior frontal gyrus, specifically selecting the Aslant tract on diffusion MR data21.
Freesurfer aparc+aseg parcellation was performed to obtain these subject-specific VOIs. For this reason preprocessing of T1-weighted structural MRIs was repeated using FMRIPREP35, a Nipype36 based tool. T1-weighted volume was corrected for intensity non-uniformity using N4BiasFieldCorrection v2.1.037 and skull-stripped using antsBrainExtraction.sh v2.1.0 (using the OASIS template). Brain surfaces were reconstructed using recon-all (FreeSurfer v6.0.138), and the brain mask estimated previously was refined with a custom variation of the method to reconcile ANTs-derived and FreeSurfer-derived segmentations of the cortical gray matter39. Spatial normalization to the ICBM 152 Nonlinear Asymmetrical template version 2009c was performed through nonlinear registration with the antsregistration tool of ANTs v2.1.0 using brain-extracted versions of both the T1-weighted structural MRI and the template. Brain tissue segmentation of cerebrospinal fluid, white matter and gray matter was performed on the brain-extracted T1-weighted structural MRI using fast (FSL v5.0.9).
Smoothed FA and MD maps were compared between controls, NFV and SV using a between-subject ANOVA (same as previous threshold). Scanner type, TIV and age were introduced as a nuisance variables. A template for the left Aslant tract was generated for healthy controls using the 75% overlap threshold21. FA and MD of the left Aslant tract were extracted for each patient by averaging values from all voxels included in this template21. We compared the FA and MD between NFV and SV by means of a Student’s t-test (one-tailed P<0.05). FA and MD were correlated to rhythm discrimination performance within the NFV group to confirm the DBM findings (one-tailed P<0.05).
Data availability
The data that support the findings of this study are available upon reasonable request.
Results
Perceptual timing
Performance on the perceptual timing tasks was poorer in NFV compared to controls (Fig 2AB): mean Z scores were above the threshold (P<0.05 Bonferroni-corrected) in NFV for discrimination of strongly metrical sequences (r3, mean Z: 2.94), discrimination of weakly metrical sequences (r4, mean: 2.93) and isochrony deviation detection (r2, mean: 2.46) (Fig 2B). We compared the test scores between the NFV and SV. This resulted in significantly poorer scores in NFV for the discrimination of weakly metrical sequences (r4, P = 0.001, Hedges’ g: 1.48) (Fig 2BC).
At the individual level, deficits were observed mainly in NFV patients (Z > 2.24) (Fig 2D). The weakly metrical pattern discrimination task (r4) resulted in a significant impairment in 7 NFV (Fig 2D) and 2 SV patients. Similarly, strongly metrical pattern discrimination (r3) was impaired in 6 NFV (Fig 2D) and 4 SV, as well as isochrony deviation detection (r2) in 6 NFV and 2 SV patients. Single time-interval discrimination (r1) was impaired in just 4 NFV and 1 SV patients. In summary, 75% of NFV cases were impaired in one or more of the tasks and 36.4% of SV cases.
Correlation with speech rhythm
PVI values were closer to zero (“low contrastiveness”) for words with a weak-strong stress pattern in NFV compared to controls and SV (one-way ANOVA F(2,35)=5.37, P = 0.009, Fig 3A). PVI for strong-weak words correlated with strongly metrical pattern discrimination (r3) for NFV (R = 0.634, R2 = 0.402, P = 0.036, Fig 3B), SV (R = 0.761, R2 = 0.579, P = 0.017) and controls (R = 0.531, R2 = 0.282, P = 0.016, Fig 3C). This means that participants with less accurate rhythm discrimination, displayed greater duration differences between the first and second vowels of words with a strong-weak stress pattern. No correlation was found with any of the other perceptual timing tasks (r1,r2,r4) and no correlation was found with the PVI for weak-strong words (all P>0.1).
White matter changes: deformation-based morphometry
The expected pattern of changes of the deformation field was observed when comparing the healthy control, NFV and SV groups. In NFV, atrophy was observed mainly in the frontal lobes, with a left-sided predominance (Fig 4AB). In SV, atrophy was localized to the anterior temporal lobes (Fig 4A). In the NFV group, voxel-wise multiple linear regression showed that the strongly metrical rhythm discrimination task (r3) negatively correlated with changes in the deformation field in the left frontal white matter (MNI = −20,20,−36; −17,8,48; −9,39,50; kE 2426 voxels, Z score: 4.92) (Fig 5AB). This negative correlation indicates that poorer discrimination (i.e. larger thresholds and z scores) is linked to more atrophy. For illustrative purposes, we plotted the individual NFV thresholds for the strongly metrical discrimination task (r3) versus volume loss in this region (R = −0.316, R2 = 0.100, P < 0.001) (Fig 5C). In the SV group, DBM analysis yielded no significant correlations with the perceptual timing tasks.
White matter changes: Diffusion Tensor Imaging
A comparison between NFV, SV and controls showed reduced FA in NFV in the left inferior frontal region, the corpus callosum and the anterior cingulate (Fig 6A). MD was widely increased in NFV, with a predominance in both frontal lobes (Fig 6B). In SV, FA was reduced and MD was increased in both anterior temporal lobes (not shown). We then compared FA and MD between NFV and SV specifically within the template of the left Aslant tract derived from the controls. Although, FA was similar between NFV and SV (P=0.175, Hedges’ g: - 0.72, Fig 6C), MD was higher in NFV compared to SV (P = 0.038, Hedges’ g: 1.17, Fig 6D). In NFV, MD in the left Aslant tract was increased when the performance on the strongly metrical rhythm discrimination task was weaker (r3) (R = 0.815, R2 = 0.664, P = 0.026). Although this was not significant, a trend was observed with FA (R = - 0.708, R2 = 0.501, P = 0. 075). Neither FA nor MD in the left Aslant tract correlated with performance on any other task in NFV (r1,r2,r4, all P>0.231). Visual inspection of the left Aslant tract in NFV showed that this tract overlapped with the region where there were white matter volume changes identified by DBM (Fig 6E).
Discussion
By correlating white matter changes to perceptual timing impairments in AOS, we investigated the link between clinical heterogeneity and structural abnormalities in NFV. We propose a shared mechanism for impaired rhythm discrimination and AOS14, whereby the impact of disrupted temporal scaffolding might well extend beyond the linguistic domain. Behaviorally, we observed a correlation between impaired rhythm discrimination and speech rhythm. DBM demonstrated that atrophy in the left frontal lobe correlated with the rhythm discrimination impairment in NFV. We complemented DBM with DTI to provide an independent measure of white matter changes. DTI confirmed a correlation between damage to the left Aslant tract and impaired rhythm perception in NFV. Our findings link impaired perceptual timing of incoming non-linguistic auditory signals to white matter atrophy in the left frontal lobe. Given the prior work implicating the left Aslant tract to motor speech production deficits in NFV13,21,22, our results suggest it is a relevant (part of a) common anatomical substrate for impaired rhythm discrimination and AOS. Whilst our findings are correlational, the results for connected speech complement the two independent white matter metrics. This strengthens the evidence base for a well-defined neurocomputational mechanism of distorted speech. It provides insight into temporal irregularities in spontaneous speech patterns in AOS, as well as the critical role of left frontal regions in this process.
We observed an overlapping white matter substrate that might contribute to impaired rhythm discrimination as well as AOS. White matter degeneration was present in the left frontal Aslant tract close to the SMA, which has previously been linked to temporal regularity discrimination24 and AOS3,6. The Aslant tract connects the superior frontal gyrus/SMA to the IFG, the region which displays the most pronounced atrophy in early NFV40. Agrammatism has been linked to grey matter damage in left BA4441 and white matter damage in the adjacent left anterior inferior and middle frontal regions and uncinate fasciculus4, which connects the IFG to the temporal lobe12,21. The close anatomical proximity of the IFG and the left Aslant tract could explain why agrammatism and AOS often occur simultaneously in NFV, but also why these deficits can occur in isolation42. The clinical relevance of our study is the additional evidence for regional variations in left frontal lobe atrophy that is linked to the phenotypical heterogeneity in NFV.
Although four perceptual timing tasks were performed, both speech rhythm and left frontal lobe atrophy were linked specifically to impaired performance on the strongly metrical rhythm discrimination task (r3). This task is conceptually different from the single time-interval duration discrimination task (r1) and the isochrony deviation detection task (r2): determining the metricality of a tone sequence (r3) requires processing of the higher-order temporal structure determined by the grouping of salvos of notes that induce the sense of a regularly occurring metrical ‘beat’29. Metricality-based rhythm discrimination (r3) necessitates detecting global deviations distributed across the entire sequence. The correlation between rhythm discrimination and speech rhythm may stem from the common processes required to integrate the higher-order/suprasegmental temporal structure. Our results are in agreement with prior work in PPA that demonstrates the detection of temporal changes between syllables was more impaired when stimuli contained a higher number of syllables43. In contrast, the single time-interval duration discrimination task (r1) and the isochrony deviation detection task (r2) test lower-order differences in timing in a simple isochronous sequence based on a local deviation. The weakly metrical rhythm discrimination task (r4) is more challenging as it does not rely on a clear metrical beat29 (higher thresholds for r4 versus r3 in controls, P<0.001). Perhaps more domain-general processes play a role in this task, but additional manipulations are required to confirm this hypothesis. Keeping in mind the labor-intensive administration of our tasks, we reached a considerable albeit modest sample size. Further validation requires a larger multicentric sample given the relative rarity of PPA. Performance on the strongly metrical rhythm discrimination task (r3) also correlated with the temporal variability in speech rhythm, strengthening the hypothesis that perceptual timing and AOS are linked. Similar to prior work, we observed lower contrastiveness of vowel duration in words with a weak-strong pattern16,28. Distortion of weak-strong words most likely reflects an early change in AOS secondary to abnormal lengthening of the first vowel44. Marked distortions of strong-weak words are more likely to occur when the disease is more severe44. Perhaps the overall low contrastiveness at the NFV group level for weak-strong words resulted in a floor effect, thus prohibiting us to detect a correlation with perceptual timing, whereas changes may be more subtle for strong-weak words. Future research on the physiological link between speech rhythm and perceptual timing might also include the use of delayed auditory feedback. This manipulation, consisting of delayed playback of the speaker’s own voice, which elicits nonfluent speech in healthy participants, may improve speech in NFV patients45.
White matter damage is at least part of the substrate of impaired rhythm discrimination and AOS in NFV. One question is whether these white matter changes reflect tau pathology, the most prevalent pathology in NFV8,9. DTI metrics have been put forward as a marker of tauopathy and other proteinopathies46,47 DTI imaging is sensitive to changes caused by tau pathology at the single-subject level18, presumably because of underlying glial pathology48, e.g. by myelin injury or changes in other structures that affect water diffusion12. Similar to prior work49, we observed that MD changes were more pronounced than FA changes in NFV patients. In most patients, neuropathological data is lacking thus prohibiting us from making strong claims in relation to pathology. We would not advocate linking tauopathy to a simple DTI parameter. Rather, our findings advance the broader characterization of the possible disease-specific involvement of white matter tracts. Our results align with the “molecular nexopathy” paradigm50: the left frontal network containing IFG and SMA as nodes demonstrate a selective vulnerability to tau protein, which could spread locally through the left Aslant tract in a prionlike fashion. Even if certain proteinopathies are strongly linked to predictable phenotypes of network disruption, the molecular nexopathy paradigm does not propose complete specificity. Furthermore, DBM demonstrated that the atrophy which correlated to impaired rhythm discrimination in NFV is more widespread than the left Aslant tract (Fig. 6E). The impact of these adjacent white matter changes remains unclear.
We revealed an overlap in the white matter substrates in rhythm processing of auditory input and suprasegmental timing of speech output. We thus present these correlates as the anatomical substrate of impaired temporal scaffolding and increase the mechanistic understanding of the origin of AOS. We also provide additional evidence that generic processing impairments explain part of the NFV phenotype.
Data Availability
The data that support the findings of this study are available from the corresponding author upon reasonable request.
The authors declare no competing financial interests.
Funding
This work was supported by Federaal Wetenschapsbeleid [Belspo 7/11]; FWO [G0925.15] and KU Leuven [OT/12/097, C14/17/108]. RB is a postdoctoral fellow of the Research Foundation Flanders (FWO).
Acknowledgements
The authors thank B. Bergmans, MD, PhD, Ch. Swinnen, MD, A. Sieben, MD, and Y.A. Pijnenburg, MD, PhD, for the referral of patients. We thank E. Luckett, MSc, for copyediting.