ABSTRACT
Introduction Temporomandibular disorder (TMD) is a common musculoskeletal pain condition with development of chronic symptoms in 49% of patients. Although a number of biological factors have shown an association with chronic TMD in cross-sectional and case control studies, there are currently no biomarkers that can predict the development of chronic symptoms. The PREDICT study aims to undertake analytical validation of a novel peak alpha frequency (PAF) and corticomotor excitability (CME) biomarker signature using a human model of the transition to sustained myofascial temporomandibular pain (masseter intramuscular injection of nerve growth factor [NGF]). This paper describes, a-priori, the methods and analysis plan.
Methods and analysis This study uses a multi-site longitudinal, experimental study to follow individuals for a period of 30 days as they progressively develop and experience complete resolution of NGF-induced muscle pain. 150 healthy participants will be recruited. Participants will complete twice daily electronic pain dairies from Day 0 to Day 30 and undergo assessment of pressure pain thresholds, and recording of PAF and CME on Days 0, 2 and 5. Intramuscular injection of NGF will be given into the right masseter muscle on Days 0 and 2. The primary outcome is pain sensitivity.
Ethics and dissemination Ethical approval has been obtained from The University of New South Wales (HC190206) and the University of Maryland Baltimore (HP-00085371). Dissemination will occur through presentations at National and International conferences and publications in international peer-reviewed journals.
Registration details ClinicalTrials.gov: NCT04241562 (prospective)
STRENGTHS AND LIMITATIONS OF THIS STUDY
PREDICT is the first study to undertake analytical validation of a peak alpha frequency and corticomotor excitability biomarker signature. The study will determine the sensitivity, specificity and accuracy of this biomarker signature at predicting pain sensitivity.
PREDICT will establish the reportable range of test results and determine automation and simplification of methods for biomarker detection in the clinic.
The methods and statistical analysis plan are pre-specified to ensure reporting transparency.
Future patient studies will be required for clinical validation.
INTRODUCTION
Temporomandibular disorder (TMD) is the second most common musculoskeletal pain condition after back pain, with an annual incidence of 4% and development of chronic symptoms in 49% of patients34, 36. Although a number of biological factors have shown an association with chronic TMD in cross-sectional and case control studies including sensitivity to mechanical stimuli37, up-regulated central nociceptive processing29, 31, increased heart rate and reduced heart rate variability22, single nucleotide polymorphisms35, 38, elevated levels of pro-inflammatory cytokines35, elevated interstitial glutamate concentration2, and altered brain structure and function21, these have either failed to yield clinically meaningful predictive power or have not undergone comprehensive validation in prospective trials. Consequently, there are no biomarkers available that can predict the development of chronic TMD. In fact, there are no biomarkers qualified (considered valid and psychometrically sound) by the FDA for use in clinical trials or clinical practice for any musculoskeletal pain condition39.
In most patients with chronic musculoskeletal pain a peripheral anatomical cause for pain cannot be identified. For example, myofascial TMD is more commonly associated with stress and anxiety than anatomical pathology40 while 90% of all chronic low back pain is diagnosed as ‘non-specific’18. In conditions where a structural impairment can be detected (i.e. articular cartilage damage in osteoarthritis), the magnitude of pain fails to correlate with the extent of tissue damage11. These observations suggest a role for the brain in the development and maintenance of chronic pain. Indeed, early investigations suggest that variability in brain connectivity circuits can predict sensitivity to a transient pain stimulus in healthy individuals30. Although these data have not yet been expanded and the relevance to clinical pain is unknown, brain imaging methods are widely considered to have potential as diagnostic, prognostic and predictive biomarkers of chronic pain5.
Using brain imaging methods of electroencephalography (EEG) and transcranial magnetic stimulation (TMS), preliminary evidence for a unique biomarker signature – combined resting state peak alpha frequency (PAF) and corticomotor excitability (CME) – has recently been demonstrated. In studies using long-lasting human pain models, slow PAF and low CME are associated with high pain severity and longer pain duration13, 32, 33. Consistent with this, low CME in the acute stage of clinical pain is associated with high pain severity and the presence of pain at 6 months follow-up3. These data suggest the combination of slow PAF and low CME may be a plausible predictive biomarker for the development of chronic TMD.
Here, we outline the experimental protocol and statistical analysis plan to undertake extensive analytical validation of the PAF/CME biomarker signature using a standardized human model of the transition to sustained myofascial temporomandibular pain (masseter intramuscular injection of nerve growth factor). We hypothesise that the PAF/CME biomarker signature will predict pain sensitivity (primary) and pain severity and duration (secondary) with at least 75% accuracy in a human transitional pain model of TMD. In addition, we aim to i) determine the sensitivity, specificity and accuracy of the PAF/CME biomarker at predicting pain sensitivity, severity, and duration, ii) determine the reportable range of test results and reference intervals for fast vs. slow PAF and high vs. low CME and iii) establish optimization of the model and automation and simplification of methods for biomarker detection.
METHODS
Design
A multi-site longitudinal, experimental study will be used to follow healthy individuals for a period of 30 days as they progressively develop and experience complete resolution of NGF-induced muscle pain. All data collection will be performed at the Australian site (Neuroscience Research Australia; NeuRA), and blinded data processing and analyses will be performed at the USA site (the University of Maryland Baltimore; UMB). The UMB site will also be responsible for standardization and automation of analytical methods. A data and safety monitoring plan has been established and an independent monitoring committee will conduct annual reviews of study progress and safety. Ethical approval has been obtained from the University of New South Wales (HC190206) and the University of Maryland Baltimore (HP-00085371). All procedures will be conducted in accordance with the Declaration of Helsinki. Written, informed consent will be obtained and participants will be free to withdraw from the study at any time. The study is prospectively registered on ClinicalTrials.gov (NCT04241562).
Patient and public involvement
Patients and the public were not involved in the design of this protocol. Individual results will be provided to participants on request and a summary of the overall outcomes of the study will be available to all participants on completion of the trial.
Participants
Inclusion and exclusion criteria
Healthy men and women with no medical complaints, no history of chronic pain and no current acute pain between the ages of 18 and 44 years will be included. These inclusion criteria are justified based on data from the OPPERA prospective cohort study that demonstrates only marginally greater TMD incidence in females than males and an incidence rate of first-onset TMD of 2.5% per annum among 18 to 24-year olds and 4.5% per annum among 35 to 44-year olds34. Exclusion criteria are: 1) inability or refusal to provide written consent, 2) presence of any acute pain disorder, 3) history or presence of any chronic pain disorder, 4) history or presence of any other medical or psychiatric compliant, 5) use of opioids or illicit drugs in the past 3 months, 6) pregnant or lactating women, 7) contraindicated for TMS (metal implants, epilepsy)16. Participants will be recruited via notices placed on community notice boards at UNSW and NeuRA, flyers, mailings and social media platforms (such as Facebook) as well as the use of a volunteer healthy participant database held by NeuRA.
Sample size
150 healthy subjects will be included. Our preliminary data indicate consistent associations between PAF and future pain severity, as well as strong relationships between CME and pain severity. The design of the current discovery-based study is not amenable to traditional power calculations because the outcomes are not p-value-based inference but rather predictive. Larger sample sizes in the training set give better classification, while larger sample sizes in the testing set give higher accuracy. We have chosen a sample size that provides good classification and accuracy. Allowing for a 10% drop-out rate, we will enrol 165 subjects.
Data collection procedures
Overview
Participants will first complete a phone screen and if eligible, a time will be made for the Day 0 visit. At the Day 0 visit, after reviewing eligibility criteria, participants will complete informed consent (considered enrolment in the study) and questionnaires. Participants will complete twice daily electronic pain dairies from Day 0 to Day 30 and attend 3 laboratory visits of ∼ 2 hours duration on Days 0, 2 and 5. Each laboratory visit will include assessment of pressure pain thresholds (PPTs), and recording of PAF and CME. Intramuscular injection of NGF will be given into the right masseter muscle at the end of each test session on Days 0 and 2 (Fig 1). These procedures are detailed below.
Electronic diaries
Diaries will be completed using a computer, tablet or phone at 10am and 7pm each day from Day 0 to Day 30. Electronic diary completion will take 2-mins. Participants will rate their pain intensity on an 11-point numerical rating scale (NRS) anchored with ‘no pain’ at zero and ‘worst pain imaginable’ at 10 at rest, and while chewing, swallowing, drinking, talking, yawning and smiling8. Study staff will send a text message containing a unique link to the pain dairy synced with the time of diary completion (10am and 7pm) each day. If the diary is not completed for two consecutive days, participants will be followed-up by phone.
Questionnaires
At Day 0 only, participants complete a health history form assessing medical history. We will use the following National Institutes of Health common data elements (CDE) for pain biomarkers including: Pain Catastrophizing Scale (PCS)41; Brief Pain Inventory (BPI) Pain Severity and 7-item Interference subscales17, 43; SF-8 to assess general health48; Sleep Scale; PHQ-2 to assess depression19; GAD-2 to assess anxiety20; Tobacco, Alcohol, Prescription medications, and other Substances (TAPS). Participants will also complete the Perceived Stress Scale (PSS)44. These questionnaires assess factors that have been associated with first onset and/or chronic TMD1 and often worsen as TMD progresses10. These questionnaires will be completed on Day 0.
On Days 0, 2 and 5 participants will complete the Research Diagnostic Criteria questionnaire for TMD9 and two NRS scales asking the following questions:
On a scale of 1 to 10 where 1 is ‘poor sleep quality’ and 10 is ‘excellent sleep quality’, how would you rate your sleep last night?
On a scale of 1 to 10 where 1 is ‘not at all stressed’ and 10 is ‘very stressed’, how would you rate your level of stress over the last 24 hours?
Pressure pain Thresholds (PPTs)
Because NGF injection is known to sensitize mechanosensitive afferents23, 42, and because lower PPTs at cranial sites are associated with increased risk of developing TMD14 and fluctuate with the clinical disease course37, we will assess PPTs at 5 sites - overlying the masseter muscle, temporalis muscle, the temporomandibular joint, the trapezius muscle, and the lateral epicondyle. Three measures will be made at each site, with 1 min rest between measurements at the same site, in pseudorandomized order using a commercially available algometer.
Peak alpha frequency (PAF)
Scalp EEG will be collected using Brain Vision actiCAP with at least 32-channels, following the extended international 10–20 system28, a BrainAmp DC amplifier and Brain Vision Recorder version 1.22.0101 software (all Brain Products GmbH, Munich, Germany). Auxiliary recordings will include skin conductance, respiration, and electrocardiogram (ECG). Participants will be asked to make facial muscle contractions such as clenching their teeth, blinking, and saccades, while EEG is recorded. This will take about two minutes and will be used to aid in automated artefact removal. Participants will then be told to relax their muscles and resting state eyes closed EEG will be recorded for 5 minutes and used for PAF calculation.
Corticomotor excitability
Rapid TMS will be used to map the primary motor cortical representation of the right masseter muscle and right extensor carpi radialis brevis (ECRB) muscle. Mapping of the right ECRB muscle is included to determine whether any changes in corticomotor excitability are restricted to the affected muscle. Single-pulse, biphasic stimuli will be delivered to the left hemisphere using a Magstim Super Rapid2 Plus and a 70mm figure of eight coil. Bipolar surface electrodes will be used to record electromyographic (EMG) activity7. EMG signals will be amplified (x2000), filtered (20-1000 Hz) and digitally sampled at 5kHz. The scalp site that evokes the largest EMG response (motor evoked potential, MEP) at a given TMS intensity will be determined for each muscle in each individual (termed the ‘hotspot’) and the active (aMT – masseter muscle) or resting (rMT – ECRB muscle) motor threshold calculated. A 6 × 6 cm grid will be generated in the neuronavigation software for each muscle, centred to each participant’s hotspot. 110 stimuli will be delivered at 2-sec intervals to pseudorandom locations over the grid at 120% of aMT for the masseter muscle and 120% of rMT for the ECRB muscle.
Intramuscular injection of nerve growth factor (NGF)
After cleaning the skin with alcohol, a sterile solution of recombinant human NGF (dose of 5 μg [0.2 ml]) will be given as a bolus injection into the muscle belly of the right masseter on days 0 and 2 using a 1-ml syringe with a disposable needle. Any individual who does not develop sensitivity to the NGF model, assessed by diary pain ratings and pressure pain thresholds of the injected muscle, will be considered a non-responder and excluded from analyses.
Outcome measures
Primary Outcome
The primary outcome is pain sensitivity: participants are dichotomized as high- or low-pain sensitive based on the peak pain severity from diary recordings12, 33. That is, based on pain severity in the training set (n=100), participants will be classified as the top 40% high- or bottom 40% low-pain sensitive. This classification can be further weighted (e.g. very high, very low) as described in Aim 1.3.
Secondary Outcomes
The secondary outcomes are pain severity (peak average daily pain severity based on diaries on a 0-10 scale) and pain duration, defined as the time between pain onset and complete resolution of pain (0 on a 0-10 scale for two consecutive days).
Biomarker candidates
Biomarker candidates are PAF at Day 0 and CME at Day 5. As this is a discovery project, we will also examine PAF and CME at every day it is tested (see Aim 1.3 below).
Data processing
PAF and other EEG metrics
All data processing will be performed using custom MATLAB scripts implementing EEGLAB6 and FieldTrip toolboxes27. Data will be referenced to the average across all recording channels and segmented into 5-second epochs. These epochs are manually inspected and all epochs containing marked muscular artifacts are rejected. Channels with poor recordings will be rejected. Principal component analysis is then applied to identify and remove components relating to eye blinks, saccades, and ECG artifacts. Power spectral density will be derived in .20 Hz bins and the 2-40 Hz range will be extracted. Power spectral density will be extracted in sensor space around sensorimotor cortices (C3, Cz, C4, and neighboring electrodes), as well as sensorimotor ICA components demonstrating clear alpha peaks26, 46, 47. A Hanning taper will be applied to the data prior to calculating the spectra to reduce any edge artifacts similar to the approach taken in24-26. PAF is calculated using the center of gravity method, as we have done previously12.
Corticomotor excitability
All data processing is performed using a custom MATLAB script. Triangular linear interpolation is used to create a full surface map within a transformed 2D plane containing the stimulation coordinates and their corresponding peak-to-peak MEP amplitudes4, 45. The resultant map is divided into 2500 partitions (50 × 50), with each partition assigned an approximated value based on the nearest acquired MEP data. Map area is determined as the ratio of the number of approximated partitions where the MEP exceeds 10% of the maximum MEP across all partitions. This cut-off reduces data variability. Map volume is then calculated as the sum of all MEPs (subtracted by the 10% level). This approach is described in full detail (including relevant equations) here45.
Statistical analyses
Aim 1.1: Predicting pain sensitivity and optimizing the model
We will validate the PAF/CME biomarker signature and test the predictive accuracy using a nested control-test scheme. The sample of 150 subjects will first be randomly divided into an outer-training set (n = 100) and an outer-testing set (n = 50). The research team at UMB will be blinded to the outcomes of pain sensitivity, severity and duration in the outer-testing cohort. “High pain sensitive” subjects are defined as the 40% of all subjects with the highest pain sensitivity, whereas “low pain sensitive” subjects are the 40% of all subjects with the lowest pain sensitivity. The ratios of high-vs. low-pain sensitive individuals will be matched between the two cohorts. Next, the outer-training cohort will be split into 5-folds (20 subjects for each fold) for cross validation. Each fold of 20 subjects will be tested as an inner testing cohort based on the remaining 4 folds as the inner training cohorts. We expect the 5-fold cross validation will provide sound performance assessment with balanced variance-bias trade-off (see details in15). We will consider multiple classifiers including logistic regression, support vector machine, gradient boosting, random forest, and neural networks. These predictive models along with the tuning parameters will be compared based on the performance of the 5-fold cross validation. The biomarkers may predict outcomes in a nonlinear fashion, and thus most machine learning models (e.g. support vector machine and gradient boosting+random forest) will detect nonlinear functions. The predictive model with the highest performance (i.e., the final model) based on the ability to classify the 40% most pain sensitive and the 40% least pain sensitive participants will be referred to as the “winning classifier”. The parameters of the winning classifier will be fixed and used to predict the outcomes of the outer-testing set. After finalizing the predicted outcomes, the outcomes will be unblinded to the UMB team. We will compare the predicted outcomes with the true outcomes and assess the accuracy, sensitivity, specificity, positive and negative predictive values. The predictive accuracy based on binary outcome prediction is used because it is more robust than mean squared error of a predictive model for continuous variables and is more commonly used in the field. Our target is to achieve an area under the curve of the receiver operating characteristic greater than or equal to 75% when applying the fixed classifier to the testing data set.
Aim 1.2: Reportable ranges
The sensitivity, specificity and accuracy of the PAF/CME biomarker will be based on the blinded prediction of the outer-training 50 samples. Reference intervals will be reported for the whole sample, including intervals for fast vs. slow PAF and high vs. low CME. These will be reported as tables, standardized by age, sex, and other factors. We will further report on the stability of these measures over time (Days 0, 2 and 5).
Aim 1.3: Optimization
We will explore how the inclusion of other combinations of factors in the model affects performance characteristics. The auxiliary factors considered in the model will include questionnaire and diary data, PPTs, and other EEG data (theta, alpha, beta, low gamma power) using a model/variable selection procedure to further boost the performance of the model. The nested training-testing scheme will be used to determine the optimal pain sensitivity prediction model using the biomarkers.
Weighted accuracy: since the low- and high-pain sensitive categories are determined based on a continuous pain scale, subjects with pain intensities near the median should be weighted less. Therefore, in addition to the simple accuracy, the weighted accuracy will be calculated. The weight will be determined by the distance of pain levels to the high-low cut-off.
Automation and Simplification of Methods
In order for the biomarker signature to have application to large populations and settings, users must be able to rapidly collect and analyze data with minimal training. We will develop methods that automatically produce biomarker readouts with minimal human input, thus reducing bias associated with data input. Our goal will be to develop a method for automated signature calculation that achieves an intraclass coefficient of at least 80% compared to output from non-automated data processing and no significant difference between automated and non-automated based on bootstrap inference.
Data Availability
De-identified, individual participant data will be made available immediately following publication via an open-access data repository.
ETHICS AND DISSEMINATION
Ethical approval has been obtained from The University of New South Wales (HC190206) and the University of Maryland Baltimore (HP-00085371). Dissemination will occur through presentations at National and International conferences and publications in international peer-reviewed journals.
Funding
This project is funded by grant 1R61NS113269-01 from The National Institutes of Health to DAS, SMS and SC. SMS receives salary support from The National Health and Medical Research Council of Australia (1105040). SKM and KB hold University of New South Wales International Postgraduate Scholarships.
Author contributions
DAS, SMS and SC acquired funding to undertake this research. DAS, SMS and KB drafted the protocol. DAS, SMS, KB, AC, NC, AF, SKM, PS, and SC contributed to revisions and approved the final version of the manuscript.
Competing Interests Statement
There are no competing interests to declare.
Data Availability Statement
De-identified, individual participant data will be made available immediately following publication via an open-access data repository.