Abstract
Introduction Pharmacological treatment of depression mostly occurs in non-psychiatric settings, but factors that determine the initial choice of antidepressant treatment in these settings are not well-understood. This study models how non-psychiatrists choose among four antidepressant classes at first prescription (selective serotonin reuptake inhibitors [SSRI], bupropion, mirtazapine, or serotonin-norepinephrine reuptake inhibitors [SNRI]), by analyzing electronic health record (EHR) data.
Methods EHR data were from the Mass General Brigham Healthcare System (Boston, Massachusetts, USA) for the period from 1990 to 2018. From a literature search and expert consultation, we selected 64 variables that may be associated with antidepressant choice. Patients who participated in the study were aged 18 to 65 at the time of first antidepressant prescription with a co-occurring International Classification of Diseases (ICD) code for a depressive disorder. Multinomial logistic regression with main effect terms for all 64 variables was used to model the choice of antidepressant. Using SSRI as the reference class, odds ratios, 95% confidence intervals (CI), and likelihood ratio-based p-values for each variable were reported. We used a false discovery rate (FDR) with the Benjamini–Hochberg procedure to correct for multiple comparisons.
Findings A total of 47,107 patients were included after application of inclusion/exclusion criteria. We observed significant associations for 36 of 64 variables after multiple comparison corrections. Many of these associations suggested that antidepressants’ known pharmacological properties/actions guided choice. For example, there was a decreased likelihood of bupropion prescription among patients with epilepsy (adjusted OR 0.41, 95% CI: 0.33–0.51, p < 0.001), an increased likelihood of mirtazapine prescription among patients with insomnia (adjusted OR 1.58, 95% CI: 1.39–1.80, p < 0.001), and an increased likelihood of SNRI prescription among patients with pain (adjusted OR 1.22, 95% CI: 1.11–1.34, p = 0.001).
Interpretation Non-psychiatrists’ selection of antidepressant class appears to be guided by clinically relevant pharmacological properties, indications, and contraindications, suggesting that broadly speaking they choose antidepressants based on meaningful differences among medication classes.
INTRODUCTION
Major depressive disorder (MDD) is one of the most common psychiatric disorders, with a lifetime prevalence in the U.S. of approximately 20% (1). Pharmacological treatment of depression mostly occurs in non-psychiatric settings (2), although little is known about antidepressant treatment selection practices in primary care, and whether factors affecting treatment decisions in these settings are consistent with expert recommendations (3–6). In this analysis, we take advantage of the comprehensive longitudinal data captured in EHRs to examine the factors associated with antidepressant selection by non-psychiatric clinicians. In addition to using coded (structured) data in EHR, we also applied natural language processing (NLP) to extract mental health symptom information that is documented in free text (unstructured) data from clinical notes. We discuss the extent to which these factors align with those recommended in the APA Practice Guideline and the psychiatric literature (3–6).
Factors considered included patient demographics; comorbidities; depression-related mental symptoms; drug side effects; and drug-drug interactions.
METHODS
Institutional Review Board (IRB) approval
This study was approved by the IRB of Mass General Brigham (MGB) Healthcare.
Data source
Data were extracted from the Research Patient Data Repository (RPDR) (7) of the MGB Healthcare System in Boston, Massachusetts. The RPDR includes data on more than 7 million patients and over 3 billion records from 7 hospitals, including 2 major academic hospitals. Clinical data recorded in the RPDR include encounter (patient visit) meta-data (e.g., time, location, provider, etc.), demographics, ICD 9 and 10 Clinical Modification (ICD-9-CM and ICD-10-CM) diagnoses, laboratory tests, medications, procedures, and patient reported outcome measures.
Study population
EHR data were extracted for the period January 1990 to August 2018. Patients included were between age 18 and 65 at the time of their first antidepressant prescription (comprising any one of the following: citalopram, escitalopram, fluoxetine, fluvoxamine, paroxetine, sertraline, venlafaxine, desvenlafaxine, duloxetine, mirtazapine, bupropion, vilazodone, and vortioxetine) with a co-occurring diagnostic ICD code for a depressive disorder (defined as ICD-9-CM: 296.20–6, 296.30–6, and 311; ICD-10-CM: F32.1–9, F32.81, F32.89, F32.8, F33.0–3, and F33.8–9). We defined the first visit with co-occurring antidepressant prescription and depression ICD code as the “index visit.” We excluded patients based on the following criteria: (1) first prescription was given prior to 1997, when the latest antidepressant category (mirtazapine) became available; (2) first prescription was made by a psychiatrist, as our goal was to study non-psychiatrist prescribing practices; (3) absence of clinical notes or details of visits within the 90-day window prior to the date of first prescription (to ensure that data were available to address whether the index date really reflected the initial prescription [see below]); (4) presence of ICD codes for bipolar disorder, schizophrenia, or schizoaffective disorder on or prior to the date of first prescription; (5) first documented prescription included two or more antidepressants, as these were unlikely to be first prescriptions; and (6) prescription of an antidepressant other than the four classes of interest.
Because the data are limited to what is recorded in the MGB EHR, some patients may have already been on an antidepressant prior to the index visit (“prevalent users”) due to previous outside prescriptions. For such patients, the pattern of association between the predictors measured at or around the index visit may be different than those of true initiators (“new users”) at the index visit. The three-month window prior to the index date addresses this to some extent, but to improve the chances of detecting prevalent use, we identified a subset of patients who had a routine primary care visit in the MGB system in the past year, who were thus less likely to have received an undocumented antidepressant from an outside provider. We conducted 200 chart reviews to assess the frequency of prevalent users among both the full cohort and the subset of patients with primary care visits (the “PCP subset”). The analyses described below were performed in both sets of patients.
Variables for analysis
To determine variables to be included in the analysis, a psychiatrist (YS) searched the literature (3–6) and conducted telephone interviews with five other psychiatrists and three non-psychiatrist physicians. The physicians were first asked open questions regarding what factors they consider when selecting an antidepressant, and then queried more specifically when the initial answers were broad (i.e., drug-drug interaction, etc.) After all information was gathered, a set of variables was derived by including all factors that were mentioned to affect prescribing decisions in either the interviews or the literature. The final list of variables can be broadly categorized as patient demographics, prescription timing information, co-morbidities at the index visit, other medications, and depression-related symptoms.
For co-morbidities, ICD billing codes that occurred before or at the index visit were collected for each patient and mapped to individual diseases, adopting mappings from those either previously validated and published in a peer-reviewed journal (8,9), provided by authoritative sources (10,11), or electronic ICD code databases (12,13) if the prior two sources were not available (Appendix A shows the complete mapping).
Prescriptions between 90 days before the index visit and at the index visit were retrieved for each subject. These medications were then categorized based on generic names regardless of dosing and route of administration. Counts for distinct medications other than antidepressants were then generated for each patient within the 90-day time window. We chose to include the number of medications rather than specific medications because most clinicians reported that this was a bigger driver of decision-making based on differences in broad drug-drug interactions across medication class. However, a count of the number of prescriptions for nonsteroidal anti-inflammatory drugs (NSAIDs) was constructed as a separate variable because these were reported to be specifically associated with abnormal gastrointestinal bleeding with concurrent use of SSRI and SNRI antidepressants (14,15). We also included the calendar year of the index visit because clinical practice may change over time in response to new information, such as new clinical trials or updated guidelines.
Extracting depression-related symptoms from clinical notes with NLP
We adopted a hierarchical approach to identify and create variables for depression-related symptoms. The hierarchy consisted of four levels, from top to bottom: (1) categories of depression-related symptoms (e.g., depressive mood and anhedonia, loss of appetite and body weight, and insomnia), (2) concepts within these categories (e.g., for depressive mood and anhedonia: “anhedonia” and “sadness”), (3) specific terms used to describe these concepts (e.g., for anhedonia, “anhedonia,” “can’t enjoy anything, “no pleasure”), (4) lexical derivatives and regular expressions (a common text format used for computer reading). We initially grouped the concepts of depression-related symptoms into depressive symptom categories corresponding to criteria in the Diagnostic and Statistical Manual of Mental
Disorders-5th edition (DSM-5) (16). We then expanded the search to include terms largely synonymous with these concepts, being fairly liberal in order to make sure we captured the core concept, since many of the terms were derived from the patients’ actual wording; for example, “despondent” was taken as synonymous with “depressed.” The actual variables built into the data matrix were the number of concepts present per category (e.g., category 1 “depressive mood and anhedonia” had a total of 8 concepts, each of which could be indicated by the presence of one of many terms) in the aggregation of notes within the 90-day window prior to and including the index date for each patient. For example, if a patient had at least one term indicating two out of 8 among these concepts, say, “depressed mood” and “melancholia,” he would receive a score of 2 for category 1. Appendix B provides further details on the psychopathology hierarchical structure.
Before extracting terms from the clinical notes, a pre-processing step removed sentence segments containing words that implied negations (e.g., not, no) while sparing double negations and sentence components separated by “but”, “however”, “although”, and “nevertheless” that would indicate a reverse of sentiment. Sentence parsing was done using the spaCy (https://spacy.io/) (17) package for Python (version 3.7), which applied the customized sentence segmentation rules mentioned above. We then applied regular expression matching to detect the presence of any terms for depression-related symptoms within the notes using the Python Re module. Using these matching results, we constructed the depression-related symptom variables in accordance with the hierarchical mapping previously described. To avoid false positive matching, regular expressions of shorter length were protected by word boundary detection (i.e., match only when white spaces or punctuation were present around the string) to prevent them from being matched as a substring of a longer word; for example, “cry” would not be matched if the string only appeared in “cryptococcus.”
Statistical analysis
(a) Descriptive analysis of variable value distribution
For each categorical variable, we calculated the proportion of subjects in each category. For continuous variables, we calculated mean, standard deviation, and range. All calculations were performed for the full sample and stratified by index antidepressant category.
(b) Statistical modeling of initial antidepressant choice
To model the choice of the antidepressant initiated, we performed a multivariate multinomial regression with antidepressant class as the outcome and with SSRIs, the most commonly prescribed class, as the reference category. We included the main effect of each of the 64 variables as predictors in the model, without considering any interactions. We performed the modeling in R version 3.5.2 with the package “mnlogit,” which allows efficient inference using the Newton-Raphson method. We estimated the odds ratio and 95% CI for each variable and for each contrast of treatment with SSRIs (i.e., bupropion, mirtazapine, and SNRI versus SSRI). Likelihood ratio tests were done globally for each variable comparing the full model with the model leaving out the variable of interest with a chi-squared test. This procedure was recurrently applied to generate p-values for each variable. We report both nominal and FDR-corrected p-values.
(c) Sensitivity analysis
For sensitivity analysis, we identified a subset of patients with routine primary care visits in the MGB system (the “PCP” subset), for whom we were more likely to capture (and thus be able to exclude) prevalent users. To do this, we identified patients with routine health services or annual check-up visits within one year prior to the index date. We identified such patients using the presence of either of the following: (1) at least one of the following Current Procedural Terminology (CPT) codes for these services recorded: 99201, 99202, 99203, 99204, 99205, 99211, 99212, 99213, 99214, 99215; or (2) “reason for visit” recorded as “check-up” in the encounter metadata.
RESULTS
Our initial selection criteria yielded 111,571 patients from the RPDR. After applying our exclusion criteria, a total of 47,107 patients were retained for the main analysis of the study (“the full sample”). Figure 1 displays the detailed exclusion steps and the resulting patient counts at each step. Among the 47,107 patients, 22,848 had an MGB primary care visit in the year prior to the index date (the “PCP subset”). Table 1.1 presents the descriptive results for the full sample. The majority of patients were first prescribed an SSRI (n = 34,709, 73.7%), followed by bupropion (n = 5,969, 12.7%), SNRI (n = 4,924, 10.4%), and mirtazapine (n = 1,505, 3.2%). The study sample was predominantly female (67%), consistent with the known sex distribution of depression (18). Approximately 77% of the patients were identified as Caucasian. Common comorbid diagnostic codes included anxiety-related diagnoses (34.8%), primary hypertension (29.5%), and any malignancy (26.5%), including past malignancies. The pattern of first prescription class changed over time, as can be seen in Figure 2.
Based on chart reviews of 200 randomly sampled patients, we estimated that approximately 29% of patients in the full sample were not new users. Applying the method above for identifying the PCP subset to the same 200 patients reduced the proportion of non-new users by about one-third--to 19%. The distribution of sociodemographic and clinical features was largely similar between the full sample and the PCP subset (results not shown).
There were statistically significant but small differences in age across the treatment groups, e.g., the SNRI and mirtazapine groups were slightly older (mean ages 45.90 and 45.29, respectively, compared to 43.96 for SSRI). Patients in the bupropion group were more likely to be male (50% vs. 33% overall) and obese (19% vs. 15% overall). The mirtazapine group had higher proportions of co-morbidities (e.g., congestive heart failure, primary hypertension, metastatic malignancy, and problems with sleep), as well as more concomitant medications. Both the bupropion group and the mirtazapine group had higher proportions of co-morbid substance use disorder. The characteristics of the patients in the SSRI and SNRI groups were more similar to one another than to those of the two other groups.
Table 2 shows the association between all selected variables and the choice of antidepressant class in the full sample, and Appendix C for the PCP subset. Figure 3 plots the odds ratios and confidence intervals for variables that were globally significant (i.e., significant for the omnibus test for the variable) in the full sample, (see Appendix D for the PCP subset). Appendix E shows the same plot for all variables in the full sample, and Appendix F for the PCP subset. For the full sample, 36 of 64 candidate predictors identified by literature review and clinical expert consultation were associated with antidepressant class selection. All binary variables among the 36 that were significant had at least one OR of either > 1.1 or < 0.9 for one of the three contrasts (i.e., initiated one of the other three antidepressant categories versus SSRI) derived by multinominal regression). Age, year of prescription, total number of other medications, and number of NSAID prescriptions all showed significant association with treatment selection (all FDR-corrected p < 0.001). Among all psychiatric comorbidities considered, only eating disorders and cluster A/B/C personality disorders were not associated with initial antidepressant selection. Other psychiatric disorders, including attention-deficit/hyperactivity disorder, alcohol use disorders, other substance use disorders, anxiety disorders, post-traumatic stress disorder, and other personality disorders, all showed strong signals of association (FDR-corrected p = 0.002 for other personality disorders; p <0.001 for all other diagnoses).
With the exception of cerebrovascular disease and stroke, all examined neurological comorbidities were associated with antidepressant selection. Notably, bupropion was less commonly prescribed to patients with neurological disorders, such as epilepsy, hemiplegia, multiple sclerosis, traumatic brain injury, and cerebral vascular disease, while bupropion was more commonly prescribed to patients with Parkinson’s disease. Mirtazapine and SNRIs were more commonly prescribed to patients with migraine and hemiplegia, while SNRIs were less commonly prescribed to patients with cardiovascular disease.
As for general medical comorbidities, bupropion was less commonly prescribed to patients with congestive heart failure, and was more commonly given to patients with obesity, organ transplantation, and sexual dysfunction. Mirtazapine was less frequently prescribed for patients with obesity, moderate to severe liver disease, and primary hypertension, and more frequently prescribed for patients with inflammatory bowel disease, mild liver disease, metastatic malignancy, organ transplantation, and peptic ulcer, compared to SSRIs, with all other factors controlled. SNRIs were more frequently prescribed to patients with diabetes with chronic complications and less frequently to patients with congestive heart failure and chronic pulmonary disease.
As can be seen in Appendix C, Appendix D, and Appendix F, despite some loss of power, the general pattern of findings was very similar in the PCP subset.
DISCUSSION
Using real-world EHR data to study the initial prescribing choices for depression among non-psychiatrists, the current study detected strong signals for many factors consistent with currently recommended psychiatric practice. For example, pain and loss of appetite showed strong associations with SNRIs (which have indications in pain management) and mirtazapine (which is known to increase appetite). Applying NLP to unstructured EHR enabled the current study to capture depression-related clinical characteristics, even in non-psychiatric settings where relevant records might be sparse. Recent studies have also looked into antidepressant prescriptions in non-psychiatric settings, such as off-label use of antidepressants (19), as well as modeling the overall likelihood of any treatment initiation for people with depression in primary care settings (20). One study (21) performed semi-structured interviews with 28 general practitioners regarding the factors they would consider when prescribing an antidepressant. However, none of these studies quantitatively analyzed the association between detailed clinical characteristics and the choice of antidepressant with depression as the indication, as we have done here.
It is of particular interest that most of the neurological disorders studied showed significant correlations with the choice of antidepressant. The inverse association of bupropion prescription with epilepsy and other neurological disorders, such as traumatic brain injury (22), multiple sclerosis (23), and hemiplegia (most commonly caused by stroke) (24) could be explained by prescribing physicians taking into account bupropion’s lowering of the seizure threshold. On the other hand, the association of mirtazapine prescription with comorbid migraine was unexpected, since there are no obvious direct pharmacological or clinical considerations that would support this association.
Our results should be interpreted in the context of several limitations. First, real-world EHR data are inherently limited by the presence of missing data. In our case, data were present only when the patient received care within the MGB system and only for observations documented in the EHR. As mentioned previously, another limitation of our study is the difficulty of discerning new from prevalent users. This is also a problem of missing data in the sense that we might be missing antidepressant prescription information prior to the index visit, particularly for patients who obtain initial prescriptions from community psychiatrists or other sources. With prevalent users, the characteristics collected in the window before the index visit might not reflect the patient’s status in the period before actual treatment initiation, which may have occurred long before the index visit date. It is also possible that some of these characteristics, could have been caused by the prevalent drug (e.g., loss of appetite and weight loss can be caused by bupropion) and thus adjusting for them could introduce bias. To address this, our sensitivity analysis looked at patients with PCPs within the MGB system, where the proportion of prevalent users as judged by physician chart review was somewhat lower (29% vs. 19%); these analyses yielded results similar to the main analysis. That said, we acknowledge these problems could be further mitigated by obtaining more complete information regarding treatment and health histories (e.g., by mining medication lists beyond prescriptions or clinical notes for evidence of medications patients are taking from outside sources, by linking EHR data to insurance claims data that capture encounters and treatment beyond a single healthcare system, or by linking specifically to part D claims).
Second, phenotyping depression with ICD codes alone may result in low sensitivity and specificity. This can be remedied to some extent by applying phenotyping algorithms that can be more accurate than ICD codes alone (25–27). We elected not to use such algorithms here because we were interested in prescribing practices when the non-psychiatrist believes that they are treating depression, irrespective of the accuracy of such a diagnosis.
A third limitation is that we did not consider non-pharmacological therapies in the study. Part of the reason for not considering psychotherapies is that a substantial fraction of psychotherapy may occur outside the MGB system. Thus, we were not able to address whether such treatments had been used instead of pharmacologic treatments, before pharmacologic treatments, or concurrent with them. That said, it has been documented that specialized mental health services are less likely to be recommended in primary care settings (28), and this likely applies to other non-psychiatrist clinicians as well.
Fourth, we did not examine antidepressant dosing. The dosing of the medications may affect the observed choice for initiation for drugs for indications other than depression (e.g., low dose mirtazapine for insomnia). Since insomnia is itself a symptom of depression and the two frequently co-occur, without dosing information it would be difficult to discern the actual treatment target in such cases. One may argue that factors considered by the physician upon prescription may be different for the two indications; alternatively, use of mirtazapine to treat primary insomnia might upwardly bias an apparent association between this symptom and mirtazapine selection. In addition, primary care physicians may use low doses but still be intending to treat depression—and in fact this is likely given the presence of a depression code for the visit at drug initiation. Finally, because our data are observational, we cannot be sure that factors associated with prescription were the cause of prescription choices.
In conclusion, our study investigated factors associated with first prescription choice of antidepressants by non-psychiatrists using large-scale EHR data, incorporating both structured and unstructured features. To our knowledge, this is the first study to quantitatively demonstrate that factors affecting the choice of first antidepressant prescription by non-psychiatrists are generally consistent with those recommended or commonly used in psychiatric practice. Future research may benefit from efforts to improve data completeness and cleanliness, phenotyping accuracy, and natural language processing techniques.
Data Availability
The electronic health records data used in this study are not available either to the public or per request for privacy protection.
CONFLICT OF INTEREST
JWS is a member of the Leon Levy Foundation Neuroscience Advisory Board and received an honorarium for an internal seminar at Biogen, Inc. He is PI of a collaborative study of the genetics of depression and bipolar disorder sponsored by 23andMe for which 23andMe provides analysis time as in-kind support but no payments.
ACKNOWLEDGEMENTS
The authors thank the MGB RPDR team for providing the EHR data in an analyzable format.
Appendix A ICD codes mapping to identifying pre-existing co-morbid conditions
Appendix B Concept terms table for depression-related symptoms construction
*This table is constructed starting from the “concept names” (the third column), which are intended to capture the criteria for major depressive episode in DSM-5. Concepts are grouped into categories (the second column), which are given a numeric index (the first column). Each concept is described by one or more terms (the 4th column). Lexical derivatives and their matching regular expression (what the computer reads) are then constructed based on the terms (not shown).
Appendix C Modeled propensity odds ratio and 95% confidence intervals, PCP subset
Appendix D Forest plot for ORs and CIs of globally significant variables for multinominal regression model of antidepressant category propensity, PCP subset
*Parkinson’s disease and Marital Status: Other are not shown for illustration purpose due to very wide confidence interval