Spatiotemporal Forecasting of Opioid-related Fatal Overdoses: Towards Best Practices for Modeling and Evaluation

Kyle Heuton; Jyontika Kapoor; Shikhar Shrestha; Thomas J. Stopka; Michael C. Hughes

doi:10.1101/2024.01.03.24300803

Abstract

To inform public health interventions, researchers have developed models to forecast opioid-related overdose mortality. However, these efforts often have limited overlap in the models and datasets employed, presenting challenges to assessing progress in this field. Furthermore, common error-based performance metrics, such as root mean squared error, are not directly suitable to assess a key modeling purpose: the identification of priority areas for public health interventions. We recommend a new intervention-aware performance metric and establish a set of baseline models with competitive performance. To show how model and metric choice vary across locations, we explore two distinct geographies: Cook County, Illinois and the state of Massachusetts. We introduce a new, intervention-aware evaluation metric: the Percentage of Best Possible Reach (%BPR). The top-performing models based on error-based metrics recommend fixed-budget interventions in areas that do not always reach the most possible overdose events. In Massachusetts the top models, as ranked by our proposed %BPR metric, could have reached 18 additional fatal overdoses per year in our 2020-2021 test period compared to models favored by error-based metrics, assuming the ability to intervene in 100 census tracts out of the 1620 in Massachusetts. We release open code and data for others to build upon.

Repository for code and data: https://github.com/tufts-ml/opioid-overdose-models

Introduction

The opioid overdose epidemic in the United States has resulted in over 450,000 deaths during the past eight years, with more than 80,000 fatal opioid-related overdoses during 2022, the highest yet in a single year.¹ Managing the opioid overdose epidemic requires a constellation of efforts ranging from substance use treatment programs offering medications for opioid use disorder (e.g., methadone, buprenorphine),^2,3 harm reduction programs with provisions for overdose education and naloxone distribution, and comprehensive mental health and social support services.^4–6 Beyond the provision of harm reduction and healthcare services, it is critical for policymaking to address the ever-evolving substance use environment and plan for targeted interventions.

There has been considerable variation in the availability of different types of opioids and the consequent increase in opioid use disorder and opioid-related fatal overdoses in the past two decades. The current fatal opioid overdose epidemic has been characterized by four waves.^7,8 In the early 2000s, prescription opioids drove overdoses. Then, heroin-related deaths surged post-2010, followed by a fentanyl spike in 2013.⁹ This culminated with the fourth wave of combined stimulant and fentanyl-related overdoses.⁸ These shifts in supply accompanied changes in social and ecological conditions, impacting substance use behaviors in varying ways across geographic regions.^10,11 Hence, it is critical to examine local spatiotemporal variation in opioid overdose outcomes, identifying the most-impacted areas and predicting future outcomes to inform preemptive public health responses.

A growing body of research¹² has explored spatiotemporal variations in the opioid overdose landscape. Yet forecasting approaches are in a nascent stage and there are few prediction studies at the population level¹³. Other research focuses on patient-specific risk prediction ^14–16, assuming access to detailed, person-level demographic and medical history data. However, there are immense challenges in compiling rich datasets for person-level analysis. Most state-level public health authorities may not have access to the data and technical resources to conduct individual predictive modeling. Analyses focused on population-level predictions that solely depend on more readily-available aggregated data have the potential to be more easily adopted by public health authorities with limited resources.

While a number of prior studies have identified historical overdose “hotspots”^17–19, fewer studies have forecasted future spatiotemporal overdose spikes. Research that focuses on hotspots often assumes that identified clusters represent the locations where the highest needs will exist in the future. In our analyses, we show that this assumption does not always hold. Interventions and policies that rely on such findings may be acting on lagged measures of opioid burden, thereby limiting their effectiveness. Existing research also spans a broad range of spatial and temporal resolutions. In geographic space, studies range from coarser county-level analysis¹⁷ to finer analyses based on ZIP Codes, census tracts, or census block groups²⁰. Temporally, studies use yearly aggregated¹⁷, quarterly²¹, or weekly²² data.

The overarching goal of our study is to help public health departments make short-term forecasts of future overdose events to enable planning of geographically and temporally targeted interventions that are cognizant of limited resources and needed intervention efficiencies. We focus specifically on development of forecasts at a fine spatial scale (census tracts) at annual intervals, which we selected to match the decision-making needs of public health agencies. In order to understand the role of different forecasting models and evaluation metrics on different communities, our evaluations cover two distinct catchment areas. First, we study Cook County, Illinois, covering over 5 million residents of Chicago and its surroundings, where we forecast death events across 1328 populated census tracts from years 2015-2022 through analysis of publicly available data. Second, we study the state of Massachusetts, where we forecast fatal overdose events across 1620 census tracts representing over 6 million residents from 2001-2021. These locations were selected based on data availability, and to demonstrate the impact of model and metric choice at multiple locations.

To establish best practices for modeling and evaluation, we carefully compare different modeling approaches and performance metrics in each catchment area. We implement a comprehensive set of existing models – including heuristic baselines, statistical models,^12,20,22,23 and neural networks²². We then assess the opioid-related fatal overdose forecasts they produce for both Cook County and Massachusetts at the census-tract-level at annual timeframes. We compute widely-used error-based performance metrics and introduce a new intervention-focused performance metric. Our Python-based software is available for other researchers to reuse and build upon: https://github.com/tufts-ml/opioid-overdose-models.

Methods

Data Sources and Preparation

To assess models, we assembled two datasets suitable for forecasting opioid-related fatal overdoses annually at the census tract level. Our relatively coarse annual temporal scale was chosen to match the frequency at which decision-makers might set new priorities and at which new reliable data become available. We chose the census tract spatial scale due to its potential for targeted interventions at a sub-municipality level. Each census tract by design contains a mean count of 4000 people (with a range of 1200-8000²⁴). For many (but not all) interventions, costs scale with population size, and thus the cost of deploying an intervention in any tract is roughly uniform.

Data source 1: Cook County, Illinois

We obtained fully de-identified data from the Cook County Government Medical Examiner Case Archive²⁵ for opioid-involved overdose deaths from August 2014 (the first date records are available) to May 2023. These data contained every fatal incident under the medical examiner’s jurisdiction that was determined to have any opioid as a primary cause. We used the provided incident latitude and longitude to map each overdose fatality to one of 1328 census tracts. Because the underlying data is in the public domain, we make our processed Cook County data available in our shared repository.

Data source 2: Massachusetts

We obtained death certificate data from the Massachusetts Registry of Vital Records and Statistics for opioid-involved overdose deaths between 2001 and 2019. These deaths were defined as unintentional, intentional, and undetermined drug poisonings containing an opioid code (ICD-10 codes T40.0-T40.4, or T40.6) as a “multiple cause of death”. Each fatal overdose is linked to a calendar date as well as a residential street address. Decedent addresses for the place of residence at the time of the fatal overdose were geocoded, assigning a latitude and longitude to each event, which was then mapped to one of 1620 census tracts using the 2020 census tract boundaries.

Dataset Preparation

For each dataset, we computed the observed number of fatal overdose events y_s,t at time unit t for individuals residing in spatial tract s. We employed open tools²⁶ that utilize the US Census Geocoding API to map each location (street address or latitude/longitude) to its corresponding census tract, using the tract boundaries for the states of Massachusetts and Illinois defined by the U.S. Census Bureau in 2020.

In each dataset, a uniform set of covariates is available as input for prediction models. At each time period t and spatial region s, we provide the history of fatal overdose counts from previous times in that region, as well as the spatial location (numerical latitude and longitude of the tract), and timestamp (numerical time, measured in years since the first available year for that dataset). Further, for each census tract s at time t, an optional covariate vector represents social vulnerability across socioeconomic status, age-related demographics, minority status, housing, and composite dimensions using percentile ranking across all census tracts in that state. These features stem from the five dimensions of the Social Vulnerability Index (SVI)²⁷ published between 2000 and 2018 for every U.S. state. Published tract values are updated every five years; we selected the closest value to each time period t. These SVI features were chosen for their simplicity and portability, mirroring the role of higher-dimensional socioeconomic covariates in previous studies ^23,28.

Metrics

To evaluate model forecasting accuracy against observed mortality in a test period, suitable performance metrics were essential. Our study considered both commonly-used error-based metrics and a new intervention-focused metric.

Error-based metrics

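Model performance is often assessed via summary statistics of the errors between predicted and observed mortality across all S spatial regions in the test period. Within this category, two common metrics are Root Mean Squared Error (RMSE) and Mean Absolute Error (MAE), both defined in Equation 1 below. RMSE calculates the square root of the average squared errors, while MAE computes the average of absolute errors. Writing y_s for the observed count and ŷ_s for the predicted count in region s:

$$\mathrm{RMSE}(y, \hat{y}) = \sqrt{\frac{1}{S} \sum_{s=1}^{S} (y_s - \hat{y}_s)^2}, \qquad \mathrm{MAE}(y, \hat{y}) = \frac{1}{S} \sum_{s=1}^{S} \left| y_s - \hat{y}_s \right| \tag{1}$$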

Both RMSE and MAE have been used as primary metrics to assess opioid overdose forecasts^22,23,29. RMSE is more sensitive to large errors than MAE because each error is squared before averaging.
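As a minimal illustration of the two error-based metrics, the sketch below computes RMSE and MAE with NumPy. It is not the authors' released code; the function names and the toy counts are assumptions made for this example. A single large miss in one tract raises RMSE more than MAE.

```python
import numpy as np


def rmse(y_true, y_pred):
    """Root mean squared error across all S regions."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    return float(np.sqrt(np.mean((y_true - y_pred) ** 2)))


def mae(y_true, y_pred):
    """Mean absolute error across all S regions."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    return float(np.mean(np.abs(y_true - y_pred)))


# Toy example (hypothetical counts): three tracts predicted exactly,
# one tract under-predicted by 4 deaths.
y_obs = [0, 1, 0, 6]
y_hat = [0, 1, 0, 2]

print(rmse(y_obs, y_hat))  # sqrt(16 / 4)
print(mae(y_obs, y_hat))   # 4 / 4
```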

Intervention-focused metric

In our intended use case of mitigating the opioid crisis, stakeholders at a public health agency could use a forecasting model to select a targeted subset of all possible census tracts in which to deploy an intervention in the near future. We assume these actors have a limited budget, allowing intervention deployment in a maximum of K of the S regions in their jurisdiction. For a given model, we can obtain its recommended set of K regions, which we refer to as the intervention set I, in two steps. Step 1: predict mortality counts for all S regions in the test period. Step 2: identify the K regions with the K highest predictions (breaking ties at random), and store these as the recommended set I.

To evaluate such intervention recommendations, we need new metrics. The error-based metrics defined earlier (RMSE or MAE) aggregate error for all S regions equally. They do not directly measure if a forecast’s guess of the K highest-risk areas specifically aligns with the actual areas with highest mortality. We thus wish to design a metric better aligned with how stakeholders will determine and assess intervention priorities. We suggest a model is favorable if, during the test period, the total count of fatal overdose events in its recommended set I of K tracts is as large as possible. This would indicate the model is good at identifying where adverse events will occur, and thus increase the possibility that stakeholders could “reach” and hopefully mitigate these events via interventions targeted at the model’s recommended tracts.

Our proposed metric, “best possible reach” (BPR), assesses a model’s recommendations via a ratio of two numbers. First, the numerator counts how many opioid-related fatalities actually occurred in the model’s recommended set I of size K. Second, the denominator counts the total number of opioid-related fatalities in the K regions that would be chosen with perfect hindsight of the actual count vector y = [y₁, …, y_S] of fatalities in the test period. Writing ŷ for the model’s vector of predicted counts, so that I = TopKInds(ŷ), we define BPR as follows

$$\mathrm{BPR}(y, \hat{y}) = \frac{\sum_{s \in \mathrm{TopKInds}(\hat{y})} y_s}{\sum_{s \in \mathrm{TopKInds}(y)} y_s}$$

where TopKInds(y) denotes a function that returns the distinct indices of the K largest elements of vector y.

For public health applications, BPR holds a practical interpretation as the proportion of fatal overdose events the current model’s interventions would reach, compared to the perfect foresight of future events. BPR’s numerical value by definition will have a minimum of 0.0 and a maximum of 1.0. We typically convert the fractional BPR to a percentage ranging from 0-100%, denoted as %BPR. A higher %BPR value signifies a more effective model at deciding where to intervene. A value of 100% indicates perfect decision-making given a limited budget: there is no other set of K regions any model could have recommended that would reach more events.
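The ratio described above can be sketched in a few lines of NumPy. This is an illustrative sketch under stated assumptions, not the authors' implementation: the names `top_k_inds` and `bpr` are hypothetical, ties among predictions are broken by a random shuffle, and the function returns NaN when the best possible reach is zero (a case the text does not discuss).

```python
import numpy as np


def top_k_inds(values, K, rng):
    """Indices of the K largest entries, breaking ties at random."""
    values = np.asarray(values, dtype=float)
    perm = rng.permutation(len(values))            # random order for ties
    order = perm[np.argsort(-values[perm], kind="stable")]
    return order[:K]


def bpr(y_true, y_pred, K, rng=None):
    """Fraction of the best possible reach achieved by the top-K predictions."""
    rng = np.random.default_rng(0) if rng is None else rng
    y_true = np.asarray(y_true, dtype=float)
    if not 1 <= K <= len(y_true):
        raise ValueError("K must be between 1 and the number of regions")
    reached = y_true[top_k_inds(y_pred, K, rng)].sum()  # numerator
    best = np.sort(y_true)[-K:].sum()                   # denominator
    return float(reached / best) if best > 0 else float("nan")


# Toy example (hypothetical counts for S = 5 tracts, budget K = 2).
y_true = [5, 0, 3, 1, 2]
y_pred = [4.0, 2.5, 0.5, 0.2, 1.0]
print(100 * bpr(y_true, y_pred, K=2))  # %BPR: reached 5 of a possible 8
```

In this toy case the model selects tracts 0 and 1, reaching 5 events, while perfect hindsight would select tracts 0 and 2 and reach 8.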

Although independently developed by our team (see our preliminary workshop paper²¹), our proposed BPR metric closely resembles the metric suggested by a recent pre-registered trial³⁰ and a later feasibility study²⁰ to evaluate opioid overdose forecasts in Rhode Island. The primary distinction lies in the denominator: our BPR sums only the top K indices, while the alternative includes all S regions. We prefer our definition due to %BPR’s consistent range of 0-100%, with 100% representing a model that could not have made a better decision given the limited budget of K regions. In contrast, the alternative definition’s maximum value fluctuates based on observed data in the test period, making it difficult to know if another model could have done better.

Models

Below we define a range of possible forecasting methods that can be used for our where-to-intervene prediction task. All methods are trained and evaluated via a common protocol using the same provided splits (train/validation/test) of the two available datasets (Cook County and Massachusetts). Thorough details about model fitting and hyperparameter tuning are provided in the Supplement. This study was reviewed by the Tufts University Health Sciences Institutional Review Board and deemed to be non-human subjects research.

Simple Baseline Models

We study several easy-to-implement baseline models to highlight their comparative strengths. Public health practitioners seeking data-driven allocation of scarce intervention resources without sophisticated modeling could easily use these approaches.

Our first baseline, dubbed all zeroes, predicts uniformly across all S tracts that zero fatal overdoses will occur in the test period. This model, by definition, ranks all tracts as equally high-risk, so for metrics like BPR that require a set of K recommended regions we report an average over many samples of K distinct regions selected uniformly at random.

Our second baseline, known as last year, predicts that the mortality at the next year in a specific location will mirror the mortality observed at the most recent recorded year for that location.

The final baseline we consider is a historical average, which predicts the next timestep’s mortality as an average of all mortality counts observed over the preceding W timesteps. Here, the appropriate “look back” period length W serves as a model hyperparameter.
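The last year and historical average baselines can be written in a few lines each. The sketch below assumes the count history is stored as an array with one row per tract and one column per year, oldest first; that layout, the function names, and the toy counts are assumptions for illustration.

```python
import numpy as np


def last_year(counts):
    """Predict next year's count as the most recent observed count."""
    counts = np.asarray(counts, dtype=float)
    return counts[:, -1]


def historical_average(counts, W):
    """Predict next year's count as the mean of the last W years."""
    counts = np.asarray(counts, dtype=float)
    if not 1 <= W <= counts.shape[1]:
        raise ValueError("W must be between 1 and the number of years")
    return counts[:, -W:].mean(axis=1)


# Hypothetical history: 3 tracts (rows) by 4 years (columns).
counts = np.array([[0, 2, 1, 3],
                   [4, 4, 2, 0],
                   [1, 0, 0, 1]])

print(last_year(counts))              # most recent column
print(historical_average(counts, 2))  # mean of the last two columns
```

The look-back length W would be chosen on a validation year, as described in the experimental protocol.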

Complex Models

Next, we consider several more flexible models with parameters that can be fit to the data. The first is the weighted historical average: a weighted average of the previous W years of overdose events. This is more flexible than historical average, because each year’s count is multiplied by a customized weight coefficient.

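The next model we consider is a Generalized Linear Model (GLM) with a Poisson likelihood. This model assumes that fatal overdose count y for spatial tract s at time t is modeled by a Poisson distribution where the log of the mean parameter μ is a linear function of the covariate vector x for that tract and time, with intercept β₀ and weight vector β fit to the training data:

$$y_{s,t} \sim \mathrm{Poisson}(\mu_{s,t}), \qquad \log \mu_{s,t} = \beta_0 + \beta^\top x_{s,t}$$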

GLMs have limited flexibility due to the assumption of a monotonic relationship between covariates and fatal event count. We thus also include a Gradient Boosted Trees³¹ model, a popular more flexible ensemble of regression trees. Previous studies of opioid forecasting ^20,32 have used similar tree ensembles.

In addition, we include three spatially-sophisticated statistical models used in recent opioid overdose forecasting applications. First, we include a Gaussian Process model²⁰ for its ability to flexibly capture spatial and temporal correlations. We use similar covariance functions (“kernels”) to prior overdose forecasting work (details in the supplement). Next, Bayesian Spatio-Temporal (BST) models²³ use a Markov Random Field to model inherent spatial and temporal trends. Thirdly, NBSpLag denotes a negative binomial regression model with spatially lagged features²⁸, where each tract is informed by its spatial neighbors. In a variable selection experiment²⁸, these spatially lagged covariates were found to be the most predictive features. Unlike previous evaluations of both BST²³ and NBSpLag models, our study compares to the rich set of baselines described above.

Finally, we include CASTNet²², a neural network approach custom developed for opioid-overdose forecasting. Unlike previous methods, CASTNet employs multi-head attentional networks that allow predictions at a given location to be informed by learned “communities” of regions.

Experimental Protocol

We applied each of the models described above separately to the Cook County, IL dataset and the MA dataset. In each case, we sought to use available historical counts of opioid-related fatal overdoses (together with other covariates described above) to predict future fatal overdose counts in each census tract. We further assessed how these predictions can be used to recommend where to intervene in the near future.

For training on a dataset, for all S regions we assemble pairs (x_s,t, y_s,t) of covariate vector and fatality count for each year in the training set (t = 2010-2018 for Massachusetts, 2015-2019 for Cook County). The historical covariates inside each x vector can summarize the recent history of W previous years (W=10 for Massachusetts, W=5 for Cook County). Hyperparameters are chosen to maximize performance as assessed by BPR on a validation set of data from the year prior to evaluation (2019 for Massachusetts, 2020 for Cook County). Finally, models are evaluated on predictions for the final two years (2020-2021 in Massachusetts, 2021-2022 in Cook County).
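One way to assemble such training pairs is to slide a window of W years over each tract's count history. The sketch below shows only the lagged-count part of the covariate vector; it omits location, timestamp, and SVI features. The array layout and the name `make_pairs` are assumptions for illustration, not the authors' code.

```python
import numpy as np


def make_pairs(counts, W):
    """Build (x, y) pairs from a (tracts x years) array of counts.

    Each x holds the W counts preceding a target year; y is that year's count.
    """
    counts = np.asarray(counts, dtype=float)
    n_years = counts.shape[1]
    if not 1 <= W < n_years:
        raise ValueError("W must leave at least one target year")
    X, y = [], []
    for t in range(W, n_years):
        X.append(counts[:, t - W:t])  # previous W years for every tract
        y.append(counts[:, t])        # count to be predicted
    return np.vstack(X), np.concatenate(y)


# Hypothetical history: 3 tracts by 4 years, look-back W = 2.
counts = np.array([[0, 2, 1, 3],
                   [4, 4, 2, 0],
                   [1, 0, 0, 1]])
X, y = make_pairs(counts, W=2)
print(X.shape, y.shape)
```

To respect the temporal split, only columns up to the last training year would be passed in when fitting, so that no validation or test counts enter the targets.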

From each model, we obtained predictions for each of the S tracts in each test year. We then computed each evaluation metric (RMSE, MAE, BPR) as well as an interval that quantifies our uncertainty in its precise value. Inspired by resampling methods for uncertainty quantification³³, for each test year we obtained 50 different without-replacement samples of 1370 of the 1620 locations in MA (1078 of the 1328 in Cook County, IL), and recorded all metrics of interest for each sample. Our reported intervals quantify the min-max range of these 50 samples. We chose the number of tracts retained in each catchment area so that each sample on average retains 85% of all fatal overdose events.
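The resampling procedure above can be sketched as follows: draw subsets of tracts without replacement, recompute the metric on each subset, and report the minimum and maximum. The function name, the synthetic data, and the use of MAE as the example metric are assumptions for illustration.

```python
import numpy as np


def subsample_interval(y_true, y_pred, metric, n_keep, n_samples=50, seed=0):
    """Min-max range of a metric over without-replacement tract subsamples."""
    rng = np.random.default_rng(seed)
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    values = []
    for _ in range(n_samples):
        idx = rng.choice(len(y_true), size=n_keep, replace=False)
        values.append(metric(y_true[idx], y_pred[idx]))
    return min(values), max(values)


def mae(y_true, y_pred):
    return float(np.mean(np.abs(y_true - y_pred)))


# Synthetic stand-in for one test year: 1620 tracts, keep 1370 per sample.
rng = np.random.default_rng(1)
y_obs = rng.poisson(0.5, size=1620)
y_hat = np.clip(y_obs + rng.normal(0.0, 0.5, size=1620), 0.0, None)

lo, hi = subsample_interval(y_obs, y_hat, mae, n_keep=1370)
print(lo, hi)
```

For BPR, the metric would be recomputed on each subsample, including its top-K selection and its hindsight-optimal denominator.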

Results

Results from the experiments conducted on Massachusetts and Cook County data are summarized in Table 1 and Table 2, respectively. The best model(s) can vary depending on the chosen evaluation metric.

Table 1.

Comparison of fatal opioid-related overdose prediction models trained on Massachusetts decedent data from 2010-2019, then evaluated on data from 2020 and 2021.

Table 2.

Comparison of fatal opioid-related overdose prediction models trained on Cook County, Illinois decedent data, from 2015 to 2020, then evaluated on data from 2021 and 2022.

In both catchment areas, we see reasons to prefer our proposed BPR metric to alternatives when the goal is effectively prioritizing where to intervene. First, in Massachusetts, both the Gaussian Process (GP) and Bayesian Spatiotemporal model (BST) have top performance as assessed by MAE and RMSE. However, the BST has higher %BPR than the GP (62.0% compared to 58.2%). Interventions guided by the BST model would have the potential to preemptively identify 18 additional fatal overdoses per year in the top 100 census tracts. Similarly, in Cook County the Gradient Boosted Trees model with SVI covariates has superior MAE and RMSE to the GLM model, yet has worse %BPR (77.1% versus 79.4% for GLM). Interventions guided by the GLM model (preferred via the %BPR metric) could reach 15 more fatal overdose events annually in Cook County.

We also observe that while complex models like BST do well in both catchment areas, so do the simple historical average baseline and its weighted extension. In Massachusetts, historical average delivers a BPR and MAE that fall within the uncertainty intervals of the best performing models. The weighted extension’s ultimate %BPR is so close to the top method (NBSpLag) that the difference amounts to identifying fewer than 2 additional overdose events annually. In Cook County, the historical average baseline delivers competitive scores (within the intervals of the best models) as assessed by all three metrics (BPR, MAE, and RMSE); the best model (BST) here would reach fewer than 1 additional overdose event annually.

Discussion

Our study’s first contribution to the science of spatiotemporal forecasting of opioid-related overdose deaths is highlighting the need for extensive comparisons to a robust suite of simple baselines. This lesson matches reports^34,35 from across the sciences, especially efforts in health^36,37 and the social sciences³⁸, that suggest advanced modeling techniques may not substantially outperform simpler baselines on some difficult prediction tasks. Our findings are similar in both the large state of MA and the far denser Cook County. In each catchment area, across both intervention-aware and error-based metrics, we found that a historical average baseline performed competitively (within the uncertainty bounds of top-ranked statistical models). The key to success here is careful selection of the number of recent years in the look-back period, following standard best practices for hyperparameter tuning.^39,40 If this simple baseline model yields such high performance, it raises questions about the rationale for adopting more complex counterparts that require specialized expertise. Many prior overdose forecasting studies^20,23 omit such baselines entirely, or include only poorly performing ones such as the last-year²⁸ model or a too-long historical average²². For all future studies of opioid overdose forecasting, we recommend including historical averages with tuned look-back periods.

Our second contribution, developed in parallel to contemporary work³⁰, is a new metric – percentage of best possible reach (%BPR) – which evaluates predictions based on their utility for informing decisions about where to intervene. In both Massachusetts and Cook County, we demonstrate that using %BPR as an evaluation metric can lead to different model rankings and different recommendations of where to intervene than error-based metrics like RMSE, improving the total number of annual fatal overdose events that could be preemptively identified by 15 in IL and 18 in MA. This is an important finding, as we believe that intervention-aware metrics like %BPR more closely reflect how public health agencies wish to use forecasting models to inform their intervention strategies.²⁰

Lastly, we emphasize that our study is designed to be reproducible and open to extensions by other researchers. We released the software for fitting all models and computing all metrics under a permissive open-source license (link in Introduction). We also released our cleaned version of the public-domain Cook County dataset as well as all preprocessing code. Historically, overdose forecasting studies have not often shared code and have focused on private bespoke datasets, often reasonably due to privacy issues around decedent data. Enabling diverse researchers to pursue a common prediction task, especially via the availability of a public dataset for evaluation, has been a key driver of progress in predictive modeling⁴¹.

Limitations

This study has several limitations. First, our findings come from only two places (Massachusetts and Cook County), and may not be generalizable to other counties, states, or public health jurisdictions. Cook County is predominantly urban, while Massachusetts is a large state with substantial urban, rural and suburban areas. The spatiotemporal trends in opioid-related mortality could thus be dramatically different in these two locations, necessitating different model rankings and intervention strategies. Furthermore, we acknowledge that not all Cook County deaths are reported to the Medical Examiner. The Medical Examiner’s jurisdiction only covers specific fatalities for cause-of-death determination.

Second, there are limitations to our analysis of the proposed BPR metric. For simplicity, all results here assumed an intervention budget of K=100 census tracts. Different K values may lead to different method rankings. Our suggested BPR metric is intended for identifying where to intervene to relieve high overall burden. However, it does not directly prioritize the rate of change. Interventions aimed to reduce risk in communities that are at very high risk but do not already have a high burden may not be correctly identified using BPR.

Finally, other choices of covariates are possible. Our focus on a limited set of covariates, derived from the SVI of the American Community Survey, was an intentional choice to ensure the nationwide availability of these covariates. Certain jurisdictions may possess useful alternate data sources, such as emergency medical service (EMS) calls, insurance claims data, and measures for a mix of linked administrative datasets⁴², necessitating additional covariate consideration for enhanced model performance.

Conclusion

In an effort to better predict future fatal opioid-related overdose spikes and inform future harm-reducing interventions, we compared overdose forecasting options. Our study reinforces the value of intervention-aware metrics like %BPR in evaluating models for opioid overdose mortality forecasting. Our study also suggests that simple baselines like (weighted) historical averages should be included in future analyses, as more sophisticated and expensive-to-train models may not substantially outperform these baselines. As the opioid crisis continues to evolve, we hope our findings and our open-source resources enable improved model comparisons and better data-informed public health interventions that ultimately reduce the harm caused by overdose events.

Data Availability

We share pre-processed data for the Cook County dataset, which is freely available. These data and all software are available at https://github.com/tufts-ml/opioid-overdose-models. The Massachusetts data belongs to the Massachusetts Department of Public Health.

Supplementary Material

Additional Model details

Model hyperparameters are selected using the year prior to the test years as a validation year: 2019 for Massachusetts and 2020 for Cook County. When validating, models are trained through the year prior to the validation year. For evaluation on the test years, models are retrained through the validation year. We select the hyperparameter configuration with the highest BPR on the validation year. In Massachusetts, we consider using up to 10 years of historical data when training (2010-2019). In Cook County there are fewer years of available historical data, and so training is limited to 2016-2020.

All-Zeroes

The All-Zeroes model is presented to highlight two things: the BPR of a naive policy, and the RMSE and MAE of a naive model. This model is very simple: every prediction is always 0 fatal overdoses. However, this presents a challenge for calculating BPR: what are the top K locations if every location is tied? In this case, we draw 10,000 random samples of K distinct locations, use each sample as the intervention set in the numerator of BPR, and report the average BPR over the 10,000 samples.
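The averaging step for the all-tied case can be sketched as a small Monte Carlo loop. The function name and toy counts below are assumptions for illustration; the 10,000 samples follow the description above.

```python
import numpy as np


def random_policy_bpr(y_true, K, n_samples=10_000, seed=0):
    """Average BPR when the K intervention tracts are picked at random."""
    rng = np.random.default_rng(seed)
    y_true = np.asarray(y_true, dtype=float)
    best = np.sort(y_true)[-K:].sum()  # reach with perfect hindsight
    reached = np.empty(n_samples)
    for i in range(n_samples):
        idx = rng.choice(len(y_true), size=K, replace=False)
        reached[i] = y_true[idx].sum()
    return float(reached.mean() / best)


# Toy example (hypothetical counts): S = 8 tracts, budget K = 2.
y_true = [5, 0, 3, 1, 2, 0, 0, 1]
estimate = random_policy_bpr(y_true, K=2)

# For uniform sampling the expected reach is (K / S) * total events,
# so the estimate should be close to (2 / 8 * 12) / 8 = 0.375.
print(estimate)
```

The closed-form expectation in the final comment gives a quick check on the simulation.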

Last Year

For this model, the prediction is simply the previous year’s fatal overdose count. When predicting for the second evaluation year (2021 in Massachusetts and 2022 in Cook County), the fatal overdose count from the first evaluation year is used. This is subtly different from the behavior of the regression models, where the models are trained using no data from the evaluation years.

Historical Average

In this model, the output is an unweighted average of historical mortality. For both Massachusetts and Cook County, a look-back period of 4 years is selected using the validation year. When predicting for the second evaluation year (2021 in Massachusetts and 2022 in Cook County), the fatal overdose count from the first evaluation year is used. This is subtly different from the behavior of the regression models, where the models are trained using no data from the evaluation years.

Weighted Historical Average

This model is a linear regression on historical fatal overdose counts, using only past years’ mortality as predictors. Scikit-learn’s⁴³ ridge regression is used, which performs L2-regularized regression. The regularization strength α is selected via hyperparameter search by trying 29 evenly spaced values on a log scale between 10^-6 and 10^8. For Massachusetts, 10 years of historical data and an α of 10^4.5 are used. For Cook County, 3 years of historical data and an α of 10^4.5 are used.
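The α grid and a single ridge fit might look like the sketch below. It assumes scikit-learn is installed; the training data are synthetic, the 3-year look-back is only an example, and keeping the default intercept is an assumption, since the text does not say whether one is fit.

```python
import numpy as np
from sklearn.linear_model import Ridge

# 29 log-spaced candidates from 1e-6 to 1e8: exponents step by 0.5.
alphas = np.logspace(-6, 8, num=29)

# Synthetic stand-in for lagged counts: 200 tract-years, 3 prior years each.
rng = np.random.default_rng(0)
X_train = rng.poisson(1.0, size=(200, 3)).astype(float)
y_train = rng.poisson(X_train.mean(axis=1)).astype(float)

# Fit one candidate; in the study, alpha is chosen by validation-year BPR.
model = Ridge(alpha=10 ** 4.5).fit(X_train, y_train)
print(model.coef_)       # one learned weight per prior year
print(model.intercept_)
```

A strongly regularized fit such as this one shrinks the per-year weights toward zero, leaving the intercept to carry much of the prediction.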

Linear Poisson GLM

This uses Scikit-learn’s⁴³ Generalized Linear Model with a Poisson likelihood and a log link. The hyperparameters explored are the number of prior years of mortality to include in the model and the L2 regularization strength α. Up to 10 years of previous mortality were considered for Massachusetts and 6 years for Cook County. The model is run with and without social vulnerability covariates.

Without social vulnerability covariates: In Massachusetts, 10 years of prior mortality are used in the model with an α of 1. In Cook County, 5 years of historical data are used with an α of 10^0.5.

With social vulnerability covariates: In Massachusetts, 6 years of prior mortality are used in the model with an α of 1. In Cook County, 5 years of historical data are used with an α of 10.
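A minimal sketch of fitting such a model with scikit-learn's `PoissonRegressor` is shown below. The data are synthetic and the feature layout (five prior-year counts) is an assumption; only the estimator class and its `alpha` penalty correspond to the description above.

```python
import numpy as np
from sklearn.linear_model import PoissonRegressor

# Synthetic stand-in: 300 tract-years with 5 prior-year counts as features.
rng = np.random.default_rng(0)
X = rng.poisson(1.0, size=(300, 5)).astype(float)
y = rng.poisson(0.2 + X.mean(axis=1))

# alpha is the L2 regularization strength; the log link is built in.
glm = PoissonRegressor(alpha=1.0, max_iter=1000)
glm.fit(X, y)

pred = glm.predict(X)  # predicted mean counts, always positive
print(pred[:5])
```

Because predictions are the exponential of a linear function, they are strictly positive, which suits count outcomes.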

Gradient Boosted Trees

This uses Scikit-learn’s⁴³ Histogram-based Gradient Boosted Trees model. We considered both squared-error and Poisson loss functions. We tested using both 32 and 128 maximum iterations. For the minimum samples per leaf, we used 9 equally spaced values between 2⁰ and 2⁸ on a log-2 scale. For the maximum number of leaf nodes, we tested 5 equally spaced values between 2⁴ and 2⁸ on a log-2 scale. Up to 10 years of previous mortality were considered for Massachusetts and 6 years for Cook County.
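The search space described above can be enumerated in plain Python. The sketch assumes a full cross-product of the listed values, which the text does not state explicitly. The dictionary keys follow the parameter names of scikit-learn's `HistGradientBoostingRegressor`, and the loss name `"squared_error"` assumes a recent scikit-learn release.

```python
from itertools import product

losses = ["squared_error", "poisson"]
max_iters = [32, 128]
min_samples_leaf = [2 ** p for p in range(0, 9)]  # 1, 2, 4, ..., 256
max_leaf_nodes = [2 ** p for p in range(4, 9)]    # 16, 32, 64, 128, 256

# One dict per candidate configuration (assumed full factorial grid).
grid = [
    dict(loss=loss, max_iter=n_iter, min_samples_leaf=leaf, max_leaf_nodes=nodes)
    for loss, n_iter, leaf, nodes in product(
        losses, max_iters, min_samples_leaf, max_leaf_nodes
    )
]
print(len(grid))
print(grid[0])
```

Each dictionary could then be unpacked into the estimator's constructor and scored on the validation year.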

Gaussian Process Models

Here we use Scikit-learn’s⁴³ Gaussian Process (GP) implementation. Due to the high computational cost of GP models, and following prior work²⁰, we only consider up to 5 years of historical data for both Massachusetts and Cook County, and we omit social vulnerability covariates. As in the prior work, we use a kernel that additively combines a Radial-Basis Function (RBF) kernel with a white noise kernel. The initial length scale of the RBF kernel is set to 0.5, and the noise level bounds on the white noise kernel are set to (10^-5, 10¹). The outcomes are normalized to zero mean and unit variance. Up to 9 restarts of the optimizer are used.
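The kernel and optimizer settings above map onto scikit-learn roughly as in this sketch. The input features (three generic columns here) and the synthetic outcomes are assumptions, since the exact inputs are not restated in this section; `random_state` is added only to make the example reproducible.

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF, WhiteKernel

# RBF with initial length scale 0.5, plus white noise with bounded noise level.
kernel = RBF(length_scale=0.5) + WhiteKernel(noise_level_bounds=(1e-5, 1e1))

gp = GaussianProcessRegressor(
    kernel=kernel,
    normalize_y=True,        # outcomes scaled to zero mean, unit variance
    n_restarts_optimizer=9,  # extra optimizer starts for the kernel hyperparameters
    random_state=0,
)

# Synthetic data: 60 hypothetical observations with 3 input features.
rng = np.random.default_rng(0)
X = rng.uniform(0.0, 1.0, size=(60, 3))
y = rng.poisson(1.0 + 2.0 * X[:, 0]).astype(float)

gp.fit(X, y)
mean, std = gp.predict(X[:5], return_std=True)
print(gp.kernel_)  # kernel with optimized hyperparameters
```

Exact GP fitting scales cubically in the number of training points, which is consistent with the restriction to a shorter history noted above.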

CASTNet

While we attempted to follow the original implementation of CASTNet²² as closely as possible, we made some significant modifications. The original CASTNet was concerned with predictions at a fine temporal resolution (weekly). However, our initial experiments found little benefit at this scale, so we consider a much coarser scale: annual predictions. We lack the high-resolution crime data that the original CASTNet project uses as dynamic covariates. Furthermore, while we do have demographic and economic variables (the 5-dimensional Social Vulnerability Index), these are dynamic rather than static at the annual scale; accordingly, they are used as the only dynamic covariates. For static covariates, only the latitude and longitude of the census tract are used. We use the hyperparameters selected by the original work: the LSTMs have a hidden unit size of 32 with a dropout value of 0.1, the group-level regularization coefficient is 0.0025, and the optimizer is Adam with a learning rate of 0.5. Given that we have 2 evaluation years, we train the model twice, once with a lag time of 1 year and again with a lag time of 2 years. The 1-year lag model is used to predict for the first evaluation year (2020 in MA and 2021 in Cook County) and the 2-year lag model is used to predict for the second. This way, no data from the evaluation years leaks into training.
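To make the two-lag setup concrete, the sketch below builds training pairs for a 1-year and a 2-year lag from the same history. It illustrates the idea only and is not CASTNet itself: the helper `make_lagged_pairs`, the window length of 3, and the toy array are all hypothetical.

```python
import numpy as np

def make_lagged_pairs(counts, window, lag):
    """Pair each `window`-year input with the count `lag` years after the window ends.

    counts: (tracts, years) array of training-period counts, oldest year first.
    """
    n_years = counts.shape[1]
    X, y = [], []
    for end in range(window, n_years - lag + 1):
        X.append(counts[:, end - window:end])  # input years [end-window, end)
        y.append(counts[:, end + lag - 1])     # target year, `lag` steps ahead
    return np.concatenate(X), np.concatenate(y)

# Toy training history: 4 hypothetical tracts, 6 training years.
counts = np.arange(24).reshape(4, 6)

X1, y1 = make_lagged_pairs(counts, window=3, lag=1)  # for the 1-year-ahead model
X2, y2 = make_lagged_pairs(counts, window=3, lag=2)  # for the 2-year-ahead model

# At prediction time both models would receive the same final training window;
# the lag-1 model targets the first evaluation year, the lag-2 model the second.
last_window = counts[:, -3:]
```

Because the 2-year-ahead model never needs the first evaluation year as an input, both forecasts rely only on training-period data.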

Bayesian Spatiotemporal Models

In the original work²³ on Bayesian spatiotemporal models for opioid overdose forecasting, three separate models are proposed. All three models use a first-order autoregressive (AR(1)) term to model temporal dependence, and two of the models use spatial correlations. Because all three models are reported to behave similarly, we use what is called “Model 1”, which lacks spatial correlations. Furthermore, the authors state that any number of temporal terms could be used, but do not specify which. Here, we choose a linear temporal term. This model is implemented using R-INLA⁴⁴. Linear coefficients are used when adding the social vulnerability covariates.

Negative Binomial Regression with Spatially Lagged Covariates

The authors of this method²⁸ helpfully provide code to run this model, which we were able to use with little modification. Census-tract-level population estimates are taken from the same survey data as the social vulnerability covariates. The carrying capacity is initialized to 5% of the population in the first year of training data (2010 for MA, 2015 for Cook County).

Acknowledgments

We are grateful to both the Massachusetts Department of Public Health (DPH) and the Cook County Medical Examiner’s Office for data access.

Author KH was supported by the U.S. National Science Foundation under NSF award NRT-HDR 2021874. Author JK’s effort during a summer research program for undergraduates hosted at Tufts University was supported by NSF award REU-2149871. Authors KH, TJS, and MCH gratefully acknowledge support for early work on this project from NSF award IIS-1908617.

References

1. Ahmad FB, Cisewski JA, Rossen LM, Sutton P. Provisional drug overdose death counts. Published September 6, 2023. Accessed September 17, 2023. https://www.cdc.gov/nchs/nvss/vsrr/drug-overdose-data.htm
2. Mattick RP, Breen C, Kimber J, Davoli M. Buprenorphine maintenance versus placebo or methadone maintenance for opioid dependence. Cochrane Database Syst Rev. 2014;(2).
3. Leshner AI, Mancher M, eds. Medications for Opioid Use Disorder Save Lives. National Academies Press; 2019. doi:10.17226/25310
4. Clark AK, Wilder CM, Winstanley EL. A systematic review of community opioid overdose prevention and naloxone distribution programs. J Addict Med. 2014;8(3):153–163. doi:10.1097/ADM.0000000000000034
5. Hawk KF, Vaca FE, D’Onofrio G. Reducing Fatal Opioid Overdose: Prevention, Treatment and Harm Reduction Strategies. Yale J Biol Med. 2015;88(3):235–245.
6. Peiper NC, Clarke SD, Vincent LB, Ciccarone D, Kral AH, Zibbell JE. Fentanyl test strips as an opioid overdose prevention strategy: Findings from a syringe services program in the Southeastern United States. Int J Drug Policy. 2019;63:122–128. doi:10.1016/j.drugpo.2018.08.007
7. Ciccarone D. The triple wave epidemic: Supply and demand drivers of the US opioid overdose crisis. Int J Drug Policy. 2019;71:183–188. doi:10.1016/j.drugpo.2019.01.010
8. Ciccarone D. The rise of illicit fentanyls, stimulants and the fourth wave of the opioid overdose crisis. Curr Opin Psychiatry. 2021;34(4):344–350. doi:10.1097/YCO.0000000000000717
9. Understanding the Opioid Overdose Epidemic | Opioids | CDC. Published August 8, 2023. Accessed September 17, 2023. https://www.cdc.gov/opioids/basics/epidemic.html
10. Stopka TJ, Larochelle MR, Li X, et al. Opioid-related mortality: Dynamic temporal and spatial trends by drug type and demographic subpopulations, Massachusetts, 2005–2021. Drug Alcohol Depend. 2023;246:109836. doi:10.1016/j.drugalcdep.2023.109836
11. Friedman SR, Krawczyk N, Perlman DC, et al. The Opioid/Overdose Crisis as a Dialectics of Pain, Despair, and One-Sided Struggle. Front Public Health. 2020;8. Accessed November 8, 2023. https://www.frontiersin.org/articles/10.3389/fpubh.2020.540423
12. Marks C, Carrasco-Escobar G, Carrasco-Hernández R, et al. Methodological approaches for the prediction of opioid use-related epidemics in the United States: a narrative review and cross-disciplinary call to action. Transl Res J Lab Clin Med. 2021;234:88–113. doi:10.1016/j.trsl.2021.03.018
13. Borquez A, Martin NK. Fatal overdose: Predicting to prevent. Int J Drug Policy. 2022;104:103677. doi:10.1016/j.drugpo.2022.103677
14. Tseregounis IE, Henry SG. Assessing opioid overdose risk: a review of clinical prediction models utilizing patient-level data. Transl Res J Lab Clin Med. 2021;234:74–87. doi:10.1016/j.trsl.2021.03.012
15. Lo-Ciganic WH, Donohue JM, Hulsey EG, et al. Integrating human services and criminal justice data with claims data to predict risk of opioid overdose among Medicaid beneficiaries: A machine-learning approach. PloS One. 2021;16(3):e0248360. doi:10.1371/journal.pone.0248360
16. Lo-Ciganic WH, Donohue JM, Yang Q, et al. Developing and validating a machine-learning algorithm to predict opioid overdose in Medicaid beneficiaries in two US states: a prognostic modelling study. Lancet Digit Health. 2022;4(6):e455–e465. doi:10.1016/S2589-7500(22)00062-0
17. Acharya A, Izquierdo AM, Gonçalves SF, et al. Exploring County-level Spatio-temporal Patterns in Opioid Overdose Related Emergency Department Visits. MedRxiv Prepr Serv Health Sci. Published online 2022.
18. Herlands W, McFowland III E, Wilson AG, Neill DB. Gaussian Process Subset Scanning for Anomalous Pattern Detection in Non-iid Data. In: Artificial Intelligence and Statistics; 2018. Accessed November 6, 2018. http://proceedings.mlr.press/v84/herlands18a/herlands18a.pdf
19. Lu H, Crawford FW, Gonsalves GS, Grau LE. Geographic and temporal trends in fentanyl-detected deaths in Connecticut, 2009-2019. Ann Epidemiol. 2023;79:32–38. doi:10.1016/j.annepidem.2023.01.009
20. Allen B, Neill DB, Schell RC, et al. Translating predictive analytics for public health practice: A case study of overdose prevention in Rhode Island. Am J Epidemiol. Published online May 17, 2023:kwad119. doi:10.1093/aje/kwad119
21. Heuton K, Shrestha S, Stopka TJ, Pustz J, Liu LP, Hughes MC. Predicting Spatiotemporal Counts of Opioid-related Fatal Overdoses via Zero-Inflated Gaussian Processes. 2022 NeurIPS Workshop Gaussian Process Spatiotemporal Model Decis-Mak Syst. Published online December 2022. Accessed May 24, 2023. https://par.nsf.gov/biblio/10389257-predicting-spatiotemporal-counts-opioid-related-fatal-overdoses-via-zero-inflated-gaussian-processes
22. Ertugrul AM, Lin YR, Taskaya-Temizel T. CASTNet: Community-Attentive Spatio-Temporal Networks for Opioid Overdose Forecasting. In: Machine Learning and Knowledge Discovery in Databases: European Conference (ECML PKDD); 2019. Accessed September 23, 2022. http://arxiv.org/abs/1905.04714
23. Bauer C, Zhang K, Li W, et al. Small Area Forecasting of Opioid-Related Mortality: Bayesian Spatiotemporal Dynamic Modeling Approach. JMIR Public Health Surveill. 2023;9(1):e41450. doi:10.2196/41450
24. US Census Bureau. Glossary. Censusgov Geogr Program. Published online 2022. Accessed September 22, 2022. https://www.census.gov/programs-surveys/geography/about/glossary.html#par_textimage_13
25. Medical Examiner Case Archive | Cook County Open Data. Accessed September 18, 2023. https://datacatalog.cookcountyil.gov/Public-Safety/Medical-Examiner-Case-Archive/cjeq-bs86
26. Freeman N. Censusgeocode: Thin Python Wrapper for the US Census Geocoder. Published online 2022. Accessed September 23, 2022. https://github.com/fitnr/censusgeocode
27. CDC ATSDR. Social Vulnerability Index 2018 Database for Massachusetts. Published online 2018.
28. Marks C, Abramovitz D, Donnelly CA, et al. Identifying counties at risk of high overdose mortality burden during the emerging fentanyl epidemic in the USA: a predictive statistical modelling study. Lancet Public Health. 2021;6(10):e720–e728. doi:10.1016/S2468-2667(21)00080-3
29. Sumetsky N. Spatiotemporal Modeling of Opioid Abuse and Dependence Outcomes Using Bayesian Hierarchical Methods. Master’s Thesis. University of Pittsburgh Graduate School of Public Health; 2017.
30. Marshall BDL, Alexander-Scott N, Yedinak JL, et al. Preventing Overdose Using Information and Data from the Environment (PROVIDENT): protocol for a randomized, population-based, community intervention trial. Addict Abingdon Engl. 2022;117(4):1152–1162. doi:10.1111/add.15731
31. Friedman JH. Greedy function approximation: A gradient boosting machine. Ann Stat. 2001;29(5):1189–1232. doi:10.1214/aos/1013203451
32. Schell RC, Allen B, Goedel WC, et al. Identifying Predictors of Opioid Overdose Death at a Neighborhood Level With Machine Learning. Am J Epidemiol. 2022;191(3):526. doi:10.1093/aje/kwab279
33. Spatial sampling and resampling for Machine Learning. doi:10.5281/zenodo.5886678
34. Hand DJ. Classifier Technology and the Illusion of Progress. Stat Sci. 2006;21(1):1–14. doi:10.1214/088342306000000060
35. Rudin C. Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nat Mach Intell. 2019;1(5):206–215. doi:10.1038/s42256-019-0048-x
36. Christodoulou E, Ma J, Collins GS, Steyerberg EW, Verbakel JY, Van Calster B. A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models. J Clin Epidemiol. 2019;110:12–22. doi:10.1016/j.jclinepi.2019.02.004
37. Nusinovici S, Tham YC, Chak Yan MY, et al. Logistic regression was as good as machine learning for predicting major chronic diseases. J Clin Epidemiol. 2020;122:56–69. doi:10.1016/j.jclinepi.2020.03.002
38. Salganik MJ, Lundberg I, Kindel AT, et al. Measuring the predictability of life outcomes with a scientific mass collaboration. Proc Natl Acad Sci. 2020;117(15):8398–8403. doi:10.1073/pnas.1915006117
39. Probst P, Bischl B, Boulesteix AL. Tunability: Importance of Hyperparameters of Machine Learning Algorithms. Published online October 22, 2018. doi:10.48550/arXiv.1802.09596
40. Raschka S. Model Evaluation, Model Selection, and Algorithm Selection in Machine Learning. Published online November 10, 2020. doi:10.48550/arXiv.1811.12808
41. Donoho D. 50 Years of Data Science. J Comput Graph Stat. 2017;26(4):745–766. doi:10.1080/10618600.2017.1384734
42. Bharel M, Bernson D, Averbach A. Using Data to Guide Action in Response to the Public Health Crisis of Opioid Overdoses. NEJM Catal. 2020;1(5). doi:10.1056/CAT.19.1118
43. Buitinck L, Louppe G, Blondel M, et al. API design for machine learning software: experiences from the scikit-learn project. Published online September 1, 2013. doi:10.48550/arXiv.1309.0238
44. Rue H, Martino S, Chopin N. Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations. J R Stat Soc Ser B Stat Methodol. 2009;71(2):319–392. doi:10.1111/j.1467-9868.2008.00700.x

Posted January 04, 2024.

Subject Area

Epidemiology
