ABSTRACT
Background There are concerns about the balance of benefits and harms in people using opioids for chronic non-cancer pain. Technologies using artificial intelligence (AI) are being developed to examine and optimise the use of opioids. Yet, this research has not been synthesised to determine the types of AI models being developed and the application of these models.
Methods We aimed to synthesise studies exploring the use of AI in people taking opioids. We searched three databases: the Cochrane Database of Systematic Reviews, EMBASE, and Medline, on 4 January 2021. Studies were included if they were published after 2010, conducted in a real-life community setting involving humans, and used AI to understand opioid use. Data on the types and applications of AI models were extracted and descriptively analysed.
Results Eighty-one articles were included in our review, representing over 5.3 million participants and 14.6 million social media posts. Most (93%) studies were conducted in the USA. The types of AI technologies included natural language processing (46%) and a range of machine learning algorithms, the most common being random forest algorithms (36%). AI was predominately applied for the surveillance and monitoring of opioids (46%), followed by risk prediction (42%), pain management (10%), and patient support (2%). Few of the AI models were ready for adoption, with most (62%) being in preliminary stages.
Conclusions A variety of AI models are being developed and applied to understand opioid use. However, there is a need for these AI technologies to be externally validated and robustly evaluated to determine whether they can improve the use and safety of opioids.
Key Points
Key Points Across the landscape of opioid research, natural language processing models (46%) and ensemble algorithms, particularly random forest algorithms (36%), were the most common types of AI technologies studied.
There were four main domains that AI was applied to assess the use of opioids, including surveillance and monitoring (46%), risk prediction (42%), pain management (10%), and patient support (2%).
The AI technologies were at various stages of development, validation, and deployment, with most (62%) models in preliminary stages, 11% required external validation, and few models were openly available to access (6%).
INTRODUCTION
In the USA, 128 lives were lost every day to opioid overdoses in 2018.[1] Such opioid-related deaths, widely described as the US opioid epidemic, were linked to the increased prescribing of opioids, opioid misuse, and the transition to illicit substances. In the UK, six deaths due to a drug overdose involved an opioid in 2018.[2] The prescribing of opioids more than doubled in England between 1998 and 2016.[3] Thus, interventions are being developed to examine and improve the use of opioids.
Several approaches have been trialled to improve the use of opioids with partial success. This has included: educational resources;[4] non-pharmacological therapies, for example, cognitive behavioural therapy, hypnosis, relaxation techniques, mindfulness, acupuncture, and exercise[5]); the monitoring of prescribing data,[6] and toolkits to support the review and safe reduction of opioids.[7] However, technological advances could help streamline such approaches to improve the use of opioids.
The use of artificial intelligence (AI) technologies in preventative health and medicines optimisation are gaining traction. For example, AI is being used to predict sudden death in heart failure patients and support the selection of appropriate treatments.[8] A review on the use of AI interventions to aid in opioid use disorders found 29 unique interventions.[9] However, this review only examined the grey literature. Thus, the types of AI reported in peer-reviewed literature and across other aspects of opioid research are unknown.
The UK published a National AI strategy in September 2021 [10] and is encouraging the use of AI to drive digital transformation across the National Health Service (NHS) [11]. NHS organisations (e.g. NHSX) are supporting the acceleration of AI technologies through financial awards[12] and the UK’s overprescribing review has recommended the commission of digital tools to tackle and reduce overprescribing.[13] However, there are few reviews that examine the use of AI to inform such policies. Therefore, the aim of this review was to explore the use of AI technologies across the landscape of opioid research.
METHODS
We designed a narrative review to understand how AI technologies have been used, applied, and implemented in research on opioid use.
Search strategy
An information specialist designed and ran the search strategy in three databases: the Cochrane Database of Systematic Reviews, EMBASE, and Medline. Search terms relating to “opioids” and “artificial intelligence” were included (see Table S1 in Appendix 1 for the complete list of terms). The search was initially performed on 27 January 2020 and updated on 4 January 2021. We also searched Google Scholar on 18 November 2020.
Eligibility criteria
Studies were included if they were published after 2010, had been conducted in a real-life setting in human beings, and tested a form of AI to optimise or understand opioid use. AI was defined as ‘computer systems that can perform tasks normally requiring human intelligence’,[14] which could include any form of machine learning, deep learning, neural networks, and natural language processing. Studies were not restricted by outcomes or settings, and conference abstracts were included to capture all emerging research. Studies were excluded if they were not published in English, did not specifically relate to both opioids and AI were conducted outside of a real-life setting, for example, in research settings exploring genes, receptor subtypes and modulators, and were not original research. Editorials and commentary were excluded.
Study selection
Titles and abstracts were screened independently using the prespecified eligibility criteria by one review author (SG), followed by the full-text articles. Where the conference abstract and full article were both available, the full article was included.
Data extraction
One review author (SG) extracted data from included studies into a predeveloped spreadsheet. This included: year of study and author names; country; study design; study population and data source; sample size; technology investigated; area of application; outcomes; and stage of development.
Data analysis
The findings were descriptively synthesised by identifying areas of commonality in terms of the AI technology used and the area of application. The types of AI technology were classified based on the method described by Brownlee in the Tour of Machine Learning Algorithms.[15] The stages of development were defined based on reported findings in the studies and categorised into seven groups: preliminary research; model development required; model development planned; external validation required; prototype for scale-up developed; local implementation; openly available.
RESULTS
We screened 389 titles and abstracts and 118 full texts for eligibility (Figure 1). There were 81 studies that met our eligibility criteria and were included in the review (Table 1). Of the 81 studies, 18 were conference abstracts, which are summarised in Table S2 Appendix 2.
The included studies represented over 5.3 million participants and 14.6 million social media posts. The majority (93%, n=75) of studies were conducted in the USA with the remainder being performed in Bangladesh (n=1), Bulgaria (n=1), Germany (n=1), India (n=1), Israel (n=1) and Italy (n=1). Of the published articles (n=63), most studies used observational designs, including cohort (54%), retrospective observational (22%), case-control (8%), prognostic (6%), and cross-sectional (5%). One study was a case series, one was a pilot study using mixed methods, and there was one retrospective infovelliance study (Table 1). The main sources of data for testing AI models were medical records and claims databases (54%) and social media (20%) (Figure 2).
The areas where AI technology was being applied and tested broadly fell into four distinct categories: risk prediction of outcomes such as opioid use disorder, dependence, or overdose (42%, n= 34); surveillance and monitoring of activity or consequences such as misuse, respiratory depression, and HIV outbreaks (46%, n=37); pain management (10%, n=8); and patient support technology (2%, n=2) (Appendix 3). For studies that focused on risk prediction (n= 34), 18% specifically looked at prolonged opioid use following surgery (Table 2).
Ensemble algorithms (59%, n=48), particularly random forest algorithms (36%, n=29) and natural language processing models (46%, n=37) were the most common types of AI technology researched (Table 3). In terms of efficacy measures, several studies (43%, n=35) used the area under the receiver-operating characteristic (AUROC) curve. Other efficacy measures used included the macro averaged F1 score, Brier score, positive predictive value, and negative predictive value.
The AI models included in the review were at various stages of development, validation, and deployment. The majority (62%, n=50) were at the preliminary stage, 11% (n=9) required external validation, few models were openly available to access (6%, n=5) (Figure 3).
DISCUSSION
We identified 81 studies that tested AI in people using opioids. Most research was from the USA, with no studies reported in the UK. There were various types of AI models being used, including a range of machine learning algorithms and models using natural language processing. The majority of included studies examined the use of AI in risk prediction and surveillance and monitoring, two areas that could have significant patient safety benefits to prevent long-term opioid use and opioid dependence.
Most of the studies reviewed focused on developing AI technology to support the identification of factors that could predict the increased risk of developing an adverse opioid-related outcome. The need for pre-operative identification of people at risk of sustained opioid use following surgery has been recommended by experts.[79] Thus, this is a potential gap that AI technology could fill if studies were robustly designed.
A commonly reported predictive factor across nearly all studies on risk prediction was the concomitant use of sedative and anxiolytic medication such as benzodiazepines. An evidence review by Public Health England identified an increased risk of long-term opioid use in people who had previously used or were currently using benzodiazepines.[80] A systematic review into factors associated with high-dose opioids reported that people co-prescribed benzodiazepines is a high priority area for targeted interventions and coordinated strategies.[81]
Progress of AI development in detecting illegal use also has the potential to significantly reduce the number of opioid-related deaths related to misuse. However, to enable effective interventions to be developed in this area also requires localised real-time intelligence. AI technology focused on detecting illegal combined with real-time intelligence could support agencies that are currently working in this area to target activity at the time it is detected.
Guidelines and educational resources have been developed that support clinicians with pain management and highlight the problems associated with long term opioid treatment.[4,82] To date, these have had limited impact on improving opioid prescribing, with prescribing of high doses of opioids in some areas showing an increasing trend.[6] However, AI technology to improve pain management used in combination with educational resources and guidelines could be a way to prevent unnecessary escalation and long-term use of opioid treatment.
Beaulieu and colleagues have reviewed the grey literature on opioid use disorder and AI interventions, identifying 29 unique interventions.[9] Our narrative review expanded on this research to assess the AI technology across all areas of opioid research and includes a wider scope of literature. Similar to our review, Beaulieu and colleagues found a lack of scientific evaluation,[9] which was also highlighted by Hossain and colleagues in their conference abstract on AI in opioid research and practice.[83] The literature on AI models for diagnosing ischemic stroke has also illustrated the variation in measuring efficacy and the need for standardisation. [84,85]
Strengths and limitations
Conference abstracts were included in our review to capture emerging research. However, we did not follow up on further development and validation of models beyond the reported findings in the publication. Post-publication, several of the models may have undergone further development and external validation and may now be available publicly. Thus, we were limited by the reporting of information in included studies and the infancy of AI research. In particular, it was difficult to evaluate the performance of AI models as various efficacy measures were used.
It was challenging to find a comprehensive way to classify the various types of AI technologies being researched. Various systems have been described that include classification by complexity,[86] learning style and algorithm similarity.[15] The method described by Brownlee that classified AI technology according to similarity was chosen for the review as it clearly described which algorithms fell into each category.[15] Using this system, it was found that natural language processing models and the random forest algorithm were the most commonly researched AI technology used by the studies.
Our review only included studies published in English, which could exclude published research conducted in non-English speaking countries. Finally, we conducted a narrative review that did not involve a quality assessment of the included studies. However, nearly all the studies included in this review were observational. Thus, high-quality research is required to test the efficacy and effectiveness of AI technologies in people using or receiving opioids.
Implications
Public Health initiatives have focused on identifying and addressing people that are taking opioids long-term or dependent on opioids,[80,87] yet this process has not been systematically embedded in clinical practice. Preventative interventions involving AI early in the care pathway could improve the systematic nature of such initiatives and reduce the number of people taking long-term opioids or dependant. AI technology could also support intelligence to reduce illegal and illicitly available opioids, to identify the prevalence and local hot spots to target interventions. However, before the adoption of AI models in clinical practice, future research should be conducted to standardise methods and determine whether AI models are superior to current initiatives in clinical practice.
To conduct AI research, large datasets are required. Hence, we found that electronic health records and social media posts were most often used. Access to good quality data is recognised as a current barrier and vital enabler in the UK National AI strategy.[10] The strategy makes recommendations to review datasets and their availability that would accelerate the development of AI models. These datasets are available in the NHS, however, guidance on their safe use is urgently required.
The increased use of AI technology is a key recommendation in the NHS Long-Term Plan,[11] thus funding should be allocated to conduct randomised control trials and prospective studies in UK healthcare settings. To enable validation and implementation of AI technology into care pathways, collaboration between many stakeholders is required, including developers, healthcare organisations, clinicians, and patients. National guidance on the development, testing, and implementation of AI technologies would standardise such processes and help organisations ensure patients are protected.
Conclusions
Various AI technologies have been applied to several areas of opioid use, yet this research is still in its infancy. The effectiveness of AI technologies in reducing opioid use and harms cannot be determined until robust randomised and prospective studies are conducted. Therefore, there is a clear need for these AI models to be validated and robustly evaluated. To facilitate the spread and adoption of innovation in this area, collaboration of organisations, developers, funders, researchers, prescribers, and patients is required.
Data Availability
All data produced in the present work are contained in the manuscript and appendices
DISCLOSURE OF INTERESTS
GCR was financially supported by the National Institute of Health Research (NIHR) School for Primary Care Research (SPCR), the Naji Foundation, and the Rotary Foundation to study for a Doctor of Philosophy (DPhil/PhD) at the University of Oxford (2017-2020). GCR is an Associate Editor of BMJ Evidence Based Medicine. No other study authors have interest to disclose.
ACKNOWLEDGEMENTS
We acknowledge Anne Gray, Knowledge Specialist at Oxford Academic Health Science Network, for carrying out the literature search.
APPENDIX 1.
APPENDIX 2.
APPENDIX 3. Detailed summary of the four areas of AI application in opioid research
1: Risk prediction
Most of the studies reviewed focused on developing AI technology to support identification of factors that could predict the increased risk of developing an adverse opioid related outcome. The aim of this research being either to identify risks early and prevent resultant issues or, as a method of classification for opioid-dependent and long-term users. In this category, most of the AI tools researched focused on the development of prolonged opioid use following surgery. A range of factors were found to be predictive of prolonged use following surgery and included age; marital status; preoperative opioid use, medication, and haemoglobin; tobacco use; comorbidity of depression or diabetes and instrumentation. Other adverse outcomes explored were the risk of dependence, abuse, and overdose.
Some of the AI technology developed in this category was at a more advanced level of development with several researchers publishing their tools online as open access. However, to progress the validation and deployment of these at pace and scale within individual healthcare settings requires facilitation and guidance at a national level.
2: Surveillance and monitoring
Studies in this category investigated AI technology to improve surveillance and monitoring of misuse and illegal selling and to detect consequences that could result from opioid misuse. Most of the AI models in this category used natural language processing technology. The models ranged in their stage of development from preliminary research through to being available online as open access.
AI technology developed aimed to gather intelligence to support public health surveillance and prevent adverse consequences of opioid use. Adverse clinical consequences studied included HIV outbreaks triggered by opioid abuse and transition to injection drug use, suicidality, opioid overdose in opioid users from their social media posts, opioid-induced respiratory depression, and opioid induced constipation. In some of these studies the clinical consequence of opioid use had resulted in avoidable utilisation of healthcare resources, for example misdiagnosis of the cause of abdominal pain resulting in unnecessary surgery.[52]
Some studies in this category researched AI technology to classify different subgroups, estimate prevalence and identify the scale and location of illegal online selling. Illegal use of opioids, both selling and individual use, is a difficult area to tackle.
3: Pain management
Studies in this category explored pain management in various patient cohorts including adolescents from minority backgrounds and patients with depression concomitantly prescribed an antidepressant. Research also focused on patient characteristics that could determine opioid requirements post-surgery. AI models in this category ranged in their stage of development with some being at the preliminary stages of research, others required external validation, and some were available online as open access.
4: Patient support technology
This was the group with the least number of studies. Both studies in this category used smart phone technology. One study used a random forest algorithm to predict opioid craving or stress in the user through their movement as assessed by GPS.[77] The other study tested an AI enabled peer support platform that patients with OUD could use to support their recovery.[78] In both cases, further development of the model was being planned.