RT Journal Article
SR Electronic
T1 Infusing behavior science into large language models for activity coaching
JF medRxiv
FD Cold Spring Harbor Laboratory Press
SP 2023.03.31.23287995
DO 10.1101/2023.03.31.23287995
A1 Vardhan, Madhurima
A1 Hegde, Narayan
A1 Nathani, Deepak
A1 Rosenzweig, Emily
A1 Karthikesalingam, Alan
A1 Seneviratne, Martin
YR 2023
UL http://medrxiv.org/content/early/2023/04/03/2023.03.31.23287995.abstract
AB Large language models (LLMs) have shown promise for task-oriented dialogue across a range of domains. The use of LLMs in health and fitness coaching is under-explored. Behavior science frameworks such as COM-B, which conceptualizes behavior change in terms of Capability (C), Opportunity (O) and Motivation (M), can be used to architect coaching interventions in a way that promotes sustained change. Here we aim to incorporate behavior science principles into an LLM using two knowledge infusion techniques: coach message priming (where exemplar coach responses are provided as context to the LLM), and dialogue re-ranking (where the COM-B category of the LLM output is matched to the inferred user need). Simulated conversations were conducted between the primed or unprimed LLM and a member of the research team, and then evaluated by 8 human raters. Ratings for the primed conversations were significantly higher in terms of empathy and actionability. The same raters also compared a single response generated by the unprimed, primed and re-ranked models, finding a significant uplift in actionability from the re-ranking technique. This is a proof of concept of how behavior science frameworks can be infused into automated conversational agents for a more principled coaching experience.
Institutional Review Board (IRB): The study does not involve human subjects beyond the volunteer annotators.
IRB approval was not sought for this research.
Competing Interest Statement: The authors have declared no competing interest.
Funding Statement: Google
Author Declarations:
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained. Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below: No human was involved in the conversations. Simulated conversation samples were generated from bard.google.com, based on https://dl.acm.org/doi/10.1145/3503252.3531301
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals. Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance). Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable. Yes
Conversation queries are made available in supplementary code. The code is also made publicly available at https://github.com/fitllm/classifiers