ABSTRACT
Digital twins, computational representations of individuals or systems, offer promising applications in the intensive care unit (ICU) by enhancing decision-making and reducing cognitive load. We developed digital twins using a large language model (LLM), LLaMA-3, fine-tuned with Low-Rank Adapters (LoRA) on physician notes from different ICU specialties in the MIMIC-III dataset. This study hypothesizes that specialty-specific training improves treatment recommendation accuracy compared to training on other ICU specialties. Additionally, we evaluated a zero-shot baseline model, which relied solely on contextual instructions without training. Discharge summaries were analyzed, and medications were masked to create datasets for model training and testing. The medical ICU dataset (1,000 notes) was used for evaluation, and performance was measured using BERTScore and ROUGE-L. LLMs trained on medical ICU notes achieved the highest BERTScore (0.842), outperforming models trained on other specialties or mixed datasets, while untrained zero-shot models showed the lowest performance. These results underscore the value of context-specific training for digital twins, offering foundational insights into LLMs for personalized clinical decision support.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This study received funding from the National Heart, Lung, and Blood Institute, United States, under Grant ID NIH 1R01HL157262, and the U.S. National Library of Medicine, United States, under Grant ID NIH R01 LM012973.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
Loyola University Chicago Institutional Review Board (IRB) reviewed and determined that the research project titled 'Learning Patient Representations from Electronic Health Records' is exempt from IRB oversight requirements, according to 45 CFR 46.101, as of February 15, 2024.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
The Chan Zuckerberg Initiative, Cold Spring Harbor Laboratory, the Sergey Brin Family Foundation, California Institute of Technology, Centre National de la Recherche Scientifique, Fred Hutchinson Cancer Center, Imperial College London, Massachusetts Institute of Technology, Stanford University, University of Washington, and Vrije Universiteit Amsterdam.