Real-World Evaluation of Large Language Models in Healthcare (RWE-LLM): A New Realm of AI Safety & Validation
Meenesh Bhimani, Alex Miller, Jonathan D. Agnew, Markel Sanz Ausin, Mariska Raglow-Defranco, Harpreet Mangat, Michelle Voisard, Maggie Taylor, Sebastian Bierman-Lytle, Vishal Parikh, Juliana Ghukasyan, Rae Lasko, Saad Godil, Ashish Atreja, Subhabrata Mukherjee
medRxiv 2025.03.17.25324157; doi: https://doi.org/10.1101/2025.03.17.25324157