A Systematic Review of Testing and Evaluation of Healthcare Applications of Large Language Models (LLMs)
View ORCID ProfileSuhana Bedi, View ORCID ProfileYutong Liu, View ORCID ProfileLucy Orr-Ewing, View ORCID ProfileDev Dash, View ORCID ProfileSanmi Koyejo, View ORCID ProfileAlison Callahan, View ORCID ProfileJason A. Fries, View ORCID ProfileMichael Wornow, View ORCID ProfileAkshay Swaminathan, View ORCID ProfileLisa Soleymani Lehmann, Hyo Jung Hong, View ORCID ProfileMehr Kashyap, Akash R. Chaurasia, Nirav R. Shah, Karandeep Singh, Troy Tazbaz, View ORCID ProfileArnold Milstein, View ORCID ProfileMichael A. Pfeffer, View ORCID ProfileNigam H. Shah
doi: https://doi.org/10.1101/2024.04.15.24305869
Suhana Bedi
1Stanford University
Yutong Liu
1Stanford University
Lucy Orr-Ewing
1Stanford University
Dev Dash
1Stanford University
Sanmi Koyejo
1Stanford University
Alison Callahan
1Stanford University
Jason A. Fries
1Stanford University
Michael Wornow
2Stanford
Akshay Swaminathan
1Stanford University
Lisa Soleymani Lehmann
3Harvard University
Hyo Jung Hong
1Stanford University
Mehr Kashyap
1Stanford University
Akash R. Chaurasia
1Stanford University
Nirav R. Shah
1Stanford University
Karandeep Singh
4University of California San Diego
Troy Tazbaz
5US Food and Drug Administration
Arnold Milstein
1Stanford University
Michael A. Pfeffer
1Stanford University
Nigam H. Shah
1Stanford University
Data Availability
All data produced in the present study are available upon reasonable request to the authors
Posted April 18, 2024.
A Systematic Review of Testing and Evaluation of Healthcare Applications of Large Language Models (LLMs)
Suhana Bedi, Yutong Liu, Lucy Orr-Ewing, Dev Dash, Sanmi Koyejo, Alison Callahan, Jason A. Fries, Michael Wornow, Akshay Swaminathan, Lisa Soleymani Lehmann, Hyo Jung Hong, Mehr Kashyap, Akash R. Chaurasia, Nirav R. Shah, Karandeep Singh, Troy Tazbaz, Arnold Milstein, Michael A. Pfeffer, Nigam H. Shah
medRxiv 2024.04.15.24305869; doi: https://doi.org/10.1101/2024.04.15.24305869
A Systematic Review of Testing and Evaluation of Healthcare Applications of Large Language Models (LLMs)
Suhana Bedi, Yutong Liu, Lucy Orr-Ewing, Dev Dash, Sanmi Koyejo, Alison Callahan, Jason A. Fries, Michael Wornow, Akshay Swaminathan, Lisa Soleymani Lehmann, Hyo Jung Hong, Mehr Kashyap, Akash R. Chaurasia, Nirav R. Shah, Karandeep Singh, Troy Tazbaz, Arnold Milstein, Michael A. Pfeffer, Nigam H. Shah
medRxiv 2024.04.15.24305869; doi: https://doi.org/10.1101/2024.04.15.24305869
Subject Area
Subject Areas
- Addiction Medicine (380)
- Allergy and Immunology (695)
- Anesthesia (186)
- Cardiovascular Medicine (2809)
- Dermatology (241)
- Emergency Medicine (424)
- Epidemiology (12499)
- Forensic Medicine (10)
- Gastroenterology (796)
- Genetic and Genomic Medicine (4364)
- Geriatric Medicine (398)
- Health Economics (711)
- Health Informatics (2813)
- Health Policy (1042)
- Hematology (372)
- HIV/AIDS (888)
- Medical Education (411)
- Medical Ethics (113)
- Nephrology (460)
- Neurology (4131)
- Nursing (219)
- Nutrition (613)
- Oncology (2178)
- Ophthalmology (616)
- Orthopedics (253)
- Otolaryngology (316)
- Pain Medicine (260)
- Palliative Medicine (80)
- Pathology (482)
- Pediatrics (1166)
- Primary Care Research (480)
- Public and Global Health (6720)
- Radiology and Imaging (1475)
- Respiratory Medicine (893)
- Rheumatology (427)
- Sports Medicine (359)
- Surgery (468)
- Toxicology (57)
- Transplantation (197)
- Urology (173)