Commentary

Evaluation and mitigation of the limitations of large language models in clinical decision-making.

September 11, 2024

Hager P, Jungmann F, Holland R, et al. Nat Med. 2024;30(9):2613-2622.

View more articles from the same authors.

Researchers, clinicians, and other stakeholders are hopeful that integration of artificial intelligence and large language models (LLMs) can improve patient safety and reduce clinician burden. This study used 2,400 real patient cases to test several LLM's ability to correctly diagnose common abdominal complaints. Each LLM performed significantly worse than physicians, did not follow treatment or diagnostic guidelines, could not interpret laboratory results, and often failed to follow instructions.

PubMed citation

Related Resources From the Same Author(s)

Predictive power of the "trigger tool" for the detection of adverse events in general surgery: a multicenter observational validation study.

March 9, 2022

Problem-based training improves recognition of patient hazards by advanced medical students during chart review: a randomized controlled crossover study.

May 7, 2014

Effects of night surgery on postoperative mortality and morbidity: a multicentre cohort study.

December 23, 2020

Seroprevalence of SARS-CoV-2 among frontline health care personnel in a multistate hospital network--13 academic medical centers, April-June 2020.

September 23, 2020

Usability and feasibility of consumer-facing technology to reduce unsafe medication use by older adults.

May 22, 2019

Meta-analysis: effect of interactive communication between collaborating primary care physicians and specialists.

March 3, 2010

Advancing the science of patient safety.

September 20, 2011

How does context affect interventions to improve patient safety? An assessment of evidence from studies of five patient safety practices and proposals for research.

July 22, 2011

The role of theory in research to develop and evaluate the implementation of patient safety practices.

April 22, 2011

What context features might be important determinants of the effectiveness of patient safety practice interventions?

June 29, 2011

View More

Related Resources

Stakeholder perspectives on contributors to delayed and inaccurate diagnosis of cardiovascular disease and their implications for digital health technologies: a UK-based qualitative study.

June 19, 2024

Artificial intelligence in the provision of health care: an American College of Physicians policy position paper.

June 12, 2024

Using artificial intelligence to improve primary care for patients and clinicians.

February 28, 2024

Implementing a safer and more reliable system to monitor test results at a teaching university-affiliated facility in a family medicine group: a quality improvement process report.

November 1, 2023

Patient and family contributions to improve the diagnostic process through the OurDX electronic health record tool: a mixed method analysis.

September 13, 2023

Challenges and strategies for patient safety in primary care: a qualitative study.

September 28, 2022

The critical role of health information technology in the safe integration of behavioral health and primary care to improve patient care.

November 10, 2021

COVID-19 and open notes: a new method to enhance patient safety and trust.

July 7, 2021

Using a patient internet portal to prevent adverse drug events: a randomized, controlled trial.

September 11, 2013

Processes for effective communication in primary care.

February 28, 2011

View More

See More About The Topic

Ambulatory Clinic or Office

Provider-Patient Communication

Technologic Approaches