Study

Hidden flaws behind expert-level accuracy of multimodal GPT-4 vision in medicine.

August 28, 2024

Jin Q, Chen F, Zhou Y, et al. NPJ Digit Med. 2024;7(1):190.

Numerous studies have evaluated the accuracy of Generative Pre-trained Transformer (GPT) models in answering text-only questions. This study appraises GPT-4 with Vision (GPT-4V), which analyzes images and text together. GPT-4V performed similarly to physicians on multiple-choice accuracy, but it gave flawed rationales even when it selected the correct answer.

Related Resources From the Same Author(s)

Evaluation and accurate diagnoses of pediatric diseases using artificial intelligence.

March 24, 2019

Optimizing Pediatric Patient Safety in the Emergency Care Setting.

October 19, 2022

Seroprevalence of SARS-CoV-2 among frontline health care personnel in a multistate hospital network--13 academic medical centers, April-June 2020.

September 23, 2020

Diagnostic assessment of deep learning algorithms for detection of lymph node metastases in women with breast cancer.

April 12, 2019

Prevalence and causes of diagnostic errors in hospitalized patients under investigation for COVID-19.

April 12, 2023

The practical implementation of artificial intelligence technologies in medicine.

January 30, 2019

A randomized trial of a multifactorial strategy to prevent serious fall injuries.

July 29, 2020

Harnessing in situ simulation to identify human errors and latent safety threats in adult tracheostomy care.

March 13, 2024

Supratherapeutic dosing of acetaminophen among hospitalized patients.

December 21, 2014

Unmasking bias in artificial intelligence: a systematic review of bias detection and mitigation strategies in electronic health record-based models.

April 17, 2024

Related Resources

Combining multiple large language models improves diagnostic accuracy.

November 20, 2024

Diagnosis in the Era of Digital Health and Artificial Intelligence: A Workshop.

July 25, 2024

Does an app a day keep the doctor away? AI symptom checker applications, entrenched bias, and professional responsibility.

June 26, 2024

Performance evaluation of ChatGPT in detecting diagnostic errors and their contributing factors: an analysis of 545 case reports of diagnostic errors.

June 19, 2024

Clinical reasoning of a generative artificial intelligence model compared with physicians.

April 17, 2024

Assessing the utility of ChatGPT throughout the entire clinical workflow: development and usability study.

September 13, 2023

Experimental evidence for structured information-sharing networks reducing medical errors.

August 9, 2023

Automated identification of diagnostic labelling errors in medicine.

June 8, 2022

Use of a structured approach and virtual simulation practice to improve diagnostic reasoning.

February 23, 2022

Comparative accuracy of diagnosis by collective intelligence of multiple physicians vs individual physicians.

March 13, 2019

See More About The Topic

Clinical Misdiagnosis

Technologic Approaches
