Sorry, you need to enable JavaScript to visit this website.
Skip to main content
Study

Accuracy of a proprietary large language model in labeling obstetric incident reports.

Johnson J, Brown C, Lee GM, et al. Accuracy of a proprietary large language model in labeling obstetric incident reports. Jt Comm J Qual Patient Saf. 2024;Epub Aug 6. doi:10.1016/j.jcjq.2024.08.001.

Save
Print
September 25, 2024
Johnson J, Brown C, Lee GM, et al. Jt Comm J Qual Patient Saf. 2024;Epub Aug 6.
View more articles from the same authors.

Voluntary incident reporting is an important resource for identifying adverse events and near misses, but the volume of reports can pose challenges. This study used the large language model (LLM) ChatGPT-3.5 in a secure environment to label a sample of obstetric incident reports (e.g., neonatal resuscitation supplies, lactation support). Compared with the human-assigned labels—the gold standard—ChatGPT demonstrated high sensitivity and specificity.

Save
Print
Cite
Citation

Johnson J, Brown C, Lee GM, et al. Accuracy of a proprietary large language model in labeling obstetric incident reports. Jt Comm J Qual Patient Saf. 2024;Epub Aug 6. doi:10.1016/j.jcjq.2024.08.001.