Cited 0 times in

Efficacy of large language models and their potential in Obstetrics and Gynecology education

Authors
 Kyung Jin Eoh  ;  Gu Yeun Kwon  ;  Eun Jin Lee  ;  JoonHo Lee  ;  Inha Lee  ;  Young Tae Kim  ;  Eun Ji Nam 
Citation
 Obstetrics & Gynecology Science, Vol.67(6) : 550-556, 2024-11 
Journal Title
Obstetrics & Gynecology Science
ISSN
 2287-8572 
Issue Date
2024-11
Keywords
Artificial intelligence ; Gynecology ; Medical education ; Obstetrics
Abstract
Objective: The performance of large language models (LLMs) and their potential utility in obstetric and gynecological education are topics of ongoing debate. This study aimed to contribute to this discussion by examining the recent advancements in LLM technology and their transformative potential in artificial intelligence.

Methods: This study assessed the performance of generative pre-trained transformer (GPT)-3.5 and -4 in understanding clinical information, as well as its potential implications for obstetric and gynecological education. Obstetrics and gynecology residents at three hospitals underwent an annual promotional examination, from which 116 of the 170 questions over 4 years (2020-2023) were analyzed, excluding 54 questions with images. The scores achieved by GPT-3.5, -4, and the 100 residents were compared.

Results: The average scores across all 4 years for GPT-3.5 and -4 were 38.79 (standard deviation [SD], 5.65) and 79.31 (SD, 3.67), respectively. For groups first-year resident, second-year resident, and third-year resident, the cumulative annual average scores were 79.12 (SD, 9.00), 80.95 (SD, 5.86), and 83.60 (SD, 6.82), respectively. No statistically significant differences were observed between the scores of GPT-4.0 and those of the residents. When analyzing questions specific to obstetrics, the average scores for GPT-3.5 and -4.0 were 33.44 (SD, 10.18) and 90.22 (SD, 7.68), respectively.

Conclusion: GPT-4 demonstrated exceptional performance in obstetrics, different types of data interpretation, and problem solving, showcasing the potential utility of LLMs in these areas. However, acknowledging the constraints of LLMs is crucial and their utilization should augment human expertise and discernment.
Files in This Item:
T202407579.pdf Download
DOI
10.5468/ogs.24211
Appears in Collections:
1. College of Medicine (의과대학) > Dept. of Obstetrics and Gynecology (산부인과학교실) > 1. Journal Papers
Yonsei Authors
Kim, Young Tae(김영태) ORCID logo https://orcid.org/0000-0002-7347-1052
Nam, Eun Ji(남은지) ORCID logo https://orcid.org/0000-0003-0189-3560
Eoh, Kyung Jin(어경진) ORCID logo https://orcid.org/0000-0002-1684-2267
Lee, Inha(이인하) ORCID logo https://orcid.org/0000-0003-4869-6281
Lee, Joon Ho(이준호)
URI
https://ir.ymlib.yonsei.ac.kr/handle/22282913/201576
사서에게 알리기
  feedback

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Browse

Links