Cited 0 times in 
Cited 0 times in 
Leveraging multimodal large language model chatbots in oral radiology: a comprehensive evaluation using questions from a Korean dental university
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Jeong, Hui | - |
| dc.contributor.author | Jeon, Kug Jin | - |
| dc.contributor.author | Lee, Chena | - |
| dc.contributor.author | Choi, Yoon Joo | - |
| dc.contributor.author | Jo, Gyu-Dong | - |
| dc.contributor.author | Han, Sang-Sun | - |
| dc.date.accessioned | 2026-01-22T02:31:02Z | - |
| dc.date.available | 2026-01-22T02:31:02Z | - |
| dc.date.created | 2026-01-16 | - |
| dc.date.issued | 2025-12 | - |
| dc.identifier.issn | 0250-832X | - |
| dc.identifier.uri | https://ir.ymlib.yonsei.ac.kr/handle/22282913/210159 | - |
| dc.description.abstract | Objectives This study aimed to conduct a comprehensive evaluation of general-purpose multimodal large language model (LLM) chatbots in oral radiology. Methods Ninety text- and image-based oral radiology questions from a Korean dental university were extracted and categorized into six educational contents and two question types. ChatGPT-4o and Gemini 2.0 Flash were evaluated with following items: accuracy with group differences across six contents (using Fisher's exact test with Bonferroni correction, P < .0167), answer consistency across ten repeated outputs (evaluated as the mean agreement and Fleiss' kappa coefficient), and hallucination (evaluated as the mean of the 5-point Global Quality Score assigned by two oral radiologists). Results Multimodal LLM chatbots (ChatGPT-4o and Gemini 2.0 Flash) achieved excellent performance on text-based questions with over 80% accuracy but showed limited performance on image-based tasks, with accuracy under 30%. Additionally, image-based tasks exhibited high response variability, and hallucinations were frequently observed, providing incorrect information. These findings suggest that AI chatbots are not yet suitable for reliable use in oral radiology. Conclusions This study provided timely insights into the capabilities and limitations of general-purpose multimodal LLM chatbots in the oral radiology, and will serve as a foundation for more safe and effective applications of AI chatbots in the oral radiology field in the future. Advances in knowledge This is the first study to comprehensively assess multimodal LLM chatbots in oral radiology. It provides key insights into the performance benchmarks for AI chatbots in oral radiology, promoting the responsible and transparent integration of AI into dental education. | - |
| dc.language | English | - |
| dc.publisher | British Institute of Radiology | - |
| dc.relation.isPartOf | DENTOMAXILLOFACIAL RADIOLOGY | - |
| dc.relation.isPartOf | DENTOMAXILLOFACIAL RADIOLOGY | - |
| dc.title | Leveraging multimodal large language model chatbots in oral radiology: a comprehensive evaluation using questions from a Korean dental university | - |
| dc.type | Article | - |
| dc.contributor.googleauthor | Jeong, Hui | - |
| dc.contributor.googleauthor | Jeon, Kug Jin | - |
| dc.contributor.googleauthor | Lee, Chena | - |
| dc.contributor.googleauthor | Choi, Yoon Joo | - |
| dc.contributor.googleauthor | Jo, Gyu-Dong | - |
| dc.contributor.googleauthor | Han, Sang-Sun | - |
| dc.identifier.doi | 10.1093/dmfr/twaf083 | - |
| dc.relation.journalcode | J00704 | - |
| dc.identifier.eissn | 1476-542X | - |
| dc.identifier.pmid | 41386253 | - |
| dc.identifier.url | https://academic.oup.com/dmfr/advance-article/doi/10.1093/dmfr/twaf083/8378392 | - |
| dc.subject.keyword | oral radiology | - |
| dc.subject.keyword | multimodal large language model | - |
| dc.subject.keyword | accuracy | - |
| dc.subject.keyword | answer consistency | - |
| dc.subject.keyword | hallucination | - |
| dc.contributor.affiliatedAuthor | Jeong, Hui | - |
| dc.contributor.affiliatedAuthor | Jeon, Kug Jin | - |
| dc.contributor.affiliatedAuthor | Lee, Chena | - |
| dc.contributor.affiliatedAuthor | Choi, Yoon Joo | - |
| dc.contributor.affiliatedAuthor | Jo, Gyu-Dong | - |
| dc.contributor.affiliatedAuthor | Han, Sang-Sun | - |
| dc.identifier.wosid | 001644509600001 | - |
| dc.identifier.bibliographicCitation | DENTOMAXILLOFACIAL RADIOLOGY, 2025-12 | - |
| dc.identifier.rimsid | 91015 | - |
| dc.type.rims | ART | - |
| dc.description.journalClass | 1 | - |
| dc.description.journalClass | 1 | - |
| dc.subject.keywordAuthor | oral radiology | - |
| dc.subject.keywordAuthor | multimodal large language model | - |
| dc.subject.keywordAuthor | accuracy | - |
| dc.subject.keywordAuthor | answer consistency | - |
| dc.subject.keywordAuthor | hallucination | - |
| dc.type.docType | Article; Early Access | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalWebOfScienceCategory | Dentistry, Oral Surgery & Medicine | - |
| dc.relation.journalWebOfScienceCategory | Radiology, Nuclear Medicine & Medical Imaging | - |
| dc.relation.journalResearchArea | Dentistry, Oral Surgery & Medicine | - |
| dc.relation.journalResearchArea | Radiology, Nuclear Medicine & Medical Imaging | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.