Large Language Models Versus Human Readers in CAD-RADS 2.0 Categorization of Coronary CT Angiography Reports

DC Field Value Language
dc.contributor.authorYoo, Won-Seok-
dc.contributor.authorSon, Jinwoo-
dc.contributor.authorKim, Jin Young-
dc.contributor.authorPark, Jun Hye-
dc.contributor.authorPark, Hee Jun-
dc.contributor.authorKim, Cherry-
dc.contributor.authorChoi, Byoung Wook-
dc.contributor.authorSuh, Young Joo-
dc.contributor.author손진우-
dc.date.accessioned2026-01-19T02:57:23Z-
dc.date.available2026-01-19T02:57:23Z-
dc.date.created2026-01-02-
dc.date.issued2025-10-
dc.identifier.issn2948-2925-
dc.identifier.urihttps://ir.ymlib.yonsei.ac.kr/handle/22282913/209922-
dc.description.abstractThis study evaluated the accuracy of large language models (LLMs) in assigning Coronary Artery Disease Reporting and Data System (CAD-RADS) 2.0 categories and modifiers based on real-world coronary CT angiography (CCTA) reports and compared their accuracy with human readers. From 2752 eligible CCTA reports generated at an academic hospital between January and September 2024, 180 were randomly selected to fit a balanced distribution of categories and modifiers. The reference standard was established by consensus between two expert cardiac radiologists with 15 and 14 years of experience, respectively. Four LLMs (O1, GPT-4o, GPT-4, GPT-3.5-turbo) and four human readers (a cardiac radiologist, a fellow, two residents) independently assigned CAD-RADS categories and modifiers for each report. For LLMs, the input prompt consisted of the report and a summary of CAD-RADS 2.0. The accuracy of evaluators in full CAD-RADS categorization was compared with O1 using McNemar tests. O1 demonstrated the highest accuracy (90.7%) in full CAD-RADS categorization, outperforming GPT-4o (73.8%), GPT-4 (59.7%), GPT-3.5-turbo (25.8%), the fellow (83.3%), and resident 1 (83.3%; all P-values ≤ 0.01). However, there was no significant difference in accuracy when compared to the cardiac radiologist (86.1%; P = 0.12) and resident 2 (89.4%; P = 0.68). Processing time per report ranged from 1.34 to 16.61 s for LLMs, whereas human readers required 32.10 to 55.06 s. In the external validation dataset (n = 327) derived from two independent institutions, O1 achieved 95.7% accuracy for full CAD-RADS categorization. In conclusion, compared to human readers, O1 exhibited similar or higher accuracy and shorter processing times to produce a full CAD-RADS 2.0 categorization based on CCTA reports.-
dc.languageEnglish-
dc.publisherSpringer Nature-
dc.relation.isPartOfJOURNAL OF IMAGING INFORMATICS IN MEDICINE-
dc.titleLarge Language Models Versus Human Readers in CAD-RADS 2.0 Categorization of Coronary CT Angiography Reports-
dc.typeArticle-
dc.contributor.googleauthorYoo, Won-Seok-
dc.contributor.googleauthorSon, Jinwoo-
dc.contributor.googleauthorKim, Jin Young-
dc.contributor.googleauthorPark, Jun Hye-
dc.contributor.googleauthorPark, Hee Jun-
dc.contributor.googleauthorKim, Cherry-
dc.contributor.googleauthorChoi, Byoung Wook-
dc.contributor.googleauthorSuh, Young Joo-
dc.identifier.doi10.1007/s10278-025-01704-2-
dc.relation.journalcodeJ04610-
dc.identifier.eissn2948-2933-
dc.identifier.pmid41055832-
dc.identifier.urlhttps://link.springer.com/article/10.1007/s10278-025-01704-2-
dc.subject.keywordLarge language models-
dc.subject.keywordCoronary computed tomography angiography-
dc.subject.keywordCoronary artery disease reporting and data system-
dc.subject.keywordRadiologists-
dc.contributor.affiliatedAuthorYoo, Won-Seok-
dc.contributor.affiliatedAuthorSon, Jinwoo-
dc.contributor.affiliatedAuthorKim, Jin Young-
dc.contributor.affiliatedAuthorPark, Jun Hye-
dc.contributor.affiliatedAuthorPark, Hee Jun-
dc.contributor.affiliatedAuthorChoi, Byoung Wook-
dc.contributor.affiliatedAuthorSuh, Young Joo-
dc.identifier.scopusid2-s2.0-105018322772-
dc.identifier.wosid001588315900001-
dc.identifier.bibliographicCitationJOURNAL OF IMAGING INFORMATICS IN MEDICINE, 2025-10-
dc.identifier.rimsid90614-
dc.type.rimsART-
dc.description.journalClass1-
dc.subject.keywordAuthorLarge language models-
dc.subject.keywordAuthorCoronary computed tomography angiography-
dc.subject.keywordAuthorCoronary artery disease reporting and data system-
dc.subject.keywordAuthorRadiologists-
dc.subject.keywordPlusEXPERT CONSENSUS DOCUMENT-
dc.subject.keywordPlusCOMPUTED-TOMOGRAPHY SCCT-
dc.subject.keywordPlusARTERY-DISEASE-
dc.subject.keywordPlusAMERICAN-COLLEGE-
dc.subject.keywordPlusRADIOLOGY ACR-
dc.subject.keywordPlusDATA SYSTEM-
dc.subject.keywordPlusSOCIETY-
dc.subject.keywordPlusGUIDE-
dc.type.docTypeArticle; Early Access-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalWebOfScienceCategoryRadiology, Nuclear Medicine & Medical Imaging-
dc.relation.journalResearchAreaRadiology, Nuclear Medicine & Medical Imaging-
Appears in Collections:
1. College of Medicine (의과대학) > Dept. of Radiology (영상의학교실) > 1. Journal Papers
