Large Language Models Versus Human Readers in CAD-RADS 2.0 Categorization of Coronary CT Angiography Reports

Yoo, Won-Seok; Son, Jinwoo; Kim, Jin Young; Park, Jun Hye; Park, Hee Jun; Kim, Cherry; Choi, Byoung Wook; Suh, Young Joo

doi:10.1007/s10278-025-01704-2

YUHSpace

BROWSE

0 78

Cited 2 times in

Large Language Models Versus Human Readers in CAD-RADS 2.0 Categorization of Coronary CT Angiography Reports

DC Field	Value	Language
dc.contributor.author	Yoo, Won-Seok	-
dc.contributor.author	Son, Jinwoo	-
dc.contributor.author	Kim, Jin Young	-
dc.contributor.author	Park, Jun Hye	-
dc.contributor.author	Park, Hee Jun	-
dc.contributor.author	Kim, Cherry	-
dc.contributor.author	Choi, Byoung Wook	-
dc.contributor.author	Suh, Young Joo	-
dc.contributor.author	손진우	-
dc.date.accessioned	2026-01-19T02:57:23Z	-
dc.date.available	2026-01-19T02:57:23Z	-
dc.date.created	2026-01-02	-
dc.date.issued	2025-10	-
dc.identifier.issn	2948-2925	-
dc.identifier.uri	https://ir.ymlib.yonsei.ac.kr/handle/22282913/209922	-
dc.description.abstract	This study evaluated the accuracy of large language models (LLMs) in assigning Coronary Artery Disease Reporting and Data System (CAD-RADS) 2.0 categories and modifiers based on real-world coronary CT angiography (CCTA) reports and compared their accuracy with human readers. From 2752 eligible CCTA reports generated at an academic hospital between January and September 2024, 180 were randomly selected to fit a balanced distribution of categories and modifiers. The reference standard was established by consensus between two expert cardiac radiologists with 15 and 14 years of experience, respectively. Four LLMs (O1, GPT-4o, GPT-4, GPT-3.5-turbo) and four human readers (a cardiac radiologist, a fellow, two residents) independently assigned CAD-RADS categories and modifiers for each report. For LLMs, the input prompt consisted of the report and a summary of CAD-RADS 2.0. The accuracy of evaluators in full CAD-RADS categorization was compared with O1 using McNemar tests. O1 demonstrated the highest accuracy (90.7%) in full CAD-RADS categorization, outperforming GPT-4o (73.8%), GPT-4 (59.7%), GPT-3.5-turbo (25.8%), the fellow (83.3%), and resident 1 (83.3%; all P-values <= 0.01). However, there was no significant difference in accuracy when compared to the cardiac radiologist (86.1%; P = 0.12) and resident 2 (89.4%; P = 0.68). Processing time per report ranged 1.34-16.61 s for LLMs, whereas human readers required 32.10-55.06 s. In the external validation dataset (n = 327) derived from two independent institutions, O1 achieved 95.7% accuracy for full CAD-RADS categorization. In conclusion, compared to human readers, O1 exhibited similar or higher accuracy and shorter processing times to produce a full CAD-RADS 2.0 categorization based on CCTA reports.	-
dc.language	English	-
dc.publisher	Springer Nature	-
dc.relation.isPartOf	JOURNAL OF IMAGING INFORMATICS IN MEDICINE	-
dc.relation.isPartOf	JOURNAL OF IMAGING INFORMATICS IN MEDICINE	-
dc.title	Large Language Models Versus Human Readers in CAD-RADS 2.0 Categorization of Coronary CT Angiography Reports	-
dc.type	Article	-
dc.contributor.googleauthor	Yoo, Won-Seok	-
dc.contributor.googleauthor	Son, Jinwoo	-
dc.contributor.googleauthor	Kim, Jin Young	-
dc.contributor.googleauthor	Park, Jun Hye	-
dc.contributor.googleauthor	Park, Hee Jun	-
dc.contributor.googleauthor	Kim, Cherry	-
dc.contributor.googleauthor	Choi, Byoung Wook	-
dc.contributor.googleauthor	Suh, Young Joo	-
dc.identifier.doi	10.1007/s10278-025-01704-2	-
dc.relation.journalcode	J04610	-
dc.identifier.eissn	2948-2933	-
dc.identifier.pmid	41055832	-
dc.identifier.url	https://link.springer.com/article/10.1007/s10278-025-01704-2	-
dc.subject.keyword	Large language models	-
dc.subject.keyword	Coronary computed tomography angiography	-
dc.subject.keyword	Coronary artery disease reporting and data system	-
dc.subject.keyword	Radiologists	-
dc.contributor.affiliatedAuthor	Yoo, Won-Seok	-
dc.contributor.affiliatedAuthor	Son, Jinwoo	-
dc.contributor.affiliatedAuthor	Kim, Jin Young	-
dc.contributor.affiliatedAuthor	Park, Jun Hye	-
dc.contributor.affiliatedAuthor	Park, Hee Jun	-
dc.contributor.affiliatedAuthor	Choi, Byoung Wook	-
dc.contributor.affiliatedAuthor	Suh, Young Joo	-
dc.identifier.scopusid	2-s2.0-105018322772	-
dc.identifier.wosid	001588315900001	-
dc.identifier.bibliographicCitation	JOURNAL OF IMAGING INFORMATICS IN MEDICINE, 2025-10	-
dc.identifier.rimsid	90614	-
dc.type.rims	ART	-
dc.description.journalClass	1	-
dc.description.journalClass	1	-
dc.subject.keywordAuthor	Large language models	-
dc.subject.keywordAuthor	Coronary computed tomography angiography	-
dc.subject.keywordAuthor	Coronary artery disease reporting and data system	-
dc.subject.keywordAuthor	Radiologists	-
dc.subject.keywordPlus	EXPERT CONSENSUS DOCUMENT	-
dc.subject.keywordPlus	COMPUTED-TOMOGRAPHY SCCT	-
dc.subject.keywordPlus	ARTERY-DISEASE	-
dc.subject.keywordPlus	AMERICAN-COLLEGE	-
dc.subject.keywordPlus	RADIOLOGY ACR	-
dc.subject.keywordPlus	DATA SYSTEM	-
dc.subject.keywordPlus	SOCIETY	-
dc.subject.keywordPlus	GUIDE	-
dc.type.docType	Article; Early Access	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalWebOfScienceCategory	Radiology, Nuclear Medicine & Medical Imaging	-
dc.relation.journalResearchArea	Radiology, Nuclear Medicine & Medical Imaging	-

Appears in Collections:: 1. College of Medicine (의과대학) > Dept. of Radiology (영상의학교실) > 1. Journal Papers

Show simple item record Find it @ YMLIB

License

YUHSpace: Large Language Models Versus Human Readers in CAD-RADS 2.0 Categorization of Coronary CT Angiography Reports

YUHSpace

BROWSE

Browse

Links