In-Context Learning with Large Language Models: A Simple and Effective Approach to Improve Radiology Report Labeling

Kim, Songsoo; Kim, Donghyun; Kim, Jaewoong; Koo, Jalim; Yoon, Jinsik; Yoon, Dukyong

doi:10.4258/hir.2025.31.3.295

YUHSpace

DC Field	Value	Language
dc.contributor.author	Kim, Songsoo	-
dc.contributor.author	Kim, Donghyun	-
dc.contributor.author	Kim, Jaewoong	-
dc.contributor.author	Koo, Jalim	-
dc.contributor.author	Yoon, Jinsik	-
dc.contributor.author	Yoon, Dukyong	-
dc.date.accessioned	2025-11-05T01:12:39Z	-
dc.date.available	2025-11-05T01:12:39Z	-
dc.date.created	2025-09-12	-
dc.date.issued	2025-07	-
dc.identifier.issn	2093-3681	-
dc.identifier.uri	https://ir.ymlib.yonsei.ac.kr/handle/22282913/208203	-
dc.description.abstract	Objectives: This study assessed the effectiveness of in-context learning using Generative Pre-trained Transformer-4 (GPT-4) for labeling radiology reports. Methods: In this retrospective study, radiology reports were obtained from the Medical Information Mart for Intensive Care III database. Two structured prompts, the "basic prompt" and the "in-context prompt," were compared. An optimization experiment was conducted to assess consistency and the occurrence of output format errors. The primary labeling experiments were performed on 200 unseen head computed tomography (CT) reports for multi-label classification of predefined labels (Experiment 1) and on 400 unseen abdominal CT reports for multi-label classification of actionable findings (Experiment 2). Results: The inter-reader accuracies in Experiments 1 and 2 were 0.93 and 0.84, respectively. For multi-label classification of head CT reports (Experiment 1), the in-context prompt led to notable increases in F1-scores for the "foreign body" and "mass" labels (gains of 0.66 and 0.22, respectively). However, improvements for other labels were modest. In multi-label classification of abdominal CT reports (Experiment 2), in-context prompts produced substantial improvements in F1-scores across all labels compared to basic prompts. Providing context equipped the model with domain-specific knowledge and helped align its existing knowledge, thereby improving performance. Conclusions: In-context learning with GPT-4 consistently improved performance in labeling radiology reports. This approach is particularly effective for subjective labeling tasks and allows the model to align its criteria with those of human annotators for objective labeling. This practical strategy offers a simple, adaptable, and researcher-oriented method that can be applied to diverse labeling tasks.	-
dc.format	application/pdf	-
dc.language	Korean	-
dc.publisher	Korean Society of Medical Informatics	-
dc.relation.isPartOf	HEALTHCARE INFORMATICS RESEARCH	-
dc.title	In-Context Learning with Large Language Models: A Simple and Effective Approach to Improve Radiology Report Labeling	-
dc.type	Article	-
dc.contributor.googleauthor	Kim, Songsoo	-
dc.contributor.googleauthor	Kim, Donghyun	-
dc.contributor.googleauthor	Kim, Jaewoong	-
dc.contributor.googleauthor	Koo, Jalim	-
dc.contributor.googleauthor	Yoon, Jinsik	-
dc.contributor.googleauthor	Yoon, Dukyong	-
dc.identifier.doi	10.4258/hir.2025.31.3.295	-
dc.relation.journalcode	J00974	-
dc.identifier.eissn	2093-369X	-
dc.identifier.pmid	40840937	-
dc.subject.keyword	Radiology	-
dc.subject.keyword	Natural Language Processing	-
dc.subject.keyword	Medical Informatics	-
dc.subject.keyword	Artificial Intelligence	-
dc.subject.keyword	Computer-Assisted Diagnosis	-
dc.contributor.affiliatedAuthor	Kim, Songsoo	-
dc.contributor.affiliatedAuthor	Kim, Jaewoong	-
dc.contributor.affiliatedAuthor	Koo, Jalim	-
dc.contributor.affiliatedAuthor	Yoon, Jinsik	-
dc.contributor.affiliatedAuthor	Yoon, Dukyong	-
dc.identifier.scopusid	2-s2.0-105013354654	-
dc.identifier.wosid	001547254900010	-
dc.citation.volume	31	-
dc.citation.number	3	-
dc.citation.startPage	295	-
dc.citation.endPage	309	-
dc.identifier.bibliographicCitation	HEALTHCARE INFORMATICS RESEARCH, Vol.31(3) : 295-309, 2025-07	-
dc.identifier.rimsid	89335	-
dc.type.rims	ART	-
dc.description.journalClass	1	-
dc.subject.keywordAuthor	Radiology	-
dc.subject.keywordAuthor	Natural Language Processing	-
dc.subject.keywordAuthor	Medical Informatics	-
dc.subject.keywordAuthor	Artificial Intelligence	-
dc.subject.keywordAuthor	Computer-Assisted Diagnosis	-
dc.type.docType	Article	-
dc.description.isOpenAccess	Y	-
dc.description.journalRegisteredClass	scopus	-
dc.description.journalRegisteredClass	kci	-
dc.relation.journalWebOfScienceCategory	Medical Informatics	-
dc.relation.journalResearchArea	Medical Informatics	-
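
The abstract in this record compares a "basic prompt" with an "in-context prompt" and reports per-label F1-scores for multi-label classification. The record does not include the prompts or the evaluation code, so the following is only a minimal sketch of that setup under stated assumptions: the label names in `LABELS`, the prompt wording, and the helper names `build_basic_prompt`, `build_in_context_prompt`, and `per_label_f1` are all hypothetical and are not taken from the paper.

```python
from typing import Dict, List, Set

# Hypothetical label set; the study's actual label definitions are not in this record.
LABELS = ["hemorrhage", "mass", "foreign body"]


def build_basic_prompt(report: str, labels: List[str]) -> str:
    """Task description and label names only (assumed wording)."""
    return (
        "Assign every applicable label to the radiology report.\n"
        f"Labels: {', '.join(labels)}\n"
        f"Report: {report}\n"
        "Answer with a comma-separated list of labels."
    )


def build_in_context_prompt(
    report: str, labels: List[str], context: Dict[str, str]
) -> str:
    """Same task, plus per-label criteria supplied as context (assumed wording)."""
    guidance = "\n".join(
        f"- {name}: {context[name]}" for name in labels if name in context
    )
    return (
        "Assign every applicable label to the radiology report.\n"
        f"Labels: {', '.join(labels)}\n"
        f"Labeling criteria:\n{guidance}\n"
        f"Report: {report}\n"
        "Answer with a comma-separated list of labels."
    )


def per_label_f1(
    gold: List[Set[str]], pred: List[Set[str]], labels: List[str]
) -> Dict[str, float]:
    """F1 per label: 2*TP / (2*TP + FP + FN); 0.0 when the label never occurs."""
    scores: Dict[str, float] = {}
    for label in labels:
        tp = sum(1 for g, p in zip(gold, pred) if label in g and label in p)
        fp = sum(1 for g, p in zip(gold, pred) if label not in g and label in p)
        fn = sum(1 for g, p in zip(gold, pred) if label in g and label not in p)
        denom = 2 * tp + fp + fn
        scores[label] = 2 * tp / denom if denom else 0.0
    return scores
```

With per-label scores computed separately for the two prompts, a gain such as the reported 0.66 for "foreign body" would correspond to the difference between the two values for that label. Returning 0.0 for a label with no gold or predicted instances is a convention chosen for this sketch, not something the record specifies.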

Appears in Collections:
1. College of Medicine (의과대학) > Dept. of Radiology (영상의학교실) > 1. Journal Papers
1. College of Medicine (의과대학) > Dept. of Biomedical Systems Informatics (의생명시스템정보학교실) > 1. Journal Papers
