In-Context Learning with Large Language Models: A Simple and Effective Approach to Improve Radiology Report Labeling
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Kim, Songsoo | - |
| dc.contributor.author | Kim, Donghyun | - |
| dc.contributor.author | Kim, Jaewoong | - |
| dc.contributor.author | Koo, Jalim | - |
| dc.contributor.author | Yoon, Jinsik | - |
| dc.contributor.author | Yoon, Dukyong | - |
| dc.date.accessioned | 2025-11-05T01:12:39Z | - |
| dc.date.available | 2025-11-05T01:12:39Z | - |
| dc.date.created | 2025-09-12 | - |
| dc.date.issued | 2025-07 | - |
| dc.identifier.issn | 2093-3681 | - |
| dc.identifier.uri | https://ir.ymlib.yonsei.ac.kr/handle/22282913/208203 | - |
| dc.description.abstract | Objectives: This study assessed the effectiveness of in-context learning using Generative Pre-trained Transformer-4 (GPT-4) for labeling radiology reports. Methods: In this retrospective study, radiology reports were obtained from the Medical Information Mart for Intensive Care III database. Two structured prompts, the "basic prompt" and the "in-context prompt," were compared. An optimization experiment was conducted to assess consistency and the occurrence of output format errors. The primary labeling experiments were performed on 200 unseen head computed tomography (CT) reports for multi-label classification of predefined labels (Experiment 1) and on 400 unseen abdominal CT reports for multi-label classification of actionable findings (Experiment 2). Results: The inter-reader accuracies in Experiments 1 and 2 were 0.93 and 0.84, respectively. For multi-label classification of head CT reports (Experiment 1), the in-context prompt led to notable increases in F1-scores for the "foreign body" and "mass" labels (gains of 0.66 and 0.22, respectively). However, improvements for other labels were modest. In multi-label classification of abdominal CT reports (Experiment 2), in-context prompts produced substantial improvements in F1-scores across all labels compared to basic prompts. Providing context equipped the model with domain-specific knowledge and helped align its existing knowledge, thereby improving performance. Conclusions: In-context learning with GPT-4 consistently improved performance in labeling radiology reports. This approach is particularly effective for subjective labeling tasks and allows the model to align its criteria with those of human annotators for objective labeling. This practical strategy offers a simple, adaptable, and researcher-oriented method that can be applied to diverse labeling tasks. | - |
| dc.format | application/pdf | - |
| dc.language | Korean | - |
| dc.publisher | Korean Society of Medical Informatics | - |
| dc.relation.isPartOf | HEALTHCARE INFORMATICS RESEARCH | - |
| dc.title | In-Context Learning with Large Language Models: A Simple and Effective Approach to Improve Radiology Report Labeling | - |
| dc.type | Article | - |
| dc.contributor.googleauthor | Kim, Songsoo | - |
| dc.contributor.googleauthor | Kim, Donghyun | - |
| dc.contributor.googleauthor | Kim, Jaewoong | - |
| dc.contributor.googleauthor | Koo, Jalim | - |
| dc.contributor.googleauthor | Yoon, Jinsik | - |
| dc.contributor.googleauthor | Yoon, Dukyong | - |
| dc.identifier.doi | 10.4258/hir.2025.31.3.295 | - |
| dc.relation.journalcode | J00974 | - |
| dc.identifier.eissn | 2093-369X | - |
| dc.identifier.pmid | 40840937 | - |
| dc.subject.keyword | Radiology | - |
| dc.subject.keyword | Natural Language Processing | - |
| dc.subject.keyword | Medical Informatics | - |
| dc.subject.keyword | Artificial Intelligence | - |
| dc.subject.keyword | Computer-Assisted Diagnosis | - |
| dc.contributor.affiliatedAuthor | Kim, Songsoo | - |
| dc.contributor.affiliatedAuthor | Kim, Jaewoong | - |
| dc.contributor.affiliatedAuthor | Koo, Jalim | - |
| dc.contributor.affiliatedAuthor | Yoon, Jinsik | - |
| dc.contributor.affiliatedAuthor | Yoon, Dukyong | - |
| dc.identifier.scopusid | 2-s2.0-105013354654 | - |
| dc.identifier.wosid | 001547254900010 | - |
| dc.citation.volume | 31 | - |
| dc.citation.number | 3 | - |
| dc.citation.startPage | 295 | - |
| dc.citation.endPage | 309 | - |
| dc.identifier.bibliographicCitation | HEALTHCARE INFORMATICS RESEARCH, Vol.31(3) : 295-309, 2025-07 | - |
| dc.identifier.rimsid | 89335 | - |
| dc.type.rims | ART | - |
| dc.description.journalClass | 1 | - |
| dc.subject.keywordAuthor | Radiology | - |
| dc.subject.keywordAuthor | Natural Language Processing | - |
| dc.subject.keywordAuthor | Medical Informatics | - |
| dc.subject.keywordAuthor | Artificial Intelligence | - |
| dc.subject.keywordAuthor | Computer-Assisted Diagnosis | - |
| dc.type.docType | Article | - |
| dc.description.isOpenAccess | Y | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.description.journalRegisteredClass | kci | - |
| dc.relation.journalWebOfScienceCategory | Medical Informatics | - |
| dc.relation.journalResearchArea | Medical Informatics | - |