0 1

Cited 0 times in

Cited 0 times in

Evaluation of Context-Aware Prompting Techniques for Classification of Tumor Response Categories in Radiology Reports Using Large Language Model

DC Field Value Language
dc.contributor.authorPark, Jiwoo-
dc.contributor.authorSim, Woo Seob-
dc.contributor.authorYu, Jae Yong-
dc.contributor.authorPark, Yu Rang-
dc.contributor.authorLee, Young Han-
dc.date.accessioned2026-01-16T05:56:27Z-
dc.date.available2026-01-16T05:56:27Z-
dc.date.created2026-01-02-
dc.date.issued2025-09-
dc.identifier.issn2948-2925-
dc.identifier.urihttps://ir.ymlib.yonsei.ac.kr/handle/22282913/209798-
dc.description.abstractRadiology reports are essential for medical decision-making, providing crucial data for diagnosing diseases, devising treatment plans, and monitoring disease progression. While large language models (LLMs) have shown promise in processing free-text reports, research on effective prompting techniques for radiologic applications remains limited. To evaluate the effectiveness of LLM-driven classification based on radiology reports in terms of tumor response category (TRC), and to optimize the model through a comparison of four different prompt engineering techniques for effectively performing this classification task in clinical applications, we included 3062 whole-spine contrast-enhanced magnetic resonance imaging (MRI) radiology reports for prompt engineering and validation. TRCs were labeled by two radiologists based on criteria modified from the Response Evaluation Criteria in Solid Tumors (RECIST) guidelines. The Llama3 instruct model was used to classify TRCs in this study through four different prompts: General, In-Context Learning (ICL), Chain-of-Thought (CoT), and ICL with CoT. AUROC, accuracy, precision, recall, and F1-score were calculated against each prompt and model (8B, 70B) with the test report dataset. The average AUROC for ICL (0.96 internally, 0.93 externally) and ICL with CoT prompts (0.97 internally, 0.94 externally) outperformed other prompts. Error increased with prompt complexity, including 0.8% incomplete sentence errors and 11.3% probability-classification inconsistencies. This study demonstrates that context-aware LLM prompts substantially improved the efficiency and effectiveness of classifying TRCs from radiology reports, despite potential intrinsic hallucinations. While further improvements are required for real-world application, our findings suggest that context-aware prompts have significant potential for segmenting complex radiology reports and enhancing oncology clinical workflows.-
dc.languageEnglish-
dc.publisherSpringer Nature-
dc.relation.isPartOfJOURNAL OF IMAGING INFORMATICS IN MEDICINE-
dc.relation.isPartOfJOURNAL OF IMAGING INFORMATICS IN MEDICINE-
dc.titleEvaluation of Context-Aware Prompting Techniques for Classification of Tumor Response Categories in Radiology Reports Using Large Language Model-
dc.typeArticle-
dc.contributor.googleauthorPark, Jiwoo-
dc.contributor.googleauthorSim, Woo Seob-
dc.contributor.googleauthorYu, Jae Yong-
dc.contributor.googleauthorPark, Yu Rang-
dc.contributor.googleauthorLee, Young Han-
dc.identifier.doi10.1007/s10278-025-01685-2-
dc.relation.journalcodeJ04610-
dc.identifier.eissn2948-2933-
dc.identifier.pmid41023521-
dc.identifier.urlhttps://link.springer.com/article/10.1007/s10278-025-01685-2-
dc.subject.keywordLarge language model-
dc.subject.keywordNatural language processing-
dc.subject.keywordRadiologic report-
dc.subject.keywordArtificial intelligence-
dc.subject.keywordDisease progression-
dc.contributor.affiliatedAuthorPark, Jiwoo-
dc.contributor.affiliatedAuthorSim, Woo Seob-
dc.contributor.affiliatedAuthorYu, Jae Yong-
dc.contributor.affiliatedAuthorPark, Yu Rang-
dc.contributor.affiliatedAuthorLee, Young Han-
dc.identifier.scopusid2-s2.0-105017458018-
dc.identifier.wosid001583096600001-
dc.identifier.bibliographicCitationJOURNAL OF IMAGING INFORMATICS IN MEDICINE, 2025-09-
dc.identifier.rimsid90709-
dc.type.rimsART-
dc.description.journalClass1-
dc.description.journalClass1-
dc.subject.keywordAuthorLarge language model-
dc.subject.keywordAuthorNatural language processing-
dc.subject.keywordAuthorRadiologic report-
dc.subject.keywordAuthorArtificial intelligence-
dc.subject.keywordAuthorDisease progression-
dc.type.docTypeArticle; Early Access-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalWebOfScienceCategoryRadiology, Nuclear Medicine & Medical Imaging-
dc.relation.journalResearchAreaRadiology, Nuclear Medicine & Medical Imaging-
Appears in Collections:
1. College of Medicine (의과대학) > Dept. of Radiology (영상의학교실) > 1. Journal Papers
1. College of Medicine (의과대학) > Dept. of Biomedical Systems Informatics (의생명시스템정보학교실) > 1. Journal Papers

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.