Confidence-linked and uncertainty-based staged framework for phenotype validation using large language models

Lee, Sumin; Lee, Hyeok-Hee; Lee, Hokyou; Yum, Kyu Sun; Baek, Jang-Hyun; Khil, Jaewon; Lee, Jaeyong; Shin, Sojung; Cho, Minsung; Ahn, Na Yeon; You, Seng Chan; Kim, Hyeon Chang

doi:10.1093/jamia/ocaf099

YUHSpace

BROWSE

0 59

Cited 0 times in

Confidence-linked and uncertainty-based staged framework for phenotype validation using large language models

DC Field	Value	Language
dc.contributor.author	Lee, Sumin	-
dc.contributor.author	Lee, Hyeok-Hee	-
dc.contributor.author	Lee, Hokyou	-
dc.contributor.author	Yum, Kyu Sun	-
dc.contributor.author	Baek, Jang-Hyun	-
dc.contributor.author	Khil, Jaewon	-
dc.contributor.author	Lee, Jaeyong	-
dc.contributor.author	Shin, Sojung	-
dc.contributor.author	Cho, Minsung	-
dc.contributor.author	Ahn, Na Yeon	-
dc.contributor.author	You, Seng Chan	-
dc.contributor.author	Kim, Hyeon Chang	-
dc.contributor.author	이혁희	-
dc.date.accessioned	2025-11-04T02:34:26Z	-
dc.date.available	2025-11-04T02:34:26Z	-
dc.date.created	2025-09-12	-
dc.date.issued	2025-08	-
dc.identifier.issn	1067-5027	-
dc.identifier.uri	https://ir.ymlib.yonsei.ac.kr/handle/22282913/208166	-
dc.description.abstract	Objectives This study develops and validates the confidence-linked and uncertainty-based staged (CLUES) framework by integrating large language models (LLMs) with uncertainty quantification to assist manual chart review while ensuring reliability through a selective human review.Materials and Methods The CLUES framework assesses stroke-related hospitalizations using imaging reports for 1739 patients across 24 Korean hospitals (2011-2022). Uncertainty was quantified via entropy from LLM-derived confidence values. Our framework operated in 3 stages: (1) zero-shot prompting with ensemble averaging, where high-uncertainty cases advanced to stage 2, (2) few-shot prompting using retrieved low-uncertainty cases, with remaining high-uncertainty cases proceeding to stage 3, and (3) manual chart review for final uncertain cases. Performance was evaluated against physician-labeled data using F1-score and Cohen's Kappa.Results Among 1072 test cases, stage 1 classified 507 cases as low uncertainty, while 565 were high uncertainty. Stage 2 reclassified 280 cases as low uncertainty, leaving 285 for manual review. Low-uncertainty cases consistently outperformed high-uncertainty cases in both stages (weighted F1-scores: 0.94 vs 0.57 in stage 1 and 0.82 vs 0.58 in stage 2). The overall framework performance showed a progressive improvement in F1-scores from 0.840 (stage 1) to 0.878 (stage 2) to 0.955 (stage 3).Discussion The CLUES framework reduced manual review burden by 75% while maintaining high accuracy. By integrating uncertainty quantification with selective human oversight, it provides an efficient and reliable approach to phenotype validation.Conclusion This framework demonstrates the effective integration of LLMs into clinical workflows while ensuring human oversight, enhancing both accuracy and efficiency.	-
dc.language	English	-
dc.publisher	Oxford University Press	-
dc.relation.isPartOf	JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION	-
dc.relation.isPartOf	JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION	-
dc.title	Confidence-linked and uncertainty-based staged framework for phenotype validation using large language models	-
dc.type	Article	-
dc.contributor.googleauthor	Lee, Sumin	-
dc.contributor.googleauthor	Lee, Hyeok-Hee	-
dc.contributor.googleauthor	Lee, Hokyou	-
dc.contributor.googleauthor	Yum, Kyu Sun	-
dc.contributor.googleauthor	Baek, Jang-Hyun	-
dc.contributor.googleauthor	Khil, Jaewon	-
dc.contributor.googleauthor	Lee, Jaeyong	-
dc.contributor.googleauthor	Shin, Sojung	-
dc.contributor.googleauthor	Cho, Minsung	-
dc.contributor.googleauthor	Ahn, Na Yeon	-
dc.contributor.googleauthor	You, Seng Chan	-
dc.contributor.googleauthor	Kim, Hyeon Chang	-
dc.identifier.doi	10.1093/jamia/ocaf099	-
dc.relation.journalcode	J04522	-
dc.identifier.eissn	1527-974X	-
dc.identifier.pmid	40574695	-
dc.identifier.url	https://academic.oup.com/jamia/article-abstract/32/8/1320/8165643	-
dc.subject.keyword	review	-
dc.subject.keyword	phenotype	-
dc.subject.keyword	large language models	-
dc.subject.keyword	uncertainty	-
dc.subject.keyword	entropy	-
dc.contributor.affiliatedAuthor	Lee, Sumin	-
dc.contributor.affiliatedAuthor	Lee, Hyeok-Hee	-
dc.contributor.affiliatedAuthor	Lee, Hokyou	-
dc.contributor.affiliatedAuthor	Khil, Jaewon	-
dc.contributor.affiliatedAuthor	Lee, Jaeyong	-
dc.contributor.affiliatedAuthor	Shin, Sojung	-
dc.contributor.affiliatedAuthor	Cho, Minsung	-
dc.contributor.affiliatedAuthor	Ahn, Na Yeon	-
dc.contributor.affiliatedAuthor	You, Seng Chan	-
dc.contributor.affiliatedAuthor	Kim, Hyeon Chang	-
dc.identifier.scopusid	2-s2.0-105011210392	-
dc.identifier.wosid	001517823500001	-
dc.citation.volume	32	-
dc.citation.number	8	-
dc.citation.startPage	1320	-
dc.citation.endPage	1327	-
dc.identifier.bibliographicCitation	JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, Vol.32(8) : 1320-1327, 2025-08	-
dc.identifier.rimsid	89402	-
dc.type.rims	ART	-
dc.description.journalClass	1	-
dc.description.journalClass	1	-
dc.subject.keywordAuthor	review	-
dc.subject.keywordAuthor	phenotype	-
dc.subject.keywordAuthor	large language models	-
dc.subject.keywordAuthor	uncertainty	-
dc.subject.keywordAuthor	entropy	-
dc.type.docType	Article	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	ssci	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalWebOfScienceCategory	Computer Science, Information Systems	-
dc.relation.journalWebOfScienceCategory	Computer Science, Interdisciplinary Applications	-
dc.relation.journalWebOfScienceCategory	Health Care Sciences & Services	-
dc.relation.journalWebOfScienceCategory	Information Science & Library Science	-
dc.relation.journalWebOfScienceCategory	Medical Informatics	-
dc.relation.journalResearchArea	Computer Science	-
dc.relation.journalResearchArea	Health Care Sciences & Services	-
dc.relation.journalResearchArea	Information Science & Library Science	-
dc.relation.journalResearchArea	Medical Informatics	-

Appears in Collections:: 1. College of Medicine (의과대학) > Dept. of Preventive Medicine (예방의학교실) > 1. Journal Papers
1. College of Medicine (의과대학) > Dept. of Biomedical Systems Informatics (의생명시스템정보학교실) > 1. Journal Papers

Show simple item record Find it @ YMLIB

License

YUHSpace: Confidence-linked and uncertainty-based staged framework for phenotype validation using large language models

YUHSpace

BROWSE

Browse

Links