Cited 0 times in 
Cited 0 times in 
Confidence-linked and uncertainty-based staged framework for phenotype validation using large language models
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Lee, Sumin | - |
| dc.contributor.author | Lee, Hyeok-Hee | - |
| dc.contributor.author | Lee, Hokyou | - |
| dc.contributor.author | Yum, Kyu Sun | - |
| dc.contributor.author | Baek, Jang-Hyun | - |
| dc.contributor.author | Khil, Jaewon | - |
| dc.contributor.author | Lee, Jaeyong | - |
| dc.contributor.author | Shin, Sojung | - |
| dc.contributor.author | Cho, Minsung | - |
| dc.contributor.author | Ahn, Na Yeon | - |
| dc.contributor.author | You, Seng Chan | - |
| dc.contributor.author | Kim, Hyeon Chang | - |
| dc.contributor.author | 이혁희 | - |
| dc.date.accessioned | 2025-11-04T02:34:26Z | - |
| dc.date.available | 2025-11-04T02:34:26Z | - |
| dc.date.created | 2025-09-12 | - |
| dc.date.issued | 2025-08 | - |
| dc.identifier.issn | 1067-5027 | - |
| dc.identifier.uri | https://ir.ymlib.yonsei.ac.kr/handle/22282913/208166 | - |
| dc.description.abstract | Objectives This study develops and validates the confidence-linked and uncertainty-based staged (CLUES) framework by integrating large language models (LLMs) with uncertainty quantification to assist manual chart review while ensuring reliability through a selective human review.Materials and Methods The CLUES framework assesses stroke-related hospitalizations using imaging reports for 1739 patients across 24 Korean hospitals (2011-2022). Uncertainty was quantified via entropy from LLM-derived confidence values. Our framework operated in 3 stages: (1) zero-shot prompting with ensemble averaging, where high-uncertainty cases advanced to stage 2, (2) few-shot prompting using retrieved low-uncertainty cases, with remaining high-uncertainty cases proceeding to stage 3, and (3) manual chart review for final uncertain cases. Performance was evaluated against physician-labeled data using F1-score and Cohen's Kappa.Results Among 1072 test cases, stage 1 classified 507 cases as low uncertainty, while 565 were high uncertainty. Stage 2 reclassified 280 cases as low uncertainty, leaving 285 for manual review. Low-uncertainty cases consistently outperformed high-uncertainty cases in both stages (weighted F1-scores: 0.94 vs 0.57 in stage 1 and 0.82 vs 0.58 in stage 2). The overall framework performance showed a progressive improvement in F1-scores from 0.840 (stage 1) to 0.878 (stage 2) to 0.955 (stage 3).Discussion The CLUES framework reduced manual review burden by 75% while maintaining high accuracy. By integrating uncertainty quantification with selective human oversight, it provides an efficient and reliable approach to phenotype validation.Conclusion This framework demonstrates the effective integration of LLMs into clinical workflows while ensuring human oversight, enhancing both accuracy and efficiency. | - |
| dc.language | English | - |
| dc.publisher | Oxford University Press | - |
| dc.relation.isPartOf | JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION | - |
| dc.relation.isPartOf | JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION | - |
| dc.title | Confidence-linked and uncertainty-based staged framework for phenotype validation using large language models | - |
| dc.type | Article | - |
| dc.contributor.googleauthor | Lee, Sumin | - |
| dc.contributor.googleauthor | Lee, Hyeok-Hee | - |
| dc.contributor.googleauthor | Lee, Hokyou | - |
| dc.contributor.googleauthor | Yum, Kyu Sun | - |
| dc.contributor.googleauthor | Baek, Jang-Hyun | - |
| dc.contributor.googleauthor | Khil, Jaewon | - |
| dc.contributor.googleauthor | Lee, Jaeyong | - |
| dc.contributor.googleauthor | Shin, Sojung | - |
| dc.contributor.googleauthor | Cho, Minsung | - |
| dc.contributor.googleauthor | Ahn, Na Yeon | - |
| dc.contributor.googleauthor | You, Seng Chan | - |
| dc.contributor.googleauthor | Kim, Hyeon Chang | - |
| dc.identifier.doi | 10.1093/jamia/ocaf099 | - |
| dc.relation.journalcode | J04522 | - |
| dc.identifier.eissn | 1527-974X | - |
| dc.identifier.pmid | 40574695 | - |
| dc.identifier.url | https://academic.oup.com/jamia/article-abstract/32/8/1320/8165643 | - |
| dc.subject.keyword | review | - |
| dc.subject.keyword | phenotype | - |
| dc.subject.keyword | large language models | - |
| dc.subject.keyword | uncertainty | - |
| dc.subject.keyword | entropy | - |
| dc.contributor.affiliatedAuthor | Lee, Sumin | - |
| dc.contributor.affiliatedAuthor | Lee, Hyeok-Hee | - |
| dc.contributor.affiliatedAuthor | Lee, Hokyou | - |
| dc.contributor.affiliatedAuthor | Khil, Jaewon | - |
| dc.contributor.affiliatedAuthor | Lee, Jaeyong | - |
| dc.contributor.affiliatedAuthor | Shin, Sojung | - |
| dc.contributor.affiliatedAuthor | Cho, Minsung | - |
| dc.contributor.affiliatedAuthor | Ahn, Na Yeon | - |
| dc.contributor.affiliatedAuthor | You, Seng Chan | - |
| dc.contributor.affiliatedAuthor | Kim, Hyeon Chang | - |
| dc.identifier.scopusid | 2-s2.0-105011210392 | - |
| dc.identifier.wosid | 001517823500001 | - |
| dc.citation.volume | 32 | - |
| dc.citation.number | 8 | - |
| dc.citation.startPage | 1320 | - |
| dc.citation.endPage | 1327 | - |
| dc.identifier.bibliographicCitation | JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, Vol.32(8) : 1320-1327, 2025-08 | - |
| dc.identifier.rimsid | 89402 | - |
| dc.type.rims | ART | - |
| dc.description.journalClass | 1 | - |
| dc.description.journalClass | 1 | - |
| dc.subject.keywordAuthor | review | - |
| dc.subject.keywordAuthor | phenotype | - |
| dc.subject.keywordAuthor | large language models | - |
| dc.subject.keywordAuthor | uncertainty | - |
| dc.subject.keywordAuthor | entropy | - |
| dc.type.docType | Article | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | ssci | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Information Systems | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Interdisciplinary Applications | - |
| dc.relation.journalWebOfScienceCategory | Health Care Sciences & Services | - |
| dc.relation.journalWebOfScienceCategory | Information Science & Library Science | - |
| dc.relation.journalWebOfScienceCategory | Medical Informatics | - |
| dc.relation.journalResearchArea | Computer Science | - |
| dc.relation.journalResearchArea | Health Care Sciences & Services | - |
| dc.relation.journalResearchArea | Information Science & Library Science | - |
| dc.relation.journalResearchArea | Medical Informatics | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.