5 19

Cited 2 times in

Cited 0 times in

Minimum Reporting Items for Clear Evaluation of Accuracy Reports of Large Language Models in Healthcare (MI-CLEAR-LLM): 2025 Updates

Authors
 Park, Seong Ho  ;  Suh, Chong Hyun  ;  Lee, Jeong Hyun  ;  Tejani, Ali S.  ;  You, Seng Chan  ;  Kahn Jr, Charles E.  ;  Moy, Linda 
Citation
 KOREAN JOURNAL OF RADIOLOGY, Vol.26(12) : 1123-1132, 2025-12 
Journal Title
KOREAN JOURNAL OF RADIOLOGY
ISSN
 1229-6929 
Issue Date
2025-12
MeSH
Checklist* ; Delivery of Health Care* ; Humans ; Language* ; Large Language Models ; Reproducibility of Results
Keywords
Large language model ; Large multimodal model ; Generative ; Artificial intelligence ; Chatbot ; Application programming interface ; Local deployment ; Reporting ; Guideline ; Checklist ; Healthcare ; Medicine ; Radiology
Abstract
Recent systematic reviews have raised concerns about the quality of reporting in studies evaluating the accuracy of large language models (LLMs) in medical applications. Incomplete and inconsistent reporting hampers the ability of reviewers and readers to assess study methodology, interpret results, and evaluate reproducibility. To address this issue, the MInimum reporting items for CLear Evaluation of Accuracy Reports of Large Language Models in healthcare (MI-CLEAR-LLM) checklist was developed. This article presents an extensively updated version. While the original version focused on proprietary LLMs accessed via web-based chatbot interfaces, the updated checklist incorporates considerations relevant to application programming interfaces and self-managed models, typically based on open-source LLMs. As before, the revised MI-CLEARLLM focuses on reporting practices specific to LLM accuracy evaluations: specifically, the reporting of how LLMs are specified, accessed, adapted, and applied in testing, with special attention to methodological factors that influence outputs. The checklist includes essential items across categories such as model identification, access mode, input data type, adaptation strategy, prompt optimization, prompt execution, stochasticity management, and test data independence. This article also presents reporting examples from the literature. Adoption of the updated MI-CLEAR-LLM can help ensure transparency in reporting and enable more accurate and meaningful evaluation of studies.
Files in This Item:
90950.pdf Download
DOI
10.3348/kjr.2025.1522
Appears in Collections:
1. College of Medicine (의과대학) > Dept. of Biomedical Systems Informatics (의생명시스템정보학교실) > 1. Journal Papers
Yonsei Authors
You, Seng Chan(유승찬) ORCID logo https://orcid.org/0000-0002-5052-6399
URI
https://ir.ymlib.yonsei.ac.kr/handle/22282913/210022
사서에게 알리기
  feedback

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Browse

Links