Minimum Reporting Items for Clear Evaluation of Accuracy Reports of Large Language Models in Healthcare (MI-CLEAR-LLM): 2025 Updates

Park, Seong Ho; Suh, Chong Hyun; Lee, Jeong Hyun; Tejani, Ali S.; You, Seng Chan; Kahn Jr, Charles E.; Moy, Linda

doi:10.3348/kjr.2025.1522

DC Field	Value	Language
dc.contributor.author	Park, Seong Ho	-
dc.contributor.author	Suh, Chong Hyun	-
dc.contributor.author	Lee, Jeong Hyun	-
dc.contributor.author	Tejani, Ali S.	-
dc.contributor.author	You, Seng Chan	-
dc.contributor.author	Kahn Jr, Charles E.	-
dc.contributor.author	Moy, Linda	-
dc.date.accessioned	2026-01-20T05:28:02Z	-
dc.date.available	2026-01-20T05:28:02Z	-
dc.date.created	2026-01-14	-
dc.date.issued	2025-12	-
dc.identifier.issn	1229-6929	-
dc.identifier.uri	https://ir.ymlib.yonsei.ac.kr/handle/22282913/210022	-
dc.description.abstract	Recent systematic reviews have raised concerns about the quality of reporting in studies evaluating the accuracy of large language models (LLMs) in medical applications. Incomplete and inconsistent reporting hampers the ability of reviewers and readers to assess study methodology, interpret results, and evaluate reproducibility. To address this issue, the MInimum reporting items for CLear Evaluation of Accuracy Reports of Large Language Models in healthcare (MI-CLEAR-LLM) checklist was developed. This article presents an extensively updated version. While the original version focused on proprietary LLMs accessed via web-based chatbot interfaces, the updated checklist incorporates considerations relevant to application programming interfaces and self-managed models, typically based on open-source LLMs. As before, the revised MI-CLEAR-LLM focuses on reporting practices specific to LLM accuracy evaluations: specifically, the reporting of how LLMs are specified, accessed, adapted, and applied in testing, with special attention to methodological factors that influence outputs. The checklist includes essential items across categories such as model identification, access mode, input data type, adaptation strategy, prompt optimization, prompt execution, stochasticity management, and test data independence. This article also presents reporting examples from the literature. Adoption of the updated MI-CLEAR-LLM can help ensure transparency in reporting and enable more accurate and meaningful evaluation of studies.	-
dc.language	English	-
dc.publisher	Korean Society of Radiology	-
dc.relation.isPartOf	KOREAN JOURNAL OF RADIOLOGY	-
dc.subject.MESH	Checklist*	-
dc.subject.MESH	Delivery of Health Care*	-
dc.subject.MESH	Humans	-
dc.subject.MESH	Language*	-
dc.subject.MESH	Large Language Models	-
dc.subject.MESH	Reproducibility of Results	-
dc.title	Minimum Reporting Items for Clear Evaluation of Accuracy Reports of Large Language Models in Healthcare (MI-CLEAR-LLM): 2025 Updates	-
dc.type	Article	-
dc.contributor.googleauthor	Park, Seong Ho	-
dc.contributor.googleauthor	Suh, Chong Hyun	-
dc.contributor.googleauthor	Lee, Jeong Hyun	-
dc.contributor.googleauthor	Tejani, Ali S.	-
dc.contributor.googleauthor	You, Seng Chan	-
dc.contributor.googleauthor	Kahn Jr, Charles E.	-
dc.contributor.googleauthor	Moy, Linda	-
dc.identifier.doi	10.3348/kjr.2025.1522	-
dc.relation.journalcode	J02884	-
dc.identifier.eissn	2005-8330	-
dc.identifier.pmid	41199132	-
dc.subject.keyword	Large language model	-
dc.subject.keyword	Large multimodal model	-
dc.subject.keyword	Generative	-
dc.subject.keyword	Artificial intelligence	-
dc.subject.keyword	Chatbot	-
dc.subject.keyword	Application programming interface	-
dc.subject.keyword	Local deployment	-
dc.subject.keyword	Reporting	-
dc.subject.keyword	Guideline	-
dc.subject.keyword	Checklist	-
dc.subject.keyword	Healthcare	-
dc.subject.keyword	Medicine	-
dc.subject.keyword	Radiology	-
dc.contributor.affiliatedAuthor	You, Seng Chan	-
dc.identifier.scopusid	2-s2.0-105022602794	-
dc.identifier.wosid	001628184400005	-
dc.citation.volume	26	-
dc.citation.number	12	-
dc.citation.startPage	1123	-
dc.citation.endPage	1132	-
dc.identifier.bibliographicCitation	KOREAN JOURNAL OF RADIOLOGY, Vol.26(12) : 1123-1132, 2025-12	-
dc.identifier.rimsid	90950	-
dc.type.rims	ART	-
dc.description.journalClass	1	-
dc.subject.keywordAuthor	Large language model	-
dc.subject.keywordAuthor	Large multimodal model	-
dc.subject.keywordAuthor	Generative	-
dc.subject.keywordAuthor	Artificial intelligence	-
dc.subject.keywordAuthor	Chatbot	-
dc.subject.keywordAuthor	Application programming interface	-
dc.subject.keywordAuthor	Local deployment	-
dc.subject.keywordAuthor	Reporting	-
dc.subject.keywordAuthor	Guideline	-
dc.subject.keywordAuthor	Checklist	-
dc.subject.keywordAuthor	Healthcare	-
dc.subject.keywordAuthor	Medicine	-
dc.subject.keywordAuthor	Radiology	-
dc.subject.keywordPlus	GENERATIVE ARTIFICIAL-INTELLIGENCE	-
dc.type.docType	Review	-
dc.identifier.kciid	ART003264398	-
dc.description.isOpenAccess	Y	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.description.journalRegisteredClass	kci	-
dc.relation.journalWebOfScienceCategory	Radiology, Nuclear Medicine & Medical Imaging	-
dc.relation.journalResearchArea	Radiology, Nuclear Medicine & Medical Imaging	-

Appears in Collections: 1. College of Medicine (의과대학) > Dept. of Biomedical Systems Informatics (의생명시스템정보학교실) > 1. Journal Papers
