5 19

Cited 2 times in

Cited 0 times in

Minimum Reporting Items for Clear Evaluation of Accuracy Reports of Large Language Models in Healthcare (MI-CLEAR-LLM): 2025 Updates

DC Field Value Language
dc.contributor.authorPark, Seong Ho-
dc.contributor.authorSuh, Chong Hyun-
dc.contributor.authorLee, Jeong Hyun-
dc.contributor.authorTejani, Ali S.-
dc.contributor.authorYou, Seng Chan-
dc.contributor.authorKahn Jr, Charles E.-
dc.contributor.authorMoy, Linda-
dc.date.accessioned2026-01-20T05:28:02Z-
dc.date.available2026-01-20T05:28:02Z-
dc.date.created2026-01-14-
dc.date.issued2025-12-
dc.identifier.issn1229-6929-
dc.identifier.urihttps://ir.ymlib.yonsei.ac.kr/handle/22282913/210022-
dc.description.abstractRecent systematic reviews have raised concerns about the quality of reporting in studies evaluating the accuracy of large language models (LLMs) in medical applications. Incomplete and inconsistent reporting hampers the ability of reviewers and readers to assess study methodology, interpret results, and evaluate reproducibility. To address this issue, the MInimum reporting items for CLear Evaluation of Accuracy Reports of Large Language Models in healthcare (MI-CLEAR-LLM) checklist was developed. This article presents an extensively updated version. While the original version focused on proprietary LLMs accessed via web-based chatbot interfaces, the updated checklist incorporates considerations relevant to application programming interfaces and self-managed models, typically based on open-source LLMs. As before, the revised MI-CLEARLLM focuses on reporting practices specific to LLM accuracy evaluations: specifically, the reporting of how LLMs are specified, accessed, adapted, and applied in testing, with special attention to methodological factors that influence outputs. The checklist includes essential items across categories such as model identification, access mode, input data type, adaptation strategy, prompt optimization, prompt execution, stochasticity management, and test data independence. This article also presents reporting examples from the literature. Adoption of the updated MI-CLEAR-LLM can help ensure transparency in reporting and enable more accurate and meaningful evaluation of studies.-
dc.languageEnglish-
dc.publisherKorean Society of Radiology-
dc.relation.isPartOfKOREAN JOURNAL OF RADIOLOGY-
dc.relation.isPartOfKOREAN JOURNAL OF RADIOLOGY-
dc.subject.MESHChecklist*-
dc.subject.MESHDelivery of Health Care*-
dc.subject.MESHHumans-
dc.subject.MESHLanguage*-
dc.subject.MESHLarge Language Models-
dc.subject.MESHReproducibility of Results-
dc.titleMinimum Reporting Items for Clear Evaluation of Accuracy Reports of Large Language Models in Healthcare (MI-CLEAR-LLM): 2025 Updates-
dc.typeArticle-
dc.contributor.googleauthorPark, Seong Ho-
dc.contributor.googleauthorSuh, Chong Hyun-
dc.contributor.googleauthorLee, Jeong Hyun-
dc.contributor.googleauthorTejani, Ali S.-
dc.contributor.googleauthorYou, Seng Chan-
dc.contributor.googleauthorKahn Jr, Charles E.-
dc.contributor.googleauthorMoy, Linda-
dc.identifier.doi10.3348/kjr.2025.1522-
dc.relation.journalcodeJ02884-
dc.identifier.eissn2005-8330-
dc.identifier.pmid41199132-
dc.subject.keywordLarge language model-
dc.subject.keywordLarge multimodal model-
dc.subject.keywordGenerative-
dc.subject.keywordArtificial intelligence-
dc.subject.keywordChatbot-
dc.subject.keywordApplication programming interface-
dc.subject.keywordLocal deployment-
dc.subject.keywordReporting-
dc.subject.keywordGuideline-
dc.subject.keywordChecklist-
dc.subject.keywordHealthcare-
dc.subject.keywordMedicine-
dc.subject.keywordRadiology-
dc.contributor.affiliatedAuthorYou, Seng Chan-
dc.identifier.scopusid2-s2.0-105022602794-
dc.identifier.wosid001628184400005-
dc.citation.volume26-
dc.citation.number12-
dc.citation.startPage1123-
dc.citation.endPage1132-
dc.identifier.bibliographicCitationKOREAN JOURNAL OF RADIOLOGY, Vol.26(12) : 1123-1132, 2025-12-
dc.identifier.rimsid90950-
dc.type.rimsART-
dc.description.journalClass1-
dc.description.journalClass1-
dc.subject.keywordAuthorLarge language model-
dc.subject.keywordAuthorLarge multimodal model-
dc.subject.keywordAuthorGenerative-
dc.subject.keywordAuthorArtificial intelligence-
dc.subject.keywordAuthorChatbot-
dc.subject.keywordAuthorApplication programming interface-
dc.subject.keywordAuthorLocal deployment-
dc.subject.keywordAuthorReporting-
dc.subject.keywordAuthorGuideline-
dc.subject.keywordAuthorChecklist-
dc.subject.keywordAuthorHealthcare-
dc.subject.keywordAuthorMedicine-
dc.subject.keywordAuthorRadiology-
dc.subject.keywordPlusGENERATIVE ARTIFICIAL-INTELLIGENCE-
dc.type.docTypeReview-
dc.identifier.kciidART003264398-
dc.description.isOpenAccessY-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.description.journalRegisteredClasskci-
dc.relation.journalWebOfScienceCategoryRadiology, Nuclear Medicine & Medical Imaging-
dc.relation.journalResearchAreaRadiology, Nuclear Medicine & Medical Imaging-
Appears in Collections:
1. College of Medicine (의과대학) > Dept. of Biomedical Systems Informatics (의생명시스템정보학교실) > 1. Journal Papers

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.