Conceptual review of outcome metrics and measures used in clinical evaluation of artificial intelligence in radiology

Authors
 Seong Ho Park  ;  Kyunghwa Han  ;  June-Goo Lee 
Citation
 RADIOLOGIA MEDICA, Vol.129(11) : 1644-1655, 2024-11 
Journal Title
RADIOLOGIA MEDICA
ISSN
 0033-8362 
Issue Date
2024-11
MeSH
Artificial Intelligence* ; Humans ; Outcome Assessment, Health Care / methods ; Radiology*
Keywords
Artificial intelligence ; Evaluation ; Measure ; Metric ; Outcome ; Performance
Abstract
Artificial intelligence (AI) has numerous applications in radiology. Clinical research studies to evaluate the AI models are also diverse. Consequently, diverse outcome metrics and measures are employed in the clinical evaluation of AI, presenting a challenge for clinical radiologists. This review aims to provide conceptually intuitive explanations of the outcome metrics and measures that are most frequently used in clinical research, specifically tailored for clinicians. While we briefly discuss performance metrics for AI models in binary classification, detection, or segmentation tasks, our primary focus is on less frequently addressed topics in published literature. These include metrics and measures for evaluating multiclass classification; those for evaluating generative AI models, such as models used in image generation or modification and large language models; and outcome measures beyond performance metrics, including patient-centered outcome measures. Our explanations aim to guide clinicians in the appropriate use of these metrics and measures.
Full Text
https://link.springer.com/article/10.1007/s11547-024-01886-9
DOI
10.1007/s11547-024-01886-9
Appears in Collections:
1. College of Medicine (의과대학) > Others (기타) > 1. Journal Papers
Yonsei Authors
Han, Kyung Hwa (한경화)
URI
https://ir.ymlib.yonsei.ac.kr/handle/22282913/201501