Cited 1 times in

Differentiation of glioblastoma from solitary brain metastasis using deep ensembles: Empirical estimation of uncertainty for clinical reliability

Authors
 Yae Won Park  ;  Sujeong Eom  ;  Seungwoo Kim  ;  Sungbin Lim  ;  Ji Eun Park  ;  Ho Sung Kim  ;  Seng Chan You  ;  Sung Soo Ahn  ;  Seung-Koo Lee 
Citation
 COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, Vol.254 : 108288, 2024-09 
Journal Title
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE
ISSN
 0169-2607 
Issue Date
2024-09
MeSH
Adult ; Aged ; Area Under Curve ; Brain Neoplasms* / diagnostic imaging ; Brain Neoplasms* / secondary ; Deep Learning ; Diagnosis, Differential ; Female ; Glioblastoma* / diagnostic imaging ; Glioblastoma* / pathology ; Humans ; Magnetic Resonance Imaging / methods ; Male ; Middle Aged ; Reproducibility of Results ; Sensitivity and Specificity ; Uncertainty
Keywords
Brain metastasis ; Deep learning ; Glioblastoma ; Magnetic resonance imaging ; Reliability ; Uncertainty
Abstract
Background and Objectives: To develop a clinically reliable deep learning model to differentiate glioblastoma (GBM) from solitary brain metastasis (SBM) by providing predictive uncertainty estimates and interpretability. Methods: A total of 469 patients (300 GBM, 169 SBM) were enrolled in the institutional training set. Deep ensembles based on DenseNet121 were trained on multiparametric MRI. The model performance was validated in the external test set consisting of 143 patients (101 GBM, 42 SBM). Entropy values for each input were evaluated for uncertainty measurement; based on entropy values, the datasets were split to high- and low-uncertainty groups. In addition, entropy values of out-of-distribution (OOD) data from unknown class (257 patients with meningioma) were compared to assess uncertainty estimates of the model. The model interpretability was further evaluated by localization accuracy of the model. Results: On external test set, the area under the curve (AUC), accuracy, sensitivity and specificity of the deep ensembles were 0.83 (95 % confidence interval [CI] 0.76–0.90), 76.2 %, 54.8 % and 85.2 %, respectively. The performance was higher in the low-uncertainty group than in the high-uncertainty group, with AUCs of 0.91 (95 % CI 0.83–0.98) and 0.58 (95 % CI 0.44–0.71), indicating that assessment of uncertainty with entropy values ascertained reliable prediction in the low-uncertainty group. Further, deep ensembles classified a high proportion (90.7 %) of predictions on OOD data to be uncertain, showing robustness in dataset shift. Interpretability evaluated by localization accuracy provided further reliability in the “low-uncertainty and high-localization accuracy” subgroup, with an AUC of 0.98 (95 % CI 0.95–1.00). Conclusions: Empirical assessment of uncertainty and interpretability in deep ensembles provides evidence for the robustness of prediction, offering a clinically reliable model in differentiating GBM from SBM.
Full Text
https://www.sciencedirect.com/science/article/pii/S0169260724002839
DOI
10.1016/j.cmpb.2024.108288
Appears in Collections:
1. College of Medicine (의과대학) > Dept. of Biomedical Systems Informatics (의생명시스템정보학교실) > 1. Journal Papers
1. College of Medicine (의과대학) > Dept. of Radiology (영상의학교실) > 1. Journal Papers
Yonsei Authors
Park, Yae Won(박예원) ORCID logo https://orcid.org/0000-0001-8907-5401
Ahn, Sung Soo(안성수) ORCID logo https://orcid.org/0000-0002-0503-5558
You, Seng Chan(유승찬) ORCID logo https://orcid.org/0000-0002-5052-6399
Lee, Seung Koo(이승구) ORCID logo https://orcid.org/0000-0001-5646-4072
URI
https://ir.ymlib.yonsei.ac.kr/handle/22282913/200734
사서에게 알리기
  feedback

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Browse

Links