0 0

Cited 0 times in

Cited 0 times in

Predicting molecular types of adult-type diffuse gliomas based on MRI reports with large language models

Authors
 Suh, Pae Sun  ;  Lee, Dahyoun  ;  Bang, Chang-Bae  ;  Han, Kyunghwa  ;  Choi, Kyu Sung  ;  Kim, Minjae  ;  Park, Ji Eun  ;  Shin, Na-Young  ;  Ahn, Sung Soo  ;  Choi, Seung Hong  ;  Kim, Ho Sung  ;  Lee, Seung-Koo  ;  Chang, Jong Hee  ;  Kim, Se Hoon  ;  Foltyn-Dumitru, Martha  ;  You, Seng Chan  ;  Vollmuth, Philipp  ;  Kim, Byung-Hoon  ;  Park, Yae Won 
Citation
 EUROPEAN RADIOLOGY, 2025-12 
Journal Title
EUROPEAN RADIOLOGY
ISSN
 0938-7994 
Issue Date
2025-12
Keywords
Artificial intelligence ; Glioma ; Large language model ; Magnetic resonance imaging
Abstract
Objectives To evaluate the performance of large language models (LLMs) in predicting molecular types of adult-type diffuse gliomas according to the 2021 WHO classification using MRI radiology reports. Materials and methods This retrospective study included 2169 patients diagnosed with adult-type diffuse gliomas (294 oligodendrogliomas, 295 IDH-mutant astrocytomas, and 1580 IDH-wildtype glioblastomas) between July 2005 and March 2024 from four hospitals in Asia and Europe. Seven proprietary and open-source LLMs were assessed: GPT-4o-mini, GPT-4.1-mini, Llama 3.1 8B, Llama 3.1 70B, Qwen2.5 7B, Deepseek-r1 8B, and Mistal 7B. The performance of LLMs in classifying molecular types was compared based on the provision of relevant knowledge of glioma imaging findings (knowledge-based vs. na & iuml;ve prompt). The impact of radiologists' subspecialization in neuro-oncology, report quality, and reporting language on LLMs' performance was also evaluated. Results LLMs achieved significantly higher (na & iuml;ve vs. knowledge-based; GPT-4o-mini, 77.0% vs. 79.1%, p < 0.001; Qwen2.5 7B, 75.9% vs. 79.5%, p < 0.001; Deepseek-r1 8B, 66.0% vs. 73.2%, p < 0.001) or comparable accuracy (GPT-4.1-mini, 78.7% vs. 78.6%; Llama 3.1 70B, 78.0% vs. 78.1%; Mistral 7B, 58.4% vs. 57.4%) using knowledge-based prompt compared to na & iuml;ve prompt, except for Llama 3.1 8B (65.4% vs. 44.6%, p < 0.001). Differences in accuracy were more pronounced in smaller-sized LLMs. Additionally, the accuracy was significantly higher with reports by neuro-oncology specialists and high-quality reports in all LLMs (p < 0.001). Conclusions LLMs may provide preoperative information on the tumor types of adult-type diffuse gliomas from MRI reports by providing relevant knowledge in the prompt. Informative and descriptive reports could further enhance LLMs' performance.
DOI
10.1007/s00330-025-12211-x
Appears in Collections:
1. College of Medicine (의과대학) > Dept. of Radiology (영상의학교실) > 1. Journal Papers
1. College of Medicine (의과대학) > Dept. of Neurosurgery (신경외과학교실) > 1. Journal Papers
1. College of Medicine (의과대학) > Dept. of Pathology (병리학교실) > 1. Journal Papers
1. College of Medicine (의과대학) > Dept. of Biomedical Systems Informatics (의생명시스템정보학교실) > 1. Journal Papers
Yonsei Authors
Kim, Byung Hoon(김병훈)
Kim, Se Hoon(김세훈) ORCID logo https://orcid.org/0000-0001-7516-7372
Park, Yae Won(박예원) ORCID logo https://orcid.org/0000-0001-8907-5401
Suh, Pae Sun(서배선)
Shin, Na Young(신나영)
Ahn, Sung Soo(안성수) ORCID logo https://orcid.org/0000-0002-0503-5558
You, Seng Chan(유승찬) ORCID logo https://orcid.org/0000-0002-5052-6399
Lee, Seung Koo(이승구) ORCID logo https://orcid.org/0000-0001-5646-4072
Chang, Jong Hee(장종희) ORCID logo https://orcid.org/0000-0003-1509-9800
Han, Kyung Hwa(한경화)
URI
https://ir.ymlib.yonsei.ac.kr/handle/22282913/210151
사서에게 알리기
  feedback

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Browse

Links