78 208

Cited 4 times in

Evaluating diagnostic content of AI-generated chest radiography: A multi-center visual Turing test

Authors
 Youho Myong  ;  Dan Yoon  ;  Byeong Soo Kim  ;  Young Gyun Kim  ;  Yongsik Sim  ;  Suji Lee  ;  Jiyoung Yoon  ;  Minwoo Cho  ;  Sungwan Kim 
Citation
 PLOS ONE, Vol.18(4) : e0279349, 2023-04 
Journal Title
PLOS ONE
Issue Date
2023-04
MeSH
Artificial Intelligence* ; Certification ; Hospitals, University ; Humans ; Neural Networks, Computer* ; Radiography
Abstract
Background Accurate interpretation of chest radiographs requires years of medical training, and many countries face a shortage of medical professionals to meet such requirements. Recent advancements in artificial intelligence (AI) have aided diagnoses; however, their performance is often limited due to data imbalance. The aim of this study was to augment imbalanced medical data using generative adversarial networks (GANs) and evaluate the clinical quality of the generated images via a multi-center visual Turing test. Methods Using six chest radiograph datasets, (MIMIC, CheXPert, CXR8, JSRT, VBD, and OpenI), starGAN v2 generated chest radiographs with specific pathologies. Five board-certified radiologists from three university hospitals, each with at least five years of clinical experience, evaluated the image quality through a visual Turing test. Further evaluations were performed to investigate whether GAN augmentation enhanced the convolutional neural network (CNN) classifier performances. Results In terms of identifying GAN images as artificial, there was no significant difference in the sensitivity between radiologists and random guessing (result of radiologists: 147/275 (53.5%) vs result of random guessing: 137.5/275, (50%); p = .284). GAN augmentation enhanced CNN classifier performance by 11.7%. Conclusion Radiologists effectively classified chest pathologies with synthesized radiographs, suggesting that the images contained adequate clinical information. Furthermore, GAN augmentation enhanced CNN performance, providing a bypass to overcome data imbalance in medical AI training. CNN based methods rely on the amount and quality of training data; the present study showed that GAN augmentation could effectively augment training data for medical AI. © 2023 Myong et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Files in This Item:
T202302756.pdf Download
DOI
10.1371/journal.pone.0279349
Appears in Collections:
1. College of Medicine (의과대학) > Dept. of Radiology (영상의학교실) > 1. Journal Papers
Yonsei Authors
Sim, Yongsik(심용식) ORCID logo https://orcid.org/0000-0003-2711-2793
Yoon, Jiyoung(윤지영) ORCID logo https://orcid.org/0000-0003-2266-0803
Lee, Suji(이수지) ORCID logo https://orcid.org/0000-0002-8770-622X
URI
https://ir.ymlib.yonsei.ac.kr/handle/22282913/194241
사서에게 알리기
  feedback

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Browse

Links