0 379

Cited 0 times in

Cited 0 times in

Controllable Text-to-Image Synthesis for Multi-Modality MR Images

DC Field Value Language
dc.contributor.author김휘영-
dc.contributor.author안성수-
dc.date.accessioned2025-07-09T08:34:37Z-
dc.date.available2025-07-09T08:34:37Z-
dc.date.issued2024-04-
dc.identifier.urihttps://ir.ymlib.yonsei.ac.kr/handle/22282913/206491-
dc.description.abstractGenerative modeling has seen significant advancements in recent years, especially in the realm of text-to-image synthesis. Despite this progress, the medical field has yet to fully leverage the capabilities of large-scale foundational models for synthetic data generation. This paper introduces a framework for text-conditional magnetic resonance (MR) imaging generation, addressing the complexities associated with multi-modality considerations. The framework comprises a pre-trained large language model, a diffusion-based prompt-conditional image generation architecture, and an additional denoising network for input structural binary masks. Experimental results demonstrate that the proposed framework is capable of generating realistic, high-resolution, and high-fidelity multi-modal MR images that align with medical language text prompts. Further, the study interprets the cross-attention maps of the generated results based on text-conditional statements. The contributions of this research lay a robust foundation for future studies in text-conditional medical image generation and hold significant promise for accelerating advancements in medical imaging research.-
dc.description.statementOfResponsibilityrestriction-
dc.relation.isPartOf2024 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION, WACV 2024-
dc.rightsCC BY-NC-ND 2.0 KR-
dc.titleControllable Text-to-Image Synthesis for Multi-Modality MR Images-
dc.typeArticle-
dc.contributor.collegeCollege of Medicine (의과대학)-
dc.contributor.departmentDept. of Neurosurgery (신경외과학교실)-
dc.contributor.googleauthorKyuri Kim-
dc.contributor.googleauthorYoonho Na-
dc.contributor.googleauthorSung-Joon Ye-
dc.contributor.googleauthorJimin Lee-
dc.contributor.googleauthorSung Soo Ahn-
dc.contributor.googleauthorJi Eun Park-
dc.identifier.doi10.1109/WACV57701.2024.00775-
dc.contributor.localIdA05971-
dc.contributor.localIdA02234-
dc.identifier.urlhttps://ieeexplore.ieee.org/document/10484486-
dc.subject.keywordApplications-
dc.subject.keywordBiomedical / healthcare / medicine-
dc.subject.keywordAlgorithms-
dc.subject.keywordVision + language and/or other modalities-
dc.contributor.alternativeNameKim, Hwiyoung-
dc.contributor.affiliatedAuthor김휘영-
dc.contributor.affiliatedAuthor안성수-
dc.citation.startPage7921-
dc.citation.endPage7930-
dc.identifier.bibliographicCitation2024 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION, WACV 2024, : 7921-7930, 2024-04-
Appears in Collections:
1. College of Medicine (의과대학) > Dept. of Neurosurgery (신경외과학교실) > 1. Journal Papers
1. College of Medicine (의과대학) > Dept. of Radiology (영상의학교실) > 1. Journal Papers

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.