Cited 0 times in
Cited 0 times in
Controllable Text-to-Image Synthesis for Multi-Modality MR Images
DC Field | Value | Language |
---|---|---|
dc.contributor.author | 김휘영 | - |
dc.contributor.author | 안성수 | - |
dc.date.accessioned | 2025-07-09T08:34:37Z | - |
dc.date.available | 2025-07-09T08:34:37Z | - |
dc.date.issued | 2024-04 | - |
dc.identifier.uri | https://ir.ymlib.yonsei.ac.kr/handle/22282913/206491 | - |
dc.description.abstract | Generative modeling has seen significant advancements in recent years, especially in the realm of text-to-image synthesis. Despite this progress, the medical field has yet to fully leverage the capabilities of large-scale foundational models for synthetic data generation. This paper introduces a framework for text-conditional magnetic resonance (MR) imaging generation, addressing the complexities associated with multi-modality considerations. The framework comprises a pre-trained large language model, a diffusion-based prompt-conditional image generation architecture, and an additional denoising network for input structural binary masks. Experimental results demonstrate that the proposed framework is capable of generating realistic, high-resolution, and high-fidelity multi-modal MR images that align with medical language text prompts. Further, the study interprets the cross-attention maps of the generated results based on text-conditional statements. The contributions of this research lay a robust foundation for future studies in text-conditional medical image generation and hold significant promise for accelerating advancements in medical imaging research. | - |
dc.description.statementOfResponsibility | restriction | - |
dc.relation.isPartOf | 2024 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION, WACV 2024 | - |
dc.rights | CC BY-NC-ND 2.0 KR | - |
dc.title | Controllable Text-to-Image Synthesis for Multi-Modality MR Images | - |
dc.type | Article | - |
dc.contributor.college | College of Medicine (의과대학) | - |
dc.contributor.department | Dept. of Neurosurgery (신경외과학교실) | - |
dc.contributor.googleauthor | Kyuri Kim | - |
dc.contributor.googleauthor | Yoonho Na | - |
dc.contributor.googleauthor | Sung-Joon Ye | - |
dc.contributor.googleauthor | Jimin Lee | - |
dc.contributor.googleauthor | Sung Soo Ahn | - |
dc.contributor.googleauthor | Ji Eun Park | - |
dc.identifier.doi | 10.1109/WACV57701.2024.00775 | - |
dc.contributor.localId | A05971 | - |
dc.contributor.localId | A02234 | - |
dc.identifier.url | https://ieeexplore.ieee.org/document/10484486 | - |
dc.subject.keyword | Applications | - |
dc.subject.keyword | Biomedical / healthcare / medicine | - |
dc.subject.keyword | Algorithms | - |
dc.subject.keyword | Vision + language and/or other modalities | - |
dc.contributor.alternativeName | Kim, Hwiyoung | - |
dc.contributor.affiliatedAuthor | 김휘영 | - |
dc.contributor.affiliatedAuthor | 안성수 | - |
dc.citation.startPage | 7921 | - |
dc.citation.endPage | 7930 | - |
dc.identifier.bibliographicCitation | 2024 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION, WACV 2024, : 7921-7930, 2024-04 | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.