Cited 0 times in 
Cited 0 times in 
Enhancing Structured Pathology Report Generation With Foundation Model and Modular Design
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Kim, Kyung A. | - |
| dc.contributor.author | Hong, Sungman | - |
| dc.contributor.author | Yoo, Sehwan | - |
| dc.contributor.author | Kang, Yousun | - |
| dc.contributor.author | Shim, Hyo Sup | - |
| dc.date.accessioned | 2025-11-05T02:04:06Z | - |
| dc.date.available | 2025-11-05T02:04:06Z | - |
| dc.date.created | 2025-09-11 | - |
| dc.date.issued | 2025-07 | - |
| dc.identifier.uri | https://ir.ymlib.yonsei.ac.kr/handle/22282913/208217 | - |
| dc.description.abstract | Pathology report generation, which requires analyzing giga-pixel Whole Slide Images (WSIs), is a complex and labor-intensive task. Despite this, it is crucial for clinical decision-making. An AI-driven report generation system can assist in producing accurate and concise reports. To address these needs, we designed a two-stage framework consisting of image analysis and text generation. This framework improves diagnostic performance and optimizes it for real-world clinical application. In the first stage, using 752 high resolution WSIs from bladder tumor tissue samples collected at Korea University Medical Center, we leveraged foundation models to extract robust features. Then, we built a multi-label classification based on key diagnostic elements to achieve high performance with an Attention-based Multi-Instance Learning (MIL) model and a Transformer-based MIL with Knowledge Distillation. For the report generation stage, we designed a T5-based text-to-text model, simplifying input representations while integrating data augmentation to improve stability and generalization. The proposed model achieved a ROUGE score of 0.87, a BLEU-4 score of 0.94, a Jaccard score of 0.89 and a BioLLM score of 0.97. An additional evaluation conducted by the institution maintaining the exclusive K-MEDICON test datasets confirmed a consistent performance yielding an overall score of 0.88, calculated as a weighted sum of aforementioned metrics. Its consistency validated through a Mean Opinion Score (MOS) evaluation by pathologists. By systematically integrating foundation models and modular structure, the benchmark results demonstrate that the task can be effectively solved with limited data and computational resources, indicating promising potential for real-world clinical adaptation. | - |
| dc.language | English | - |
| dc.publisher | Institute of Electrical and Electronics Engineers | - |
| dc.relation.isPartOf | IEEE ACCESS | - |
| dc.relation.isPartOf | IEEE ACCESS | - |
| dc.title | Enhancing Structured Pathology Report Generation With Foundation Model and Modular Design | - |
| dc.type | Article | - |
| dc.contributor.googleauthor | Kim, Kyung A. | - |
| dc.contributor.googleauthor | Hong, Sungman | - |
| dc.contributor.googleauthor | Yoo, Sehwan | - |
| dc.contributor.googleauthor | Kang, Yousun | - |
| dc.contributor.googleauthor | Shim, Hyo Sup | - |
| dc.identifier.doi | 10.1109/ACCESS.2025.3588121 | - |
| dc.relation.journalcode | J03454 | - |
| dc.identifier.eissn | 2169-3536 | - |
| dc.identifier.url | https://ieeexplore.ieee.org/document/11078283 | - |
| dc.subject.keyword | Pathology | - |
| dc.subject.keyword | Feature extraction | - |
| dc.subject.keyword | Computational modeling | - |
| dc.subject.keyword | Accuracy | - |
| dc.subject.keyword | Transformers | - |
| dc.subject.keyword | Analytical models | - |
| dc.subject.keyword | Biological system modeling | - |
| dc.subject.keyword | Foundation models | - |
| dc.subject.keyword | Complexity theory | - |
| dc.subject.keyword | Attention mechanisms | - |
| dc.subject.keyword | Knowledge distillation | - |
| dc.subject.keyword | multiple instance learning | - |
| dc.subject.keyword | report generation | - |
| dc.subject.keyword | text-to-text transfer transformer | - |
| dc.subject.keyword | whole slide image | - |
| dc.contributor.affiliatedAuthor | Kim, Kyung A. | - |
| dc.contributor.affiliatedAuthor | Shim, Hyo Sup | - |
| dc.identifier.scopusid | 2-s2.0-105012032433 | - |
| dc.identifier.wosid | 001531861100046 | - |
| dc.citation.volume | 13 | - |
| dc.citation.startPage | 121290 | - |
| dc.citation.endPage | 121299 | - |
| dc.identifier.bibliographicCitation | IEEE ACCESS, Vol.13 : 121290-121299, 2025-07 | - |
| dc.identifier.rimsid | 89299 | - |
| dc.type.rims | ART | - |
| dc.description.journalClass | 1 | - |
| dc.description.journalClass | 1 | - |
| dc.subject.keywordAuthor | Pathology | - |
| dc.subject.keywordAuthor | Feature extraction | - |
| dc.subject.keywordAuthor | Computational modeling | - |
| dc.subject.keywordAuthor | Accuracy | - |
| dc.subject.keywordAuthor | Transformers | - |
| dc.subject.keywordAuthor | Analytical models | - |
| dc.subject.keywordAuthor | Biological system modeling | - |
| dc.subject.keywordAuthor | Foundation models | - |
| dc.subject.keywordAuthor | Complexity theory | - |
| dc.subject.keywordAuthor | Attention mechanisms | - |
| dc.subject.keywordAuthor | Knowledge distillation | - |
| dc.subject.keywordAuthor | multiple instance learning | - |
| dc.subject.keywordAuthor | report generation | - |
| dc.subject.keywordAuthor | text-to-text transfer transformer | - |
| dc.subject.keywordAuthor | whole slide image | - |
| dc.type.docType | Article | - |
| dc.description.isOpenAccess | Y | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Information Systems | - |
| dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | - |
| dc.relation.journalWebOfScienceCategory | Telecommunications | - |
| dc.relation.journalResearchArea | Computer Science | - |
| dc.relation.journalResearchArea | Engineering | - |
| dc.relation.journalResearchArea | Telecommunications | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.