SMOTE-augmented machine learning model predicts recurrent and metastatic breast cancer from microbiome analysis
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Hong, Ji Eun | - |
| dc.contributor.author | Kim, Yeon Eun | - |
| dc.contributor.author | Kang, Yun Soo | - |
| dc.contributor.author | Choi, Dong Hyeok | - |
| dc.contributor.author | Ahn, So Hyun | - |
| dc.contributor.author | An, Jeongshin | - |
| dc.date.accessioned | 2026-01-16T05:56:26Z | - |
| dc.date.available | 2026-01-16T05:56:26Z | - |
| dc.date.created | 2026-01-02 | - |
| dc.date.issued | 2025-09 | - |
| dc.identifier.uri | https://ir.ymlib.yonsei.ac.kr/handle/22282913/209796 | - |
| dc.description.abstract | Recurrence and metastasis of breast cancer (RMBC) have a decisive impact on patient survival, necessitating reliable biomarkers for its early prediction. This study used machine learning to evaluate blood microbiome profiles as predictive biomarkers of RMBC. A retrospective predictive analysis was conducted on 288 participants, including 96 patients with breast cancer and 192 healthy controls. After 7 years of follow-up, patients were classified into disease-free survival (DFS, n = 88) and RMBC (n = 8) groups. Blood microbiome composition was analysed using 16S rRNA sequencing, followed by quality control. The Synthetic Minority Oversampling Technique (SMOTE) was employed to address class imbalance. Eleven machine learning models were trained using leave-one-out cross-validation (LOOCV) and k-fold cross-validation, and evaluated based on the area under the receiver operating characteristic curve (AUROC), recall, precision, F1-score, and Matthews correlation coefficient (MCC). Alpha diversity was significantly lower in DFS and RMBC groups than in the healthy control group (p < 0.05), while beta diversity analysis revealed distinct clustering. The random forest achieved an AUROC of 0.94, a recall of 0.81, an F1-score of 0.83, and an MCC of 0.88. Enterobacter, Bacteroides, Klebsiella, and Bifidobacterium were among the key microbial genera predicting RMBC in the top five models. Blood microbiome profiling shows potential as a non-invasive RMBC biomarker. Machine learning effectively distinguished RMBC, warranting further validation. | - |
| dc.language | English | - |
| dc.publisher | Nature Publishing Group | - |
| dc.relation.isPartOf | SCIENTIFIC REPORTS | - |
| dc.subject.MESH | Adult | - |
| dc.subject.MESH | Aged | - |
| dc.subject.MESH | Breast Neoplasms* / blood | - |
| dc.subject.MESH | Breast Neoplasms* / diagnosis | - |
| dc.subject.MESH | Breast Neoplasms* / microbiology | - |
| dc.subject.MESH | Breast Neoplasms* / pathology | - |
| dc.subject.MESH | Female | - |
| dc.subject.MESH | Humans | - |
| dc.subject.MESH | Machine Learning* | - |
| dc.subject.MESH | Microbiota* / genetics | - |
| dc.subject.MESH | Middle Aged | - |
| dc.subject.MESH | Neoplasm Metastasis | - |
| dc.subject.MESH | Neoplasm Recurrence, Local* / microbiology | - |
| dc.subject.MESH | Prognosis | - |
| dc.subject.MESH | RNA, Ribosomal, 16S / genetics | - |
| dc.subject.MESH | ROC Curve | - |
| dc.subject.MESH | Retrospective Studies | - |
| dc.title | SMOTE-augmented machine learning model predicts recurrent and metastatic breast cancer from microbiome analysis | - |
| dc.type | Article | - |
| dc.contributor.googleauthor | Hong, Ji Eun | - |
| dc.contributor.googleauthor | Kim, Yeon Eun | - |
| dc.contributor.googleauthor | Kang, Yun Soo | - |
| dc.contributor.googleauthor | Choi, Dong Hyeok | - |
| dc.contributor.googleauthor | Ahn, So Hyun | - |
| dc.contributor.googleauthor | An, Jeongshin | - |
| dc.identifier.doi | 10.1038/s41598-025-16790-z | - |
| dc.relation.journalcode | J02646 | - |
| dc.identifier.eissn | 2045-2322 | - |
| dc.identifier.pmid | 41006422 | - |
| dc.subject.keyword | Breast cancer | - |
| dc.subject.keyword | Recurrence | - |
| dc.subject.keyword | Metastasis | - |
| dc.subject.keyword | Microbiome | - |
| dc.subject.keyword | Machine learning | - |
| dc.contributor.affiliatedAuthor | Choi, Dong Hyeok | - |
| dc.identifier.scopusid | 2-s2.0-105017394458 | - |
| dc.identifier.wosid | 001582548400029 | - |
| dc.citation.volume | 15 | - |
| dc.citation.number | 1 | - |
| dc.identifier.bibliographicCitation | SCIENTIFIC REPORTS, Vol.15(1), 2025-09 | - |
| dc.identifier.rimsid | 90719 | - |
| dc.type.rims | ART | - |
| dc.description.journalClass | 1 | - |
| dc.subject.keywordAuthor | Breast cancer | - |
| dc.subject.keywordAuthor | Recurrence | - |
| dc.subject.keywordAuthor | Metastasis | - |
| dc.subject.keywordAuthor | Microbiome | - |
| dc.subject.keywordAuthor | Machine learning | - |
| dc.subject.keywordPlus | PROGESTERONE | - |
| dc.type.docType | Article | - |
| dc.description.isOpenAccess | Y | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalWebOfScienceCategory | Multidisciplinary Sciences | - |
| dc.relation.journalResearchArea | Science & Technology - Other Topics | - |
| dc.identifier.articleno | 33096 | - |
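
The abstract names two techniques that are easier to follow with a small worked example: SMOTE, which balances classes by interpolating new minority samples between existing ones, and the Matthews correlation coefficient (MCC), which summarises a confusion matrix in one number. The sketch below is a minimal standard-library illustration, not the authors' pipeline. The function names `smote` and `mcc`, the neighbour count `k=3`, and the two-feature toy values are all assumptions made for illustration and are not study data. Only the group sizes (8 RMBC, 88 DFS) come from the abstract.

```python
import math
import random


def smote(minority, n_new, k=3, seed=0):
    """Return n_new synthetic samples, each interpolated between a randomly
    chosen minority sample and one of its k nearest minority neighbours."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the remaining minority samples by Euclidean distance to base.
        others = [p for j, p in enumerate(minority) if j != i]
        others.sort(key=lambda p: math.dist(base, p))
        neighbour = rng.choice(others[:k])
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient from confusion-matrix counts."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0


# Hypothetical toy features (e.g. two relative abundances) for 8 minority cases.
minority = [
    [0.10, 0.52], [0.12, 0.48], [0.09, 0.55], [0.15, 0.50],
    [0.11, 0.47], [0.14, 0.53], [0.08, 0.49], [0.13, 0.51],
]

# Oversample the minority class from 8 to 88 to match the majority class size.
synthetic = smote(minority, n_new=80)
balanced_minority = minority + synthetic
print(len(balanced_minority))  # 88

# MCC for a hypothetical confusion matrix (counts invented for illustration).
print(round(mcc(tp=6, tn=85, fp=3, fn=2), 3))
```

As a general caution when combining oversampling with cross-validation, synthetic samples are usually generated from the training folds only, so that no held-out sample influences its own prediction. This record does not state how the study ordered those steps.
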