1 3

Cited 0 times in

Cited 0 times in

SMOTE-augmented machine learning model predicts recurrent and metastatic breast cancer from microbiome analysis

DC Field Value Language
dc.contributor.authorHong, Ji Eun-
dc.contributor.authorKim, Yeon Eun-
dc.contributor.authorKang, Yun Soo-
dc.contributor.authorChoi, Dong Hyeok-
dc.contributor.authorAhn, So Hyun-
dc.contributor.authorAn, Jeongshin-
dc.date.accessioned2026-01-16T05:56:26Z-
dc.date.available2026-01-16T05:56:26Z-
dc.date.created2026-01-02-
dc.date.issued2025-09-
dc.identifier.urihttps://ir.ymlib.yonsei.ac.kr/handle/22282913/209796-
dc.description.abstractRecurrence and metastasis of breast cancer (RMBC) have a decisive impact on patient survival, necessitating reliable biomarkers for its early prediction. This study used machine learning to evaluate blood microbiome profiles as predictive biomarkers of RMBC. A retrospective predictive analysis was conducted on 288 participants, including 96 patients with breast cancer and 192 healthy controls. After 7 years of follow-up, patients were classified into disease-free survival (DFS, n = 88) and RMBC (n = 8) groups. Blood microbiome composition was analysed using 16S rRNA sequencing, followed by quality control. The Synthetic Minority Oversampling Technique (SMOTE) was employed to address class imbalance. Eleven machine learning models were trained using leave-one-out cross-validation (LOOCV) and k-fold cross-validation, and evaluated based on the area under the receiver operating characteristic curve (AUROC), recall, precision, F1-score, and Matthews correlation coefficient (MCC). Alpha diversity was significantly lower in DFS and RMBC groups than in the healthy control group (p < 0.05), while beta diversity analysis revealed distinct clustering. The random forest achieved an AUROC of 0.94, a recall of 0.81, an F1-score of 0.83, and an MCC of 0.88. Enterobacter, Bacteroides, Klebsiella, and Bifidobacterium were among the key microbial genera predicting RMBC in the top five models. Blood microbiome profiling shows potential as a non-invasive RMBC biomarker. Machine learning effectively distinguished RMBC, warranting further validation.-
dc.languageEnglish-
dc.publisherNature Publishing Group-
dc.relation.isPartOfSCIENTIFIC REPORTS-
dc.relation.isPartOfSCIENTIFIC REPORTS-
dc.subject.MESHAdult-
dc.subject.MESHAged-
dc.subject.MESHBreast Neoplasms* / blood-
dc.subject.MESHBreast Neoplasms* / diagnosis-
dc.subject.MESHBreast Neoplasms* / microbiology-
dc.subject.MESHBreast Neoplasms* / pathology-
dc.subject.MESHFemale-
dc.subject.MESHHumans-
dc.subject.MESHMachine Learning*-
dc.subject.MESHMicrobiota* / genetics-
dc.subject.MESHMiddle Aged-
dc.subject.MESHNeoplasm Metastasis-
dc.subject.MESHNeoplasm Recurrence, Local* / microbiology-
dc.subject.MESHPrognosis-
dc.subject.MESHRNA, Ribosomal, 16S / genetics-
dc.subject.MESHROC Curve-
dc.subject.MESHRetrospective Studies-
dc.titleSMOTE-augmented machine learning model predicts recurrent and metastatic breast cancer from microbiome analysis-
dc.typeArticle-
dc.contributor.googleauthorHong, Ji Eun-
dc.contributor.googleauthorKim, Yeon Eun-
dc.contributor.googleauthorKang, Yun Soo-
dc.contributor.googleauthorChoi, Dong Hyeok-
dc.contributor.googleauthorAhn, So Hyun-
dc.contributor.googleauthorAn, Jeongshin-
dc.identifier.doi10.1038/s41598-025-16790-z-
dc.relation.journalcodeJ02646-
dc.identifier.eissn2045-2322-
dc.identifier.pmid41006422-
dc.identifier.urlAdult ; Aged ; Breast Neoplasms* / blood ; Breast Neoplasms* / diagnosis ; Breast Neoplasms* / microbiology ; Breast Neoplasms* / pathology ; Female ; Humans ; Machine Learning* ; Microbiota* / genetics ; Middle Aged ; Neoplasm Metastasis ; Neoplasm Recurrence, Local* / microbiology ; Prognosis ; RNA, Ribosomal, 16S / genetics ; ROC Curve ; Retrospective Studies-
dc.subject.keywordBreast cancer-
dc.subject.keywordRecurrence-
dc.subject.keywordMetastasis-
dc.subject.keywordMicrobiome-
dc.subject.keywordMachine learning-
dc.contributor.affiliatedAuthorChoi, Dong Hyeok-
dc.identifier.scopusid2-s2.0-105017394458-
dc.identifier.wosid001582548400029-
dc.citation.volume15-
dc.citation.number1-
dc.identifier.bibliographicCitationSCIENTIFIC REPORTS, Vol.15(1), 2025-09-
dc.identifier.rimsid90719-
dc.type.rimsART-
dc.description.journalClass1-
dc.description.journalClass1-
dc.subject.keywordAuthorBreast cancer-
dc.subject.keywordAuthorRecurrence-
dc.subject.keywordAuthorMetastasis-
dc.subject.keywordAuthorMicrobiome-
dc.subject.keywordAuthorMachine learning-
dc.subject.keywordPlusPROGESTERONE-
dc.type.docTypeArticle-
dc.description.isOpenAccessY-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalWebOfScienceCategoryMultidisciplinary Sciences-
dc.relation.journalResearchAreaScience & Technology - Other Topics-
dc.identifier.articleno33096-
Appears in Collections:
1. College of Medicine (의과대학) > Others (기타) > 1. Journal Papers

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.