7 128

Cited 0 times in

Using gut microbiome metagenomic hypervariable features for diabetes screening and typing through supervised machine learning

DC Field Value Language
dc.contributor.author김주영-
dc.contributor.author최준호-
dc.contributor.author조윤희-
dc.contributor.author이명희-
dc.date.accessioned2025-06-27T02:20:33Z-
dc.date.available2025-06-27T02:20:33Z-
dc.date.issued2025-03-
dc.identifier.urihttps://ir.ymlib.yonsei.ac.kr/handle/22282913/205924-
dc.description.abstractDiabetes mellitus is a complex metabolic disorder and one of the fastest-growing global public health concerns. The gut microbiota is implicated in the pathophysiology of various diseases, including diabetes. This study utilized 16S rRNA metagenomic data from a volunteer citizen science initiative to investigate microbial markers associated with diabetes status (positive or negative) and type (type 1 or type 2 diabetes mellitus) using supervised machine learning (ML) models. The diversity of the microbiome varied according to diabetes status and type. Differential microbial signatures between diabetes types and negative group revealed an increased presence of Brucellaceae, Ruminococcaceae, Clostridiaceae, Micrococcaceae, Barnesiellaceae and Fusobacteriaceae in subjects with diabetes type 1, and Veillonellaceae, Streptococcaceae and the order Gammaproteobacteria in subjects with diabetes type 2. The decision tree, elastic net, random forest (RF) and support vector machine with radial kernel ML algorithms were trained to screen and type diabetes based on microbial profiles of 76 subjects with type 1 diabetes, 366 subjects with type 2 diabetes and 250 subjects without diabetes. Using the 1000 most variable features, tree-based models were the highest-performing algorithms. The RF screening models achieved the best performance, with an average area under the receiver operating characteristic curve (AUC) of 0.76, although all models lacked sensitivity. Reducing the dataset to 500 features produced an AUC of 0.77 with sensitivity increasing by 74% from 0.46 to 0.80. Model performance improved for the classification of negative-status and type 2 diabetes. Diabetes type models performed best with 500 features, but the metric performed poorly across all model iterations. ML has the potential to facilitate early diagnosis of diabetes based on microbial profiles of the gut microbiome.-
dc.description.statementOfResponsibilityopen-
dc.formatapplication/pdf-
dc.languageEnglish-
dc.publisherMicrobiology Society-
dc.relation.isPartOfMICROBIAL GENOMICS-
dc.rightsCC BY-NC-ND 2.0 KR-
dc.subject.MESHAdult-
dc.subject.MESHBacteria* / classification-
dc.subject.MESHBacteria* / genetics-
dc.subject.MESHDiabetes Mellitus, Type 1* / diagnosis-
dc.subject.MESHDiabetes Mellitus, Type 1* / microbiology-
dc.subject.MESHDiabetes Mellitus, Type 2* / diagnosis-
dc.subject.MESHDiabetes Mellitus, Type 2* / microbiology-
dc.subject.MESHFemale-
dc.subject.MESHGastrointestinal Microbiome* / genetics-
dc.subject.MESHHumans-
dc.subject.MESHMachine Learning-
dc.subject.MESHMale-
dc.subject.MESHMetagenome-
dc.subject.MESHMetagenomics* / methods-
dc.subject.MESHMiddle Aged-
dc.subject.MESHRNA, Ribosomal, 16S / genetics-
dc.subject.MESHSupervised Machine Learning*-
dc.subject.MESHSupport Vector Machine-
dc.titleUsing gut microbiome metagenomic hypervariable features for diabetes screening and typing through supervised machine learning-
dc.typeArticle-
dc.contributor.collegeCollege of Medicine (의과대학)-
dc.contributor.departmentDept. of Tropical Medicine (열대의학교실)-
dc.contributor.googleauthorXavier Chavarria-
dc.contributor.googleauthorHyun Seo Park-
dc.contributor.googleauthorSingeun Oh-
dc.contributor.googleauthorDongjun Kang-
dc.contributor.googleauthorJun Ho Choi-
dc.contributor.googleauthorMyungjun Kim-
dc.contributor.googleauthorYoon Hee Cho-
dc.contributor.googleauthorMyung-Hee Yi-
dc.contributor.googleauthorJu Yeong Kim-
dc.identifier.doi10.1099/mgen.0.001365-
dc.contributor.localIdA00937-
dc.relation.journalcodeJ04217-
dc.identifier.eissn2057-5858-
dc.identifier.pmid40063675-
dc.subject.keyworddiabetes mellitus-
dc.subject.keywordgut microbiome-
dc.subject.keywordmetabarcoding-
dc.subject.keywordmicrobial markers-
dc.subject.keywordrandom forest-
dc.subject.keywordsupervised machine learning-
dc.contributor.alternativeNameKim, Ju Yeong-
dc.contributor.affiliatedAuthor김주영-
dc.citation.volume11-
dc.citation.number3-
dc.citation.startPage001365-
dc.identifier.bibliographicCitationMICROBIAL GENOMICS, Vol.11(3) : 001365, 2025-03-
Appears in Collections:
1. College of Medicine (의과대학) > Dept. of Tropica Medicine (열대의학교실) > 1. Journal Papers

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.