Cited 0 times in
Using gut microbiome metagenomic hypervariable features for diabetes screening and typing through supervised machine learning
DC Field | Value | Language |
---|---|---|
dc.contributor.author | 김주영 | - |
dc.contributor.author | 최준호 | - |
dc.contributor.author | 조윤희 | - |
dc.contributor.author | 이명희 | - |
dc.date.accessioned | 2025-06-27T02:20:33Z | - |
dc.date.available | 2025-06-27T02:20:33Z | - |
dc.date.issued | 2025-03 | - |
dc.identifier.uri | https://ir.ymlib.yonsei.ac.kr/handle/22282913/205924 | - |
dc.description.abstract | Diabetes mellitus is a complex metabolic disorder and one of the fastest-growing global public health concerns. The gut microbiota is implicated in the pathophysiology of various diseases, including diabetes. This study utilized 16S rRNA metagenomic data from a volunteer citizen science initiative to investigate microbial markers associated with diabetes status (positive or negative) and type (type 1 or type 2 diabetes mellitus) using supervised machine learning (ML) models. The diversity of the microbiome varied according to diabetes status and type. Differential microbial signatures between diabetes types and negative group revealed an increased presence of Brucellaceae, Ruminococcaceae, Clostridiaceae, Micrococcaceae, Barnesiellaceae and Fusobacteriaceae in subjects with diabetes type 1, and Veillonellaceae, Streptococcaceae and the order Gammaproteobacteria in subjects with diabetes type 2. The decision tree, elastic net, random forest (RF) and support vector machine with radial kernel ML algorithms were trained to screen and type diabetes based on microbial profiles of 76 subjects with type 1 diabetes, 366 subjects with type 2 diabetes and 250 subjects without diabetes. Using the 1000 most variable features, tree-based models were the highest-performing algorithms. The RF screening models achieved the best performance, with an average area under the receiver operating characteristic curve (AUC) of 0.76, although all models lacked sensitivity. Reducing the dataset to 500 features produced an AUC of 0.77 with sensitivity increasing by 74% from 0.46 to 0.80. Model performance improved for the classification of negative-status and type 2 diabetes. Diabetes type models performed best with 500 features, but the metric performed poorly across all model iterations. ML has the potential to facilitate early diagnosis of diabetes based on microbial profiles of the gut microbiome. | - |
dc.description.statementOfResponsibility | open | - |
dc.format | application/pdf | - |
dc.language | English | - |
dc.publisher | Microbiology Society | - |
dc.relation.isPartOf | MICROBIAL GENOMICS | - |
dc.rights | CC BY-NC-ND 2.0 KR | - |
dc.subject.MESH | Adult | - |
dc.subject.MESH | Bacteria* / classification | - |
dc.subject.MESH | Bacteria* / genetics | - |
dc.subject.MESH | Diabetes Mellitus, Type 1* / diagnosis | - |
dc.subject.MESH | Diabetes Mellitus, Type 1* / microbiology | - |
dc.subject.MESH | Diabetes Mellitus, Type 2* / diagnosis | - |
dc.subject.MESH | Diabetes Mellitus, Type 2* / microbiology | - |
dc.subject.MESH | Female | - |
dc.subject.MESH | Gastrointestinal Microbiome* / genetics | - |
dc.subject.MESH | Humans | - |
dc.subject.MESH | Machine Learning | - |
dc.subject.MESH | Male | - |
dc.subject.MESH | Metagenome | - |
dc.subject.MESH | Metagenomics* / methods | - |
dc.subject.MESH | Middle Aged | - |
dc.subject.MESH | RNA, Ribosomal, 16S / genetics | - |
dc.subject.MESH | Supervised Machine Learning* | - |
dc.subject.MESH | Support Vector Machine | - |
dc.title | Using gut microbiome metagenomic hypervariable features for diabetes screening and typing through supervised machine learning | - |
dc.type | Article | - |
dc.contributor.college | College of Medicine (의과대학) | - |
dc.contributor.department | Dept. of Tropical Medicine (열대의학교실) | - |
dc.contributor.googleauthor | Xavier Chavarria | - |
dc.contributor.googleauthor | Hyun Seo Park | - |
dc.contributor.googleauthor | Singeun Oh | - |
dc.contributor.googleauthor | Dongjun Kang | - |
dc.contributor.googleauthor | Jun Ho Choi | - |
dc.contributor.googleauthor | Myungjun Kim | - |
dc.contributor.googleauthor | Yoon Hee Cho | - |
dc.contributor.googleauthor | Myung-Hee Yi | - |
dc.contributor.googleauthor | Ju Yeong Kim | - |
dc.identifier.doi | 10.1099/mgen.0.001365 | - |
dc.contributor.localId | A00937 | - |
dc.relation.journalcode | J04217 | - |
dc.identifier.eissn | 2057-5858 | - |
dc.identifier.pmid | 40063675 | - |
dc.subject.keyword | diabetes mellitus | - |
dc.subject.keyword | gut microbiome | - |
dc.subject.keyword | metabarcoding | - |
dc.subject.keyword | microbial markers | - |
dc.subject.keyword | random forest | - |
dc.subject.keyword | supervised machine learning | - |
dc.contributor.alternativeName | Kim, Ju Yeong | - |
dc.contributor.affiliatedAuthor | 김주영 | - |
dc.citation.volume | 11 | - |
dc.citation.number | 3 | - |
dc.citation.startPage | 001365 | - |
dc.identifier.bibliographicCitation | MICROBIAL GENOMICS, Vol.11(3) : 001365, 2025-03 | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.