0 0

Cited 0 times in

Cited 0 times in

Machine learning‐based early prediction of asthma in preschoolers: The COCOA birth cohort study

Authors
 Chang Hoon Han  ;  Seok-Jae Heo  ;  Haerin Jang  ;  So-Yeon Lee  ;  Ji Soo Park  ;  Dong In Suh  ;  Youn Ho Shin  ;  Jihyun Kim  ;  Kangmo Ahn  ;  Myung Hyun Sohn  ;  Eom Ji Choi  ;  Sun Hee Choi  ;  Hey-Sung Baek  ;  Soo-Jong Hong  ;  Kyung Won Kim  ;  Inkyung Jung  ;  Soo Yeon Kim 
Citation
 PEDIATRIC ALLERGY AND IMMUNOLOGY, Vol.36(10) : e70223, 2025-10 
Journal Title
PEDIATRIC ALLERGY AND IMMUNOLOGY
ISSN
 0905-6157 
Issue Date
2025-10
MeSH
Asthma* / diagnosis ; Asthma* / epidemiology ; Birth Cohort ; Child, Preschool ; Cohort Studies ; Female ; Follow-Up Studies ; Humans ; Infant ; Machine Learning* ; Male ; Prospective Studies ; Republic of Korea / epidemiology ; Surveys and Questionnaires
Keywords
asthma ; birth cohort ; child ; machine learning ; preschool
Abstract
Background: Early prediction of asthma in preschoolers, which is crucial for timely intervention, remains challenging. This study aimed to develop a machine learning (ML)-based model and a questionnaire-based scoring tool for the prediction of asthma at age 3 years.

Methods: Data from the COhort for Childhood Origin of Asthma and allergic diseases (COCOA), a comprehensive prospective birth cohort in South Korea, was used. Children with complete 3-year follow-up (n = 2007) were divided into development (n = 1472) and validation (n = 535) cohorts based on birth year. Asthma diagnosis at age 3 years was based on physician diagnosis, recurrent wheezing episodes, asthma treatment, or parental reports. Random Forest-based predictive models were developed using data collected until the age of 2 years, initially selecting features via least absolute shrinkage and selection operator (LASSO) regression. A questionnaire-based scoring tool was also developed and compared with multiple ML algorithms.

Results: The ML-based prediction models showed improved performance as the data accumulated. The 6-month, 1-year, and 2-year models had area under the receiver operating characteristic curve (AUROC) values of 0.614, 0.726, and 0.774, respectively, in the validation cohort. The performance of the questionnaire-based scoring tool (AUROC, 0.790) was comparable to that of the ML-based model. Important predictors included paternal total IgE levels, maternal iron supplementation during pregnancy, parental asthma history, nut allergy history, and recent lower respiratory infections.

Conclusions: Our study successfully developed robust predictive models for early asthma that demonstrated high performance. The questionnaire-based scoring tool offers particular value because of its clinical applicability. Further validation in diverse populations and investigation of the causative pathways of the identified predictors are necessary to enhance clinical utility.
Files in This Item:
T202507358.pdf Download
DOI
10.1111/pai.70223
Appears in Collections:
1. College of Medicine (의과대학) > Dept. of Pediatrics (소아과학교실) > 1. Journal Papers
1. College of Medicine (의과대학) > Dept. of Biomedical Systems Informatics (의생명시스템정보학교실) > 1. Journal Papers
Yonsei Authors
Kim, Kyung Won(김경원) ORCID logo https://orcid.org/0000-0003-4529-6135
Kim, Soo Yeon(김수연) ORCID logo https://orcid.org/0000-0003-4965-6193
Sohn, Myung Hyun(손명현) ORCID logo https://orcid.org/0000-0002-2478-487X
Jung, Inkyung(정인경) ORCID logo https://orcid.org/0000-0003-3780-3213
URI
https://ir.ymlib.yonsei.ac.kr/handle/22282913/209321
사서에게 알리기
  feedback

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Browse

Links