8 51

Cited 0 times in

Cited 0 times in

Data-driven prediction of cardiovascular and cerebrovascular diseases in a nationwide study

Authors
 Kim, Sehyun  ;  Ryu, Beomsang  ;  Choi, Mingee  ;  Lee, Sangyon  ;  Shin, Jaeyong  ;  Hong, Sok Chul 
Citation
 SCIENTIFIC REPORTS, Vol.15(1), 2025-07 
Article Number
 22585 
Journal Title
SCIENTIFIC REPORTS
Issue Date
2025-07
MeSH
Adult ; Aged ; Cardiovascular Diseases* / epidemiology ; Cerebrovascular Disorders* / epidemiology ; Female ; Humans ; Incidence ; Logistic Models ; Male ; Middle Aged ; ROC Curve ; Risk Factors ; Taiwan / epidemiology
Keywords
Cardiovascular system ; Risk prediction ; Big data ; Health analytics
Abstract
As the importance of the prevention and premanagement of cardiovascular and cerebrovascular diseases continues to emerge, research is being conducted globally to create and compare risk factor prediction models using health examination big data. In this study, health insurance data were used to predict the incidence of cardiocerebrovascular disease using various models and compare the performance of the models on samples with different initial risk levels. This study analyzed data from 410,859 individuals from the National Health Insurance Service between 2002 and 2019. This study deployed various linear models to predict the occurrence of cardiocerebrovascular diseases in two distinct samples. Models based on logistic regression analysis with penalty terms on the objective function were used, and their predictive performances were compared using multiple evaluation metrics, including the area under the receiver operating characteristic curve. The logistic regression model incorporating variables selected by the LASSO algorithm exhibited superior predictive performance relative to other models, although the differences were not statistically significant. The models demonstrated improved performance for samples with higher incidence rates and initial risk levels. This study predicted and compared the incidence of cardiocerebrovascular disease (CCVD) in patients with different health conditions using national sample cohort data from the National Health Insurance Service. The findings underscore the importance of developing diverse models to predict diseases like CCVD, which have high medical costs and incidence rates, thus informing the development of healthcare policies.
Files in This Item:
89598.pdf Download
DOI
10.1038/s41598-025-94888-0
Appears in Collections:
1. College of Medicine (의과대학) > Dept. of Preventive Medicine (예방의학교실) > 1. Journal Papers
Yonsei Authors
Shin, Jae Yong(신재용) ORCID logo https://orcid.org/0000-0002-2955-6382
Choi, Mingee(최민지)
URI
https://ir.ymlib.yonsei.ac.kr/handle/22282913/208014
사서에게 알리기
  feedback

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Browse

Links