345 573

Cited 104 times in

Prediction model development of late-onset preeclampsia using machine learning-based methods

Authors
 Jong Hyun Jhee  ;  SungHee Lee  ;  Yejin Park  ;  Sang Eun Lee  ;  Young Ah Kim  ;  Shin-Wook Kang  ;  Ja-Young Kwon  ;  Jung Tak Park 
Citation
 PLoS One, Vol.14(8) : e0221202., 2019 
Journal Title
PLOS ONE
Issue Date
2019
Abstract
Preeclampsia is one of the leading causes of maternal and fetal morbidity and mortality. Due to the lack of effective preventive measures, its prediction is essential to its prompt management. This study aimed to develop models using machine learning to predict late-onset preeclampsia using hospital electronic medical record data. The performance of the machine learning based models and models using conventional statistical methods were also compared. A total of 11,006 pregnant women who received antenatal care at Yonsei University Hospital were included. Maternal data were retrieved from electronic medical records during the early second trimester to 34 weeks. The prediction outcome was late-onset preeclampsia occurrence after 34 weeks' gestation. Pattern recognition and cluster analysis were used to select the parameters included in the prediction models. Logistic regression, decision tree model, naïve Bayes classification, support vector machine, random forest algorithm, and stochastic gradient boosting method were used to construct the prediction models. C-statistics was used to assess the performance of each model. The overall preeclampsia development rate was 4.7% (474 patients). Systolic blood pressure, serum blood urea nitrogen and creatinine levels, platelet counts, serum potassium level, white blood cell count, serum calcium level, and urinary protein were the most influential variables included in the prediction models. C-statistics for the decision tree model, naïve Bayes classification, support vector machine, random forest algorithm, stochastic gradient boosting method, and logistic regression models were 0.857, 0.776, 0.573, 0.894, 0.924, and 0.806, respectively. The stochastic gradient boosting model had the best prediction performance with an accuracy and false positive rate of 0.973 and 0.009, respectively. The combined use of maternal factors and common antenatal laboratory data of the early second trimester through early third trimester could effectively predict late-onset preeclampsia using machine learning algorithms. Future prospective studies are needed to verify the clinical applicability algorithms.
Files in This Item:
T201903259.pdf Download
DOI
10.1371/journal.pone.0221202
Appears in Collections:
1. College of Medicine (의과대학) > Dept. of Internal Medicine (내과학교실) > 1. Journal Papers
1. College of Medicine (의과대학) > Dept. of Obstetrics and Gynecology (산부인과학교실) > 1. Journal Papers
1. College of Medicine (의과대학) > Dept. of Radiology (영상의학교실) > 1. Journal Papers
Yonsei Authors
Kang, Shin Wook(강신욱) ORCID logo https://orcid.org/0000-0002-5677-4756
Kwon, Ja Young(권자영) ORCID logo https://orcid.org/0000-0003-3009-6325
Kim, Young Ah(김영아)
Park, Yejin(박예진) ORCID logo https://orcid.org/0000-0002-0545-7267
Park, Jung Tak(박정탁) ORCID logo https://orcid.org/0000-0002-2325-8982
Jhee, Jong Hyun(지종현)
URI
https://ir.ymlib.yonsei.ac.kr/handle/22282913/171303
사서에게 알리기
  feedback

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Browse

Links