0 5

Cited 0 times in

Cited 0 times in

Liver cancer risk stratification using deep learning on nationwide longitudinal health screening data: a retrospective cohort study

DC Field Value Language
dc.contributor.authorChoi, Yewon-
dc.contributor.authorCho, Sungmin-
dc.contributor.authorGu, Changdai-
dc.contributor.authorKim, Chungho-
dc.contributor.authorPark, Bomi-
dc.contributor.authorKim, Hwiyoung-
dc.contributor.author김휘영-
dc.date.accessioned2026-03-31T02:37:45Z-
dc.date.available2026-03-31T02:37:45Z-
dc.date.created2026-03-20-
dc.date.issued2026-01-
dc.identifier.issn1472-6947-
dc.identifier.urihttps://ir.ymlib.yonsei.ac.kr/handle/22282913/211700-
dc.description.abstractBackgroundCurrent liver cancer screening in Korea focuses on viral hepatitis or cirrhosis, despite rising risks from metabolic and alcohol-related liver disease. We aimed to develop a deep learning model that leverages routinely collected national screening and claims data to predict liver cancer risk without requiring additional diagnostic tests.MethodsWe conducted a retrospective cohort study of 3,962,209 adults aged 50-69 years who participated in the Korean National Health Screening program between 2010 and 2015, with follow-up until December 31, 2021. A total of 12,401 liver cancer cases were identified. Using data from three biennial screenings over 6 years, we developed a one-dimensional convolutional neural network model to predict 5-year liver cancer risk. The cohort was randomly divided at the patient level into training (80%) and testing (20%) sets. Predictors included demographic, clinical, behavioral, anthropometric, and laboratory features. Model performance was compared with logistic regression, extreme gradient boosting, multilayer perceptron, and current national surveillance criteria, assessed by the area under the receiver operating characteristic curve (AUROC), sensitivity, and specificity. Interpretability was examined using SHapley values and Cox regression, and sensitivity analyses evaluated the impact of screening timing.ResultsOur model achieved an AUROC of 0.810 (95% CI, 0.802-0.818) and an AUPRC of 0.029 (95% CI, 0.026-0.034), with a sensitivity of 0.736 (95% CI, 0.720-0.753), clearly outperforming the current national criteria which showed an AUROC of 0.552 (95% CI, 0.546-0.558), an AUPRC of 0.007 (95% CI, 0.006-0.008), and a sensitivity of only 0.112 (95% CI, 0.100-0.125). The top-risk quintile accounted for 65% of incident liver cancer cases and had a 27-fold higher hazard compared to the lowest-risk group. Major predictors included age, viral hepatitis, family history of liver cancer, cholesterol levels, alcohol consumption, and metabolic factors. Sensitivity analyses demonstrated that incorporating all three screening time points yielded the highest overall performance.ConclusionsApplying a deep learning model to routinely collected national screening data improved liver cancer risk stratification and enabled early identification of high-risk individuals, including those without prior liver disease. This approach supports scalable, policy-relevant screening strategies within existing public health infrastructure.Trial registrationNot applicable.-
dc.languageEnglish-
dc.publisherBioMed Central-
dc.relation.isPartOfBMC MEDICAL INFORMATICS AND DECISION MAKING-
dc.relation.isPartOfBMC MEDICAL INFORMATICS AND DECISION MAKING-
dc.subject.MESHAged-
dc.subject.MESHDeep Learning*-
dc.subject.MESHEarly Detection of Cancer* / statistics & numerical data-
dc.subject.MESHFemale-
dc.subject.MESHHumans-
dc.subject.MESHLiver Neoplasms* / diagnosis-
dc.subject.MESHLiver Neoplasms* / epidemiology-
dc.subject.MESHMale-
dc.subject.MESHMass Screening / statistics & numerical data-
dc.subject.MESHMiddle Aged-
dc.subject.MESHRepublic of Korea / epidemiology-
dc.subject.MESHRetrospective Studies-
dc.subject.MESHRisk Assessment / methods-
dc.titleLiver cancer risk stratification using deep learning on nationwide longitudinal health screening data: a retrospective cohort study-
dc.typeArticle-
dc.contributor.googleauthorChoi, Yewon-
dc.contributor.googleauthorCho, Sungmin-
dc.contributor.googleauthorGu, Changdai-
dc.contributor.googleauthorKim, Chungho-
dc.contributor.googleauthorPark, Bomi-
dc.contributor.googleauthorKim, Hwiyoung-
dc.identifier.doi10.1186/s12911-025-03323-x-
dc.relation.journalcodeJ00363-
dc.identifier.eissn1472-6947-
dc.identifier.pmid41547794-
dc.subject.keywordHCC-
dc.subject.keywordMachine learning-
dc.subject.keywordLiver neoplasms-
dc.subject.keywordLifestyle factor-
dc.subject.keywordCNN-
dc.subject.keywordPrediction-
dc.contributor.affiliatedAuthorChoi, Yewon-
dc.contributor.affiliatedAuthorCho, Sungmin-
dc.contributor.affiliatedAuthorGu, Changdai-
dc.contributor.affiliatedAuthorKim, Hwiyoung-
dc.identifier.scopusid2-s2.0-105029716742-
dc.identifier.wosid001688842300001-
dc.citation.volume26-
dc.citation.number1-
dc.identifier.bibliographicCitationBMC MEDICAL INFORMATICS AND DECISION MAKING, Vol.26(1), 2026-01-
dc.identifier.rimsid92138-
dc.type.rimsART-
dc.description.journalClass1-
dc.description.journalClass1-
dc.subject.keywordAuthorHCC-
dc.subject.keywordAuthorMachine learning-
dc.subject.keywordAuthorLiver neoplasms-
dc.subject.keywordAuthorLifestyle factor-
dc.subject.keywordAuthorCNN-
dc.subject.keywordAuthorPrediction-
dc.subject.keywordPlusHEPATOCELLULAR-CARCINOMA RISK-
dc.subject.keywordPlusMODERATE ALCOHOL-CONSUMPTION-
dc.subject.keywordPlusINSULIN SENSITIVITY-
dc.subject.keywordPlusPOSTMENOPAUSAL WOMEN-
dc.subject.keywordPlusMEN-
dc.subject.keywordPlusMETAANALYSIS-
dc.subject.keywordPlusMORTALITY-
dc.subject.keywordPlusMODELS-
dc.subject.keywordPlusDIET-
dc.type.docTypeArticle-
dc.description.isOpenAccessY-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalWebOfScienceCategoryMedical Informatics-
dc.relation.journalResearchAreaMedical Informatics-
dc.identifier.articleno44-
Appears in Collections:
1. College of Medicine (의과대학) > Dept. of Neurosurgery (신경외과학교실) > 1. Journal Papers

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.