148 298

Cited 12 times in

Analyzing adverse drug reaction using statistical and machine learning methods: A systematic review

 Hae Reong Kim  ;  MinDong Sung  ;  Ji Ae Park  ;  Kyeongseob Jeong  ;  Ho Heon Kim  ;  Suehyun Lee  ;  Yu Rang Park 
 MEDICINE, Vol.101(25) : e29387, 2022-06 
Journal Title
Issue Date
Adverse Drug Reaction Reporting Systems* ; Databases, Factual ; Drug-Related Side Effects and Adverse Reactions* / epidemiology ; Electronic Health Records ; Humans ; Machine Learning
Background: Adverse drug reactions (ADRs) are unintended negative drug-induced responses. Determining the association between drugs and ADRs is crucial, and several methods have been proposed to demonstrate this association. This systematic review aimed to examine the analytical tools by considering original articles that utilized statistical and machine learning methods for detecting ADRs.

Methods: A systematic literature review was conducted based on articles published between 2015 and 2020. The keywords used were statistical, machine learning, and deep learning methods for detecting ADR signals. The study was conducted according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses statement (PRISMA) guidelines.

Results: We reviewed 72 articles, of which 51 and 21 addressed statistical and machine learning methods, respectively. Electronic medical record (EMR) data were exclusively analyzed using the regression method. For FDA Adverse Event Reporting System (FAERS) data, components of the disproportionality method were preferable. DrugBank was the most used database for machine learning. Other methods accounted for the highest and supervised methods accounted for the second highest.

Conclusions: Using the 72 main articles, this review provides guidelines on which databases are frequently utilized and which analysis methods can be connected. For statistical analysis, >90% of the cases were analyzed by disproportionate or regression analysis with each spontaneous reporting system (SRS) data or electronic medical record (EMR) data; for machine learning research, however, there was a strong tendency to analyze various data combinations. Only half of the DrugBank database was occupied, and the k-nearest neighbor method accounted for the greatest proportion.
Files in This Item:
T202204526.pdf Download
Appears in Collections:
1. College of Medicine (의과대학) > Dept. of Biomedical Systems Informatics (의생명시스템정보학교실) > 1. Journal Papers
1. College of Medicine (의과대학) > Dept. of Internal Medicine (내과학교실) > 1. Journal Papers
Yonsei Authors
Park, Yu Rang(박유랑) ORCID logo https://orcid.org/0000-0002-4210-2094
Sung, MinDong(성민동) ORCID logo https://orcid.org/0000-0002-5217-8877
사서에게 알리기


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.