520 632

Cited 0 times in

이분형 자료의 분류문제에서 불균형을 다루기 위한 표본재추출 방법 비교

DC Field Value Language
dc.contributor.author정인경-
dc.date.accessioned2019-09-20T07:53:54Z-
dc.date.available2019-09-20T07:53:54Z-
dc.date.issued2019-
dc.identifier.issn1225-066X-
dc.identifier.urihttps://ir.ymlib.yonsei.ac.kr/handle/22282913/171105-
dc.description.abstractA class imbalance problem arises when one class outnumbers the other class by a large proportion in binary data. Studies such as transforming the learning data have been conducted to solve this imbalance problem. In this study, we compared resampling methods among methods to deal with an imbalance in the classification problem. We sought to find a way to more effectively detect the minority class in the data. Through simulation, a total of 20 methods of over-sampling, under-sampling, and combined method of over- and under-sampling were compared. The logistic regression, support vector machine, and random forest models, which are commonly used in classification problems, were used as classifiers. The simulation results showed that the random under sampling (RUS) method had the highest sensitivity with an accuracy over 0.5. The next most sensitive method was an over-sampling adaptive synthetic sampling approach. This revealed that the RUS method was suitable for finding minority class values. The results of applying to some real data sets were similar to those of the simulation.-
dc.description.statementOfResponsibilityrestriction-
dc.formatapplication/pdf-
dc.languageKorean-
dc.publisher한국통계학회-
dc.relation.isPartOfKorean Journal of Applied Statistics (응용통계연구)-
dc.rightsCC BY-NC-ND 2.0 KR-
dc.title이분형 자료의 분류문제에서 불균형을 다루기 위한 표본재추출 방법 비교-
dc.title.alternativeComparison of resampling methods for dealing with imbalanced data in binary classification problem-
dc.typeArticle-
dc.contributor.collegeCollege of Medicine (의과대학)-
dc.contributor.departmentDept. of Biomedical Systems Informatics (의생명시스템정보학교실)-
dc.contributor.googleauthor박근우-
dc.contributor.googleauthor정인경-
dc.identifier.doi10.5351/KJAS.2019.32.3.349-
dc.contributor.localIdA03693-
dc.relation.journalcodeJ01964-
dc.identifier.urlhttp://kiss.kstudy.com/thesis/thesis-view.asp?key=3687102-
dc.contributor.alternativeNameJung, In Kyung-
dc.contributor.affiliatedAuthor정인경-
dc.citation.volume32-
dc.citation.number3-
dc.citation.startPage349-
dc.citation.endPage374-
dc.identifier.bibliographicCitationKorean Journal of Applied Statistics (응용통계연구), Vol.32(3) : 349-374, 2019-
dc.identifier.rimsid64528-
dc.type.rimsART-
Appears in Collections:
1. College of Medicine (의과대학) > Dept. of Biomedical Systems Informatics (의생명시스템정보학교실) > 1. Journal Papers

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.