AUTHORS: Kittipat Savetratanakaree (Member, IAENG), Kingkarn Sookhanaphibarn, Sarun Intakosum, and Ruck Thawonmas

ABSTRACT: In this paper, we propose a new approach that over-samples minority-class instances along the borderline, using the Euclidean distance in the feature space, to improve support vector machine (SVM) performance in imbalanced data environments. SVM has been an outstandingly successful classifier in a wide variety of applications where a balanced class distribution is assumed. However, SVM is ineffective on imbalanced datasets, in which the majority-class instances far outnumber the minority-class instances. Our new approach, called Borderline Over-sampling in the Feature Space, handles imbalanced data so that SVM recognizes new minority-class instances more effectively. In our class prediction experiments, the proposed approach performs better than the existing SMOTE, Borderline-SMOTE and borderline over-sampling methods in terms of the g-mean and F-measure.
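
The two quantities the abstract relies on, a Euclidean distance measured in the feature space and the g-mean and F-measure scores, can be illustrated with a minimal Python sketch. This is not the authors' implementation. It assumes an RBF kernel with an arbitrary `gamma`, uses the standard kernel identity ||phi(x) - phi(y)||^2 = K(x, x) - 2K(x, y) + K(y, y), and takes the g-mean as the geometric mean of sensitivity and specificity, which is the usual definition. All function names are hypothetical.

```python
import math


def rbf_kernel(x, y, gamma=0.5):
    """RBF kernel; the kernel choice and gamma are assumptions for illustration."""
    sq = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq)


def feature_space_distance(x, y, kernel=rbf_kernel):
    """Euclidean distance between phi(x) and phi(y), computed from kernel values only."""
    sq = kernel(x, x) - 2.0 * kernel(x, y) + kernel(y, y)
    return math.sqrt(max(sq, 0.0))  # clamp tiny negative rounding errors


def g_mean_and_f_measure(y_true, y_pred, positive=1):
    """Return (g-mean, F-measure), treating `positive` as the minority class."""
    tp = fp = tn = fn = 0
    for t, p in zip(y_true, y_pred):
        if t == positive and p == positive:
            tp += 1
        elif t == positive:
            fn += 1
        elif p == positive:
            fp += 1
        else:
            tn += 1
    sensitivity = tp / (tp + fn) if tp + fn else 0.0  # recall on the minority class
    specificity = tn / (tn + fp) if tn + fp else 0.0  # recall on the majority class
    precision = tp / (tp + fp) if tp + fp else 0.0
    g_mean = math.sqrt(sensitivity * specificity)
    denom = precision + sensitivity
    f_measure = 2.0 * precision * sensitivity / denom if denom else 0.0
    return g_mean, f_measure


if __name__ == "__main__":
    print(feature_space_distance([0.0, 0.0], [1.0, 1.0]))
    print(g_mean_and_f_measure([1, 1, 0, 0, 0, 0], [1, 0, 0, 0, 0, 1]))
```

Both scores reward correct predictions on the minority class, so a classifier that labels everything as the majority class gets a g-mean and F-measure of zero even though its plain accuracy may be high.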

Keywords: Borderline Over-sampling in the Feature Space, Imbalanced Dataset, Over-sampling, SVM in Imbalanced Data Environments

LINK: http://mit.itu.bu.ac.th/publications/IAENG_Kittipat_Over-sampling2.pdf

REFERENCES: 

MLA: Savetratanakaree, Kittipat, et al. "Borderline Over-sampling in Feature Space for Learning Algorithms in Imbalanced Data Environments." IAENG International Journal of Computer Science 43.3 (2016).
APA: Savetratanakaree, K., Sookhanaphibarn, K., Intakosum, S., & Thawonmas, R. (2016). Borderline Over-sampling in Feature Space for Learning Algorithms in Imbalanced Data Environments. IAENG International Journal of Computer Science, 43(3).
ISO 690: SAVETRATANAKAREE, Kittipat, et al. Borderline Over-sampling in Feature Space for Learning Algorithms in Imbalanced Data Environments. IAENG International Journal of Computer Science, 2016, 43.3.