Imbalanced data classification based on hybrid re-sampling and twin support vector machine

Lu Cao, Hong Shen

研究成果: Article同行評審

9 引文 斯高帕斯(Scopus)

摘要

Imbalanced datasets exist widely in real life. The identification of the minority class in imbalanced datasets tends to be the focus of classification. As a variant of enhanced support vector machine (SVM), the twin support vector machine (TWSVM) provides an effective technique for data classification. TWSVM is based on a relative balance in the training sample dataset and distribution to improve the classification accuracy of the whole dataset, however, it is not effective in dealing with imbalanced data classification problems. In this paper, we propose to combine a re-sampling technique, which utilizes over-sampling and under-sampling to balance the training data, with TWSVM to deal with imbalanced data classification. Experimental results show that our proposed approach outperforms other state-of-art methods.

原文English
頁(從 - 到)579-595
頁數17
期刊Computer Science and Information Systems
14
發行號3
DOIs
出版狀態Published - 9月 2017
對外發佈

指紋

深入研究「Imbalanced data classification based on hybrid re-sampling and twin support vector machine」主題。共同形成了獨特的指紋。

引用此