Virtual Sample Generation Approach for Imbalanced Classification

Cao Lu, Hong Shen

研究成果: Conference contribution同行評審

5 引文 斯高帕斯(Scopus)

摘要

Imbalanced classification problem is a hot topic in machine learning and data mining. The traditional classification algorithms assume that class distribution is balanced and the effect is not ideal when handling imbalanced datasets. In this paper, the support vector machine is used as basic classifier and a virtual sample generation method based on support vector is proposed to solve the problem of imbalanced classification and to improve the recognition rate of the minority class according to the characteristic that support vector machine is a classifier that relies heavily on support vectors. Firstly, support vector machine is used to learn training set to obtain support vectors of the minority class. Then, a certain number of virtual samples are generated around the support vector of the minority samples through the smoothness hypothesis to balance the data set. The generated samples can conform to the statistical characteristics of the original training data, which proves the rationality of the generated virtual samples. Finally, the new dataset is learned by support vector machine. Experimental results show that the method is effective in both artificial datasets and UCI standard datasets.

原文English
主出版物標題Proceedings - 2018 9th International Conference on Parallel Architectures, Algorithms and Programming, PAAP 2018
發行者IEEE Computer Society
頁面177-182
頁數6
ISBN(電子)9781538694039
DOIs
出版狀態Published - 2 7月 2018
對外發佈
事件9th International Conference on Parallel Architectures, Algorithms and Programming, PAAP 2018 - Taipei, Taiwan, Province of China
持續時間: 26 12月 201828 12月 2018

出版系列

名字Proceedings - International Symposium on Parallel Architectures, Algorithms and Programming, PAAP
2018-December
ISSN(列印)2168-3034
ISSN(電子)2168-3042

Conference

Conference9th International Conference on Parallel Architectures, Algorithms and Programming, PAAP 2018
國家/地區Taiwan, Province of China
城市Taipei
期間26/12/1828/12/18

指紋

深入研究「Virtual Sample Generation Approach for Imbalanced Classification」主題。共同形成了獨特的指紋。

引用此