Abstract
In the field of data mining, the data classification is an important part of data analysis, which is used to determine the sample category and further extract information and knowledge for the decision-making. K nearest neighbors (KNN) is one kind of the classification methods. Although it can realize the classification without the prior parameter for the data processing, the classification accuracy is not high that the result is not ideal enough. Combining the extension theory and the characteristics of data classification, an extension K nearest neighbors (EKNN) is proposed, in which the matter-element model is used to describe the data in a triple way, the extension distance is applied to realize the calculation of data similarity, and the attribute reduction is introduced for the data-classification. Thought the experiments on three different UCI datasets, EKNN is apparently more effective and extensible than traditional KNN, which has a unified and clear data-description, effective data-classification process and higher classification accuracy.
Original language | English |
---|---|
Pages (from-to) | 403-407 |
Number of pages | 5 |
Journal | Advanced Science Letters |
Volume | 11 |
Issue number | 1 |
DOIs | |
Publication status | Published - May 2012 |
Externally published | Yes |
Keywords
- Data classification
- Extension theory
- K nearest neighbors