TY - GEN
T1 - The framework of relative density-based clustering
AU - Cui, Zelin
AU - Shen, Hong
N1 - Publisher Copyright:
© 2017, Springer Nature Singapore Pte Ltd.
PY - 2017
Y1 - 2017
N2 - Density-based clustering, using two-phase scheme which consists of an online component and an offline component, is an effective framework for data stream clustering, it can find arbitrarily shaped clusters and capture the evolving characteristic of real-time data streams accurately. However, the clustering has some deficiencies on offline component. Most algorithm don’t adapt to the unevenly distributed data streams or the multi density distribution of the data streams. Moreover, they only consider the density and centroid to connect the adjacent grid and ignore similarity of attribute value between adjacent grids. In this paper, we calculate the similarity of neighboring grids and take the similarity as a weight that affects the connection of the neighboring grids and propose the relative density-based clustering that cluster the grids based on relative difference model that considers the density, centroid and the weight of similarity between adjacent grids, simply, we connect neighboring grids which are the relative small difference to form clusters on offline component. The experimental results have shown that our algorithm apply to the unevenly distributed data streams and has better clustering quality.
AB - Density-based clustering, using two-phase scheme which consists of an online component and an offline component, is an effective framework for data stream clustering, it can find arbitrarily shaped clusters and capture the evolving characteristic of real-time data streams accurately. However, the clustering has some deficiencies on offline component. Most algorithm don’t adapt to the unevenly distributed data streams or the multi density distribution of the data streams. Moreover, they only consider the density and centroid to connect the adjacent grid and ignore similarity of attribute value between adjacent grids. In this paper, we calculate the similarity of neighboring grids and take the similarity as a weight that affects the connection of the neighboring grids and propose the relative density-based clustering that cluster the grids based on relative difference model that considers the density, centroid and the weight of similarity between adjacent grids, simply, we connect neighboring grids which are the relative small difference to form clusters on offline component. The experimental results have shown that our algorithm apply to the unevenly distributed data streams and has better clustering quality.
KW - Density-based clustering
KW - Relative density-based clustering
KW - Similarity of the neighboring grids
UR - http://www.scopus.com/inward/record.url?scp=85031410384&partnerID=8YFLogxK
U2 - 10.1007/978-981-10-6442-5_31
DO - 10.1007/978-981-10-6442-5_31
M3 - Conference contribution
AN - SCOPUS:85031410384
SN - 9789811064418
T3 - Communications in Computer and Information Science
SP - 343
EP - 352
BT - Parallel Architecture, Algorithm and Programming - 8th International Symposium, PAAP 2017, Proceedings
A2 - Shen, Hong
A2 - Chen, Guoliang
A2 - Chen, Mingrui
PB - Springer Verlag
T2 - 8th International Symposium on Parallel Architectures, Algorithms, and Programming, PAAP 2017
Y2 - 17 June 2017 through 18 June 2017
ER -