A clustering algorithm based on density-grid for stream data

Dandan Zhang, Hui Tian, Yingpeng Sang, Yidong Li, Yanbo Wu, Jun Wu, Hong Shen

研究成果: Conference contribution同行評審

4 引文 斯高帕斯(Scopus)

摘要

Many real applications, such as network traffic monitoring, intrusion detection, satellite remote sensing, and electronic business, generate data in the form of a stream arriving continuously at high speed. Clustering is an important data analysis tool for knowledge discovery. Compared with traditional clustering algorithms, clustering stream data is an important and challenging problem which has attracted many researchers. Clustering stream data is facing two main challenges. First, as the data is continuously arriving with high rate and the computer storage capacity is limited, raw data can only be scaned in one pass. Second, stream data is always changing with time, so viewing a data stream as a set of static data can deteriorate the clustering quality. In fact, users are more concerned with the evolving behaviors of clusters which can help people making correct decisions. This paper proposes a density-grid based clustering algorithm, PKS-Stream-I, for stream data. It is an optimization of PKS-Stream in density detection period selection, sporadic grid detection and removal. Empirical results show the proposed method yields out better performance.

原文English
主出版物標題Proceedings - 13th International Conference on Parallel and Distributed Computing, Applications, and Technologies, PDCAT 2012
頁面398-403
頁數6
DOIs
出版狀態Published - 2012
對外發佈
事件13th International Conference on Parallel and Distributed Computing, Applications, and Technologies, PDCAT 2012 - Beijing, China
持續時間: 14 12月 201216 12月 2012

出版系列

名字Parallel and Distributed Computing, Applications and Technologies, PDCAT Proceedings

Conference

Conference13th International Conference on Parallel and Distributed Computing, Applications, and Technologies, PDCAT 2012
國家/地區China
城市Beijing
期間14/12/1216/12/12

指紋

深入研究「A clustering algorithm based on density-grid for stream data」主題。共同形成了獨特的指紋。

引用此