TY - GEN
T1 - A clustering algorithm based on density-grid for stream data
AU - Zhang, Dandan
AU - Tian, Hui
AU - Sang, Yingpeng
AU - Li, Yidong
AU - Wu, Yanbo
AU - Wu, Jun
AU - Shen, Hong
PY - 2012
Y1 - 2012
N2 - Many real applications, such as network traffic monitoring, intrusion detection, satellite remote sensing, and electronic business, generate data in the form of a stream arriving continuously at high speed. Clustering is an important data analysis tool for knowledge discovery. Compared with traditional clustering algorithms, clustering stream data is an important and challenging problem which has attracted many researchers. Clustering stream data is facing two main challenges. First, as the data is continuously arriving with high rate and the computer storage capacity is limited, raw data can only be scaned in one pass. Second, stream data is always changing with time, so viewing a data stream as a set of static data can deteriorate the clustering quality. In fact, users are more concerned with the evolving behaviors of clusters which can help people making correct decisions. This paper proposes a density-grid based clustering algorithm, PKS-Stream-I, for stream data. It is an optimization of PKS-Stream in density detection period selection, sporadic grid detection and removal. Empirical results show the proposed method yields out better performance.
AB - Many real applications, such as network traffic monitoring, intrusion detection, satellite remote sensing, and electronic business, generate data in the form of a stream arriving continuously at high speed. Clustering is an important data analysis tool for knowledge discovery. Compared with traditional clustering algorithms, clustering stream data is an important and challenging problem which has attracted many researchers. Clustering stream data is facing two main challenges. First, as the data is continuously arriving with high rate and the computer storage capacity is limited, raw data can only be scaned in one pass. Second, stream data is always changing with time, so viewing a data stream as a set of static data can deteriorate the clustering quality. In fact, users are more concerned with the evolving behaviors of clusters which can help people making correct decisions. This paper proposes a density-grid based clustering algorithm, PKS-Stream-I, for stream data. It is an optimization of PKS-Stream in density detection period selection, sporadic grid detection and removal. Empirical results show the proposed method yields out better performance.
KW - Clustering
KW - Index Tree
KW - density-grid
KW - stream data
UR - http://www.scopus.com/inward/record.url?scp=84884649719&partnerID=8YFLogxK
U2 - 10.1109/PDCAT.2012.13
DO - 10.1109/PDCAT.2012.13
M3 - Conference contribution
AN - SCOPUS:84884649719
SN - 9780769548791
T3 - Parallel and Distributed Computing, Applications and Technologies, PDCAT Proceedings
SP - 398
EP - 403
BT - Proceedings - 13th International Conference on Parallel and Distributed Computing, Applications, and Technologies, PDCAT 2012
T2 - 13th International Conference on Parallel and Distributed Computing, Applications, and Technologies, PDCAT 2012
Y2 - 14 December 2012 through 16 December 2012
ER -