A clustering algorithm based on density-grid for stream data

Dandan Zhang, Hui Tian, Yingpeng Sang, Yidong Li, Yanbo Wu, Jun Wu, Hong Shen

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Citations (Scopus)

Abstract

Many real applications, such as network traffic monitoring, intrusion detection, satellite remote sensing, and electronic business, generate data in the form of a stream arriving continuously at high speed. Clustering is an important data analysis tool for knowledge discovery. Compared with traditional clustering algorithms, clustering stream data is an important and challenging problem which has attracted many researchers. Clustering stream data is facing two main challenges. First, as the data is continuously arriving with high rate and the computer storage capacity is limited, raw data can only be scaned in one pass. Second, stream data is always changing with time, so viewing a data stream as a set of static data can deteriorate the clustering quality. In fact, users are more concerned with the evolving behaviors of clusters which can help people making correct decisions. This paper proposes a density-grid based clustering algorithm, PKS-Stream-I, for stream data. It is an optimization of PKS-Stream in density detection period selection, sporadic grid detection and removal. Empirical results show the proposed method yields out better performance.

Original languageEnglish
Title of host publicationProceedings - 13th International Conference on Parallel and Distributed Computing, Applications, and Technologies, PDCAT 2012
Pages398-403
Number of pages6
DOIs
Publication statusPublished - 2012
Externally publishedYes
Event13th International Conference on Parallel and Distributed Computing, Applications, and Technologies, PDCAT 2012 - Beijing, China
Duration: 14 Dec 201216 Dec 2012

Publication series

NameParallel and Distributed Computing, Applications and Technologies, PDCAT Proceedings

Conference

Conference13th International Conference on Parallel and Distributed Computing, Applications, and Technologies, PDCAT 2012
Country/TerritoryChina
CityBeijing
Period14/12/1216/12/12

Keywords

  • Clustering
  • Index Tree
  • density-grid
  • stream data

Fingerprint

Dive into the research topics of 'A clustering algorithm based on density-grid for stream data'. Together they form a unique fingerprint.

Cite this