TY - GEN
T1 - Probability Density Estimation over evolving data streams using tilted parzen window
AU - Hong, Shen
AU - Xiao-Long, Yan
PY - 2008
Y1 - 2008
N2 - Probability Density Estimation is a very important technology which has been widely used in data mining and data analysis. In this paper, we generalize the traditional Parzen Window method to data streams and propose a new method of Tilted Parzen Window (TPW) for Probability Density Estimation. To adapt to the evolvement of the data streams, we use the tilted window size that is proportional to data's arrival time. instead of the fixed window size. Theoretical analysis shows that the Tilted Parzen Window method is a valid method for estimating the probability density function (pdf) for data streams. We also propose a new strategy for discarding the historical data in data streams. We prove that this strategy can describe the probability density changes more accurately than the conventional discarding strategy. Empirical results on synthetic data set demonstrate the effectiveness and efficiency of this method.
AB - Probability Density Estimation is a very important technology which has been widely used in data mining and data analysis. In this paper, we generalize the traditional Parzen Window method to data streams and propose a new method of Tilted Parzen Window (TPW) for Probability Density Estimation. To adapt to the evolvement of the data streams, we use the tilted window size that is proportional to data's arrival time. instead of the fixed window size. Theoretical analysis shows that the Tilted Parzen Window method is a valid method for estimating the probability density function (pdf) for data streams. We also propose a new strategy for discarding the historical data in data streams. We prove that this strategy can describe the probability density changes more accurately than the conventional discarding strategy. Empirical results on synthetic data set demonstrate the effectiveness and efficiency of this method.
UR - http://www.scopus.com/inward/record.url?scp=55849107819&partnerID=8YFLogxK
U2 - 10.1109/ISCC.2008.4625751
DO - 10.1109/ISCC.2008.4625751
M3 - Conference contribution
AN - SCOPUS:55849107819
SN - 9781424427031
T3 - Proceedings - IEEE Symposium on Computers and Communications
SP - 585
EP - 589
BT - IEEE Symposium on Computers and Communications 2008, ISCC 2008
T2 - 13th IEEE Symposium on Computers and Communications, ISCC 2008
Y2 - 6 July 2008 through 9 July 2008
ER -