跳至主導覽 跳至搜尋 跳過主要內容

Reconstructing data perturbed by random projections when the mixing matrix is known

  • Yingpeng Sang
  • , Hong Shen
  • , Hui Tian

研究成果: Conference contribution同行評審

8 引文 斯高帕斯(Scopus)

摘要

Random Projection (RP) has drawn great interest from the research of privacy-preserving data mining due to its high efficiency and security. It was proposed in [27] where the original data set composed of m attributes, is multiplied with a mixing matrix of dimensions k×m (m;>;k) which is random and orthogonal on expectation, and then the k series of perturbed data are released for mining purposes. To our knowledge little work has been done from the view of the attacker, to reconstruct the original data to get some sensitive information, given the data perturbed by and some priori knowledge, e.g. the mixing matrix, the means and variances of the original data. In the case that the attributes of the original data are mutually independent and sparse, the reconstruction can be treated as a problem of Underdetermined Independent Component Analysis (UICA), but UICA has some permutation and scaling ambiguities. In this paper we propose a reconstruction framework based on UICA and also some techniques to reduce the ambiguities. The cases that the attributes of the original data are correlated and not sparse are also common in data mining. We also propose a reconstruction method for the typical case of Multivariate Gaussian Distribution, based on the method of Maximum A Posterior (MAP). Our experiments show that our reconstructions can achieve high recovery rates, and outperform the reconstructions based on Principle Component Analysis (PCA).

原文English
主出版物標題Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2009, Proceedings
發行者Springer Verlag
頁面334-349
頁數16
版本PART 2
ISBN(列印)3642041736, 9783642041730
DOIs
出版狀態Published - 2009
對外發佈
事件9th Joint European Conference on Machine Learning and Knowledge Discovery in Databases, ECML PKDD 2009 - Bled, Slovenia
持續時間: 7 9月 200911 9月 2009

出版系列

名字Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
號碼PART 2
5782 LNAI
ISSN(列印)0302-9743
ISSN(電子)1611-3349

Conference

Conference9th Joint European Conference on Machine Learning and Knowledge Discovery in Databases, ECML PKDD 2009
國家/地區Slovenia
城市Bled
期間7/09/0911/09/09

指紋

深入研究「Reconstructing data perturbed by random projections when the mixing matrix is known」主題。共同形成了獨特的指紋。

引用此