TY - GEN
T1 - Anonymizing hypergraphs with community preservation
AU - Li, Yidong
AU - Shen, Hong
PY - 2011
Y1 - 2011
N2 - Data publishing based on hypergraphs is becoming increasingly popular due to its power in representing multirelations among objects. However, security issues have been little studied on this subject, while most recent work only focuses on the protection of relational data or graphs. As a major privacy breach, identity disclosure reveals the identification of entities with certain background knowledge known by an adversary. In this paper, we first introduce a novel background knowledge attack model based on the property of hyperedge ranks, and formalize the rank-based hypergraph anonymization problem. We then propose a complete solution in a two-step framework, with taking community preservation as the objective data utility. The algorithms run in near-quadratic time on hypergraph size, and protect data from rank attacks with almost same utility preserved. The performances of the methods have been validated by extensive experiments on real-world datasets as well.
AB - Data publishing based on hypergraphs is becoming increasingly popular due to its power in representing multirelations among objects. However, security issues have been little studied on this subject, while most recent work only focuses on the protection of relational data or graphs. As a major privacy breach, identity disclosure reveals the identification of entities with certain background knowledge known by an adversary. In this paper, we first introduce a novel background knowledge attack model based on the property of hyperedge ranks, and formalize the rank-based hypergraph anonymization problem. We then propose a complete solution in a two-step framework, with taking community preservation as the objective data utility. The algorithms run in near-quadratic time on hypergraph size, and protect data from rank attacks with almost same utility preserved. The performances of the methods have been validated by extensive experiments on real-world datasets as well.
KW - Anonymization
KW - Community detection
KW - Identity disclosure
KW - Private data publishing
UR - http://www.scopus.com/inward/record.url?scp=84863075238&partnerID=8YFLogxK
U2 - 10.1109/PDCAT.2011.21
DO - 10.1109/PDCAT.2011.21
M3 - Conference contribution
AN - SCOPUS:84863075238
SN - 9780769545646
T3 - Parallel and Distributed Computing, Applications and Technologies, PDCAT Proceedings
SP - 185
EP - 190
BT - Proceedings - 2011 12th International Conference on Parallel and Distributed Computing, Applications and Technologies, PDCAT 2011
T2 - 2011 12th International Conference on Parallel and Distributed Computing, Applications and Technologies, PDCAT 2011
Y2 - 20 October 2011 through 22 October 2011
ER -