TY - GEN
T1 - Scalable seed expansion for identifying Web communities
AU - Han, Min
AU - Shen, Hong
AU - Zhang, Xianchao
PY - 2011
Y1 - 2011
N2 - We study the problem of identifying Web communities around some seed vertex. In this work, we propose a fast graph algorithm to expand Web communities in a scalable style. Given a seed vertex, our algorithm computes approximate personalized PageRank vectors with better and better approximations, and finds the smallest conductance sets on these vectors as candidate communities in nearly-linear time. At the end, it returns the candidate community with the smallest conductance as the result community. We also define local community profile (LCP) to investigate structural and statistical properties of Web communities in a local range. Theoretical analysis and primary experiments both show the efficiency of the proposed algorithm and the quality of the results.
AB - We study the problem of identifying Web communities around some seed vertex. In this work, we propose a fast graph algorithm to expand Web communities in a scalable style. Given a seed vertex, our algorithm computes approximate personalized PageRank vectors with better and better approximations, and finds the smallest conductance sets on these vectors as candidate communities in nearly-linear time. At the end, it returns the candidate community with the smallest conductance as the result community. We also define local community profile (LCP) to investigate structural and statistical properties of Web communities in a local range. Theoretical analysis and primary experiments both show the efficiency of the proposed algorithm and the quality of the results.
UR - http://www.scopus.com/inward/record.url?scp=84863038886&partnerID=8YFLogxK
U2 - 10.1109/PAAP.2011.64
DO - 10.1109/PAAP.2011.64
M3 - Conference contribution
AN - SCOPUS:84863038886
SN - 9780769545752
T3 - Proceedings - 2011 4th International Symposium on Parallel Architectures, Algorithms and Programming, PAAP 2011
SP - 141
EP - 145
BT - Proceedings - 2011 4th International Symposium on Parallel Architectures, Algorithms and Programming, PAAP 2011
T2 - 2011 4th International Symposium on Parallel Architectures, Algorithms and Programming, PAAP 2011
Y2 - 9 December 2011 through 11 December 2011
ER -