Scalable seed expansion for identifying Web communities

Min Han, Hong Shen, Xianchao Zhang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

We study the problem of identifying Web communities around some seed vertex. In this work, we propose a fast graph algorithm to expand Web communities in a scalable style. Given a seed vertex, our algorithm computes approximate personalized PageRank vectors with better and better approximations, and finds the smallest conductance sets on these vectors as candidate communities in nearly-linear time. At the end, it returns the candidate community with the smallest conductance as the result community. We also define local community profile (LCP) to investigate structural and statistical properties of Web communities in a local range. Theoretical analysis and primary experiments both show the efficiency of the proposed algorithm and the quality of the results.

Original languageEnglish
Title of host publicationProceedings - 2011 4th International Symposium on Parallel Architectures, Algorithms and Programming, PAAP 2011
Pages141-145
Number of pages5
DOIs
Publication statusPublished - 2011
Externally publishedYes
Event2011 4th International Symposium on Parallel Architectures, Algorithms and Programming, PAAP 2011 - Tianjin, China
Duration: 9 Dec 201111 Dec 2011

Publication series

NameProceedings - 2011 4th International Symposium on Parallel Architectures, Algorithms and Programming, PAAP 2011

Conference

Conference2011 4th International Symposium on Parallel Architectures, Algorithms and Programming, PAAP 2011
Country/TerritoryChina
CityTianjin
Period9/12/1111/12/11

Fingerprint

Dive into the research topics of 'Scalable seed expansion for identifying Web communities'. Together they form a unique fingerprint.

Cite this