Research

Finding Community Structure in Mega-scale Social Networks

Rhia improves upon work done by http://arxiv.org/abs/cond-mat/0408187

Community analysis algorithm proposed by Clauset, Newman, and Moore (CNM algorithm) finds community structure in social networks. Unfortunately, CNM algorithm does not scale well and its use is practically limited to networks whose sizes are up to 500,000 nodes. The paper identifies that this inefficiency is caused from merging communities in unbalanced manner. The paper introduces three kinds of metrics (consolidation ratio) to control the process of community analysis trying to balance the sizes of the communities being merged. Three flavors of CNM algorithms are built incorporating those metrics. The proposed techniques are tested using data sets obtained from existing social networking service that hosts 5.5 million users. All the methods exhibit dramatic improvement of execution efficiency in comparison with the original CNM algorithm and shows high scalability. The fastest method processes a network with 1 million nodes in 5 minutes and a network with 4 million nodes in 35 minutes, respectively. Another one processes a network with 500,000 nodes in 50 minutes (7 times faster than the original algorithm), finds community structures that has improved modularity, and scales to a network with 5.5 million.

via [cs/0702048] Finding Community Structure in Mega-scale Social Networks.

Sorin Adam Matei

Assistant Vice President for Partnerships in Strategic Defense Innnovation and Professor of Communication at Purdue University, Director of the FORCES initiative leads research teams that study the relationship between technological and social systems using big data, simulation, and mapping approaches. He published papers and articles in Journal of Communication, Communication Research, Information Society, National Interest, and Foreign Policy. He is the author or co-editor of several books. The most recent is Structural differentation in social media. He also co-edited Ethical Reasoning in Big Data,Transparency in social media and Roles, Trust, and Reputation in Social Media Knowledge Markets: Theory and Methods (Computational Social Sciences) , all three the product of the NSF funded KredibleNet project. Dr. Matei's teaching portfolio includes technology and strategy, online interaction, and digital media analytics classes. A former BBC World Service journalist, his contributions have been published in Esquire and several leading Romanian newspapers. In Romania, he is known for his books Boierii Mintii (The Mind Boyars), Idolii forului (Idols of the forum), and Idei de schimb (Spare ideas).

2 thoughts on “Finding Community Structure in Mega-scale Social Networks

  • I had only a brief look by now but it looks really interesting. I was looking for this kind of overview for some time.

    Do you know of any good studies that approach the problem from the “other end”? Say, on substantially meaningful network generating processes that map to certain community definition or detection algorithms in the sense of producing networks with given degree of “modularity” (according to the given definition)?

    Reply

Leave a Reply

Your email address will not be published. Required fields are marked *