|
|
 |
News
Colloquia
Community structure in large social and information networks
Michael W. Mahoney
Yahoo Research
Thursday, March 27th, 2008
Troy 2018 - 4:00 p.m. to 5:00 p.m.
Refreshments at 3:30 p.m.
Abstract:
The concept of a community is central to social network analysis,
and thus a large body of work has been devoted to identifying
community structure. For example, a community may be thought of as a
set of web pages on related topics, a set of people who share common
interests, or more generally as a set of nodes in a network more
similar amongst themselves than with the remainder of the network.
Motivated by difficulties we experienced at actually finding
meaningful communities in large real-world networks, we have performed
a large scale analysis of a wide range of social and information
networks. Our results suggest a significantly more refined picture of
community structure than has been appreciated previously. Our most
striking finding is that in nearly every network dataset we examined,
we observe tight but almost trivial communities at very small size
scales, and at larger size scales, the best possible communities
gradually ``blend in'' with the rest of the network and thus become
less ``community-like.'' This behavior is not explained, even at a
qualitative level, by any of the commonly-used network generation
models. Moreover, this behavior is exactly the opposite of what one
would expect based on experience with and intuition from expander
graphs, from graphs that are well-embeddable in a low-dimensional
structure, and from small social networks that have served as testbeds
of community detection algorithms. Possible mechanisms for
reproducing our empirical observations will be discussed, as will
implications of these findings for clustering, classification, and
more general data analysis in modern large social and information
networks.
Bio:
Michael Mahoney joined Yahoo Research in August 2005. Prior to
joining, he was an Assistant Professor at Yale University in the
Department of Mathematics. His primary research interests are the
design and analysis of algorithms, in particular randomized and
approximation algorithms, and the application of these algorithms to
the structuring, analysis, and understanding of massive scientific and
computer scientific data sets.
Hosted by:Petros Drineas (x-8265)
Administrative support: Chris Coonrad (x8412)
For more information:
Dr. Michael Mahoney's webpage
Last updated: March 12, 2008
|
 |
|
|