ID: 1606.02319

Community detection in networks: Modularity optimization and maximum likelihood are equivalent

June 7, 2016

View on ArXiv
M. E. J. Newman
Computer Science
Physics
Social and Information Netwo...
Physics and Society

We demonstrate an exact equivalence between two widely used methods of community detection in networks, the method of modularity maximization in its generalized form which incorporates a resolution parameter controlling the size of the communities discovered, and the method of maximum likelihood applied to the special case of the stochastic block model known as the planted partition model, in which all communities in a network are assumed to have statistically similar properties. Among other things, this equivalence provides a mathematically principled derivation of the modularity function, clarifies the conditions and assumptions of its use, and gives an explicit formula for the optimal value of the resolution parameter.

Similar papers 1

Community Detection through Likelihood Optimization: In Search of a Sound Model

February 13, 2018

93% Match
Liudmila Prokhorenkova, Alexey Tikhonov
Social and Information Netwo...
Statistics Theory
Statistics Theory

Community detection is one of the most important problems in network analysis. Among many algorithms proposed for this task, methods based on statistical inference are of particular interest: they are mathematically sound and were shown to provide partitions of good quality. Statistical inference methods are based on fitting some random graph model (a.k.a. null model) to the observed network by maximizing the likelihood. The choice of this model is extremely important and is ...

Find SimilarView on arXiv

Estimating the number of communities in a network

May 9, 2016

92% Match
M. E. J. Newman, Gesine Reinert
Social and Information Netwo...
Physics and Society

Community detection, the division of a network into dense subnetworks with only sparse connections between them, has been a topic of vigorous study in recent years. However, while there exist a range of powerful and flexible methods for dividing a network into a specified number of communities, it is an open question how to determine exactly how many communities one should use. Here we describe a mathematically principled approach for finding the number of communities in a ne...

Find SimilarView on arXiv

Heuristic Modularity Maximization Algorithms for Community Detection Rarely Return an Optimal Partition or Anything Similar

February 28, 2023

92% Match
Samin Aref, Mahdi Mostajabdaveh, Hriday Chheda
Social and Information Netwo...
Statistical Mechanics
Data Structures and Algorith...
Machine Learning
Optimization and Control

Community detection is a fundamental problem in computational sciences with extensive applications in various fields. The most commonly used methods are the algorithms designed to maximize modularity over different partitions of the network nodes. Using 80 real and random networks from a wide range of contexts, we investigate the extent to which current heuristic modularity maximization algorithms succeed in returning maximum-modularity (optimal) partitions. We evaluate (1) t...

Find SimilarView on arXiv

An information-theoretic framework for resolving community structure in complex networks

December 5, 2006

91% Match
Martin Rosvall, Carl T. Bergstrom
Physics and Society
Data Analysis, Statistics an...

To understand the structure of a large-scale biological, social, or technological network, it can be helpful to decompose the network into smaller subunits or modules. In this article, we develop an information-theoretic foundation for the concept of modularity in networks. We identify the modules of which the network is composed by finding an optimal compression of its topology, capitalizing on regularities in its structure. We explain the advantages of this approach and ill...

Find SimilarView on arXiv

Network community detection using modularity density measures

August 22, 2017

91% Match
Tianlong Chen, Pramesh Singh, Kevin E. Bassler
Physics and Society
Statistical Mechanics
Discrete Mathematics
Social and Information Netwo...

Modularity, since its introduction, has remained one of the most widely used metrics to assess the quality of community structure in a complex network. However the resolution limit problem associated with modularity limits its applicability to networks with community sizes smaller than a certain scale. In the past various attempts have been made to solve this problem. More recently a new metric, modularity density, was introduced for the quality of community structure in netw...

Find SimilarView on arXiv

Statistical inference of assortative community structures

June 25, 2020

91% Match
Lizhi Zhang, Tiago P. Peixoto
Physics and Society
Disordered Systems and Neura...
Social and Information Netwo...
Machine Learning

We develop a principled methodology to infer assortative communities in networks based on a nonparametric Bayesian formulation of the planted partition model. We show that this approach succeeds in finding statistically significant assortative modules in networks, unlike alternatives such as modularity maximization, which systematically overfits both in artificial as well as in empirical examples. In addition, we show that our method is not subject to a resolution limit, and ...

Find SimilarView on arXiv

The Bayan Algorithm: Detecting Communities in Networks Through Exact and Approximate Optimization of Modularity

September 10, 2022

91% Match
Samin Aref, Hriday Chheda, Mahdi Mostajabdaveh
Social and Information Netwo...
Statistical Mechanics
Data Structures and Algorith...
Machine Learning
Optimization and Control

Community detection is a classic problem in network science with extensive applications in various fields. Among numerous approaches, the most common method is modularity maximization. Despite their design philosophy and wide adoption, heuristic modularity maximization algorithms rarely return an optimal partition or anything similar. We propose a specialized algorithm, Bayan, which returns partitions with a guarantee of either optimality or proximity to an optimal partition....

Find SimilarView on arXiv

Detecting Network Communities: a new systematic and efficient algorithm

April 27, 2004

91% Match
Luca Donetti, Miguel A. Munoz
Statistical Mechanics
Other Condensed Matter

An efficient and relatively fast algorithm for the detection of communities in complex networks is introduced. The method exploits spectral properties of the graph Laplacian-matrix combined with hierarchical-clustering techniques, and includes a procedure to maximize the ``modularity'' of the output. Its performance is compared with that of other existing methods, as applied to different well-known instances of complex networks with a community-structure: both computer-genera...

Find SimilarView on arXiv

Community detection in graphs

June 3, 2009

90% Match
Santo Fortunato
physics.soc-ph
cond-mat.stat-mech
cs.IR
physics.bio-ph
physics.comp-ph
q-bio.QM

The modern science of networks has brought significant advances to our understanding of complex systems. One of the most relevant features of graphs representing real systems is community structure, or clustering, i. e. the organization of vertices in clusters, with many edges joining vertices of the same cluster and comparatively few edges joining vertices of different clusters. Such clusters, or communities, can be considered as fairly independent compartments of a graph, p...

Find SimilarView on arXiv

Limits of modularity maximization in community detection

July 6, 2011

90% Match
Andrea Lancichinetti, Santo Fortunato
Physics and Society
Social and Information Netwo...

Modularity maximization is the most popular technique for the detection of community structure in graphs. The resolution limit of the method is supposedly solvable with the introduction of modified versions of the measure, with tunable resolution parameters. We show that multiresolution modularity suffers from two opposite coexisting problems: the tendency to merge small subgraphs, which dominates when the resolution is low; the tendency to split large subgraphs, which domina...

Find SimilarView on arXiv