ID: 1711.05150

Fast and reliable inference algorithm for hierarchical stochastic block models

November 14, 2017

View on ArXiv
Yongjin Park, Joel S. Bader
Statistics
Machine Learning

Network clustering reveals the organization of a network or corresponding complex system with elements represented as vertices and interactions as edges in a (directed, weighted) graph. Although the notion of clustering can be somewhat loose, network clusters or groups are generally considered as nodes with enriched interactions and edges sharing common patterns. Statistical inference often treats groups as latent variables, with observed networks generated from latent group structure, termed a stochastic block model. Regardless of the definitions, statistical inference can be either translated to modularity maximization, which is provably an NP-complete problem. Here we present scalable and reliable algorithms that recover hierarchical stochastic block models fast and accurately. Our algorithm scales almost linearly in number of edges, and inferred models were more accurate that other scalable methods.

Similar papers 1

Bayesian stochastic blockmodeling

May 29, 2017

91% Match
Tiago P. Peixoto
Machine Learning
Statistical Mechanics
Data Analysis, Statistics an...

This chapter provides a self-contained introduction to the use of Bayesian inference to extract large-scale modular structures from network data, based on the stochastic blockmodel (SBM), as well as its degree-corrected and overlapping generalizations. We focus on nonparametric formulations that allow their inference in a manner that prevents overfitting, and enables model selection. We discuss aspects of the choice of priors, in particular how to avoid underfitting via incre...

Find SimilarView on arXiv

Hierarchical Block Structures and High-resolution Model Selection in Large Networks

October 16, 2013

91% Match
Tiago P. Peixoto
physics.data-an
cond-mat.dis-nn
cond-mat.stat-mech
cs.SI
physics.soc-ph
stat.ML

Discovering and characterizing the large-scale topological features in empirical networks are crucial steps in understanding how complex systems function. However, most existing methods used to obtain the modular structure of networks suffer from serious problems, such as being oblivious to the statistical evidence supporting the discovered patterns, which results in the inability to separate actual structure from noise. In addition to this, one also observes a resolution lim...

Find SimilarView on arXiv

Efficient Monte Carlo and greedy heuristic for the inference of stochastic block models

October 16, 2013

91% Match
Tiago P. Peixoto
Data Analysis, Statistics an...
Statistical Mechanics
Social and Information Netwo...
Computational Physics
Machine Learning

We present an efficient algorithm for the inference of stochastic block models in large networks. The algorithm can be used as an optimized Markov chain Monte Carlo (MCMC) method, with a fast mixing time and a much reduced susceptibility to getting trapped in metastable states, or as a greedy agglomerative heuristic, with an almost linear $O(N\ln^2N)$ complexity, where $N$ is the number of nodes in the network, independent on the number of blocks being inferred. We show that ...

Find SimilarView on arXiv

Hierarchical community structure in networks

September 15, 2020

91% Match
Michael T. Schaub, Jiaze Li, Leto Peel
Social and Information Netwo...
Machine Learning

Modular and hierarchical community structures are pervasive in real-world complex systems. A great deal of effort has gone into trying to detect and study these structures. Important theoretical advances in the detection of modular have included identifying fundamental limits of detectability by formally defining community structure using probabilistic generative models. Detecting hierarchical community structure introduces additional challenges alongside those inherited from...

Find SimilarView on arXiv

When Does Bottom-up Beat Top-down in Hierarchical Community Detection?

June 1, 2023

90% Match
Maximilien Dreveton, Daichi Kuroda, ... , Thiran Patrick
cs.SI
cs.LG
math.ST
stat.ME
stat.ML
stat.TH

Hierarchical clustering of networks consists in finding a tree of communities, such that lower levels of the hierarchy reveal finer-grained community structures. There are two main classes of algorithms tackling this problem. Divisive ($\textit{top-down}$) algorithms recursively partition the nodes into two communities, until a stopping rule indicates that no further split is needed. In contrast, agglomerative ($\textit{bottom-up}$) algorithms first identify the smallest comm...

Find SimilarView on arXiv

Community Detection and Classification in Hierarchical Stochastic Blockmodels

March 7, 2015

90% Match
Vince Lyzinski, Minh Tang, Avanti Athreya, ... , Priebe Carey E.
Machine Learning
Applications

We propose a robust, scalable, integrated methodology for community detection and community comparison in graphs. In our procedure, we first embed a graph into an appropriate Euclidean space to obtain a low-dimensional representation, and then cluster the vertices into communities. We next employ nonparametric graph inference techniques to identify structural similarity among these communities. These two steps are then applied recursively on the communities, allowing us to de...

Find SimilarView on arXiv

Adapting the Stochastic Block Model to Edge-Weighted Networks

May 24, 2013

90% Match
Christopher Aicher, Abigail Z. Jacobs, Aaron Clauset
Machine Learning
Machine Learning
Social and Information Netwo...
Data Analysis, Statistics an...

We generalize the stochastic block model to the important case in which edges are annotated with weights drawn from an exponential family distribution. This generalization introduces several technical difficulties for model estimation, which we solve using a Bayesian approach. We introduce a variational algorithm that efficiently approximates the model's posterior distribution for dense graphs. In specific numerical experiments on edge-weighted networks, this weighted stochas...

Find SimilarView on arXiv

Structural Inference of Hierarchies in Networks

October 9, 2006

90% Match
Aaron Clauset, Cristopher Moore, M. E. J. Newman
Physics and Society
Machine Learning
Data Analysis, Statistics an...

One property of networks that has received comparatively little attention is hierarchy, i.e., the property of having vertices that cluster together in groups, which then join to form groups of groups, and so forth, up through all levels of organization in the network. Here, we give a precise definition of hierarchical structure, give a generic model for generating arbitrary hierarchical structure in a random graph, and describe a statistically principled way to learn the set ...

Find SimilarView on arXiv

The Hierarchy of Block Models

February 7, 2020

90% Match
Majid Noroozi, Marianna Pensky
Machine Learning
Machine Learning
Statistics Theory
Statistics Theory

There exist various types of network block models such as the Stochastic Block Model (SBM), the Degree Corrected Block Model (DCBM), and the Popularity Adjusted Block Model (PABM). While this leads to a variety of choices, the block models do not have a nested structure. In addition, there is a substantial jump in the number of parameters from the DCBM to the PABM. The objective of this paper is formulation of a hierarchy of block model which does not rely on arbitrary identi...

Find SimilarView on arXiv

A Review of Stochastic Block Models and Extensions for Graph Clustering

March 1, 2019

90% Match
Clement Lee, Darren J Wilkinson
Machine Learning
Machine Learning

There have been rapid developments in model-based clustering of graphs, also known as block modelling, over the last ten years or so. We review different approaches and extensions proposed for different aspects in this area, such as the type of the graph, the clustering approach, the inference approach, and whether the number of groups is selected or estimated. We also review models that combine block modelling with topic modelling and/or longitudinal modelling, regarding how...

Find SimilarView on arXiv