ID: 1501.05021

Stochastic Block Model and Community Detection in the Sparse Graphs: A spectral algorithm with optimal rate of recovery

January 20, 2015

View on ArXiv

Similar papers 2

Exact Community Recovery (under Side Information): Optimality of Spectral Algorithms

June 18, 2024

90% Match
Julia Gaudio, Nirmit Joshi
Social and Information Netwo...
Machine Learning

In this paper, we study the problem of exact community recovery in general, two-community block models considering both Bernoulli and Gaussian matrix models, capturing the Stochastic Block Model, submatrix localization, and $\mathbb{Z}_2$-synchronization as special cases. We also study the settings where $side$ $information$ about community assignment labels is available, modeled as passing the true labels through a noisy channel: either the binary erasure channel (where some...

Find SimilarView on arXiv

Analysis of spectral clustering algorithms for community detection: the general bipartite setting

March 12, 2018

90% Match
Zhixin Zhou, Arash A. Amini
Statistics Theory
Social and Information Netwo...
Machine Learning
Statistics Theory

We consider spectral clustering algorithms for community detection under a general bipartite stochastic block model (SBM). A modern spectral clustering algorithm consists of three steps: (1) regularization of an appropriate adjacency or Laplacian matrix (2) a form of spectral truncation and (3) a k-means type algorithm in the reduced spectral domain. We focus on the adjacency-based spectral clustering and for the first step, propose a new data-driven regularization that can r...

Find SimilarView on arXiv

Sparse and Smooth: improved guarantees for Spectral Clustering in the Dynamic Stochastic Block Model

February 7, 2020

90% Match
Nicolas Keriven, Samuel Vaiter
Machine Learning
Machine Learning
Statistics Theory
Statistics Theory

In this paper, we analyse classical variants of the Spectral Clustering (SC) algorithm in the Dynamic Stochastic Block Model (DSBM). Existing results show that, in the relatively sparse case where the expected degree grows logarithmically with the number of nodes, guarantees in the static case can be extended to the dynamic case and yield improved error bounds when the DSBM is sufficiently smooth in time, that is, the communities do not change too much between two time steps....

Find SimilarView on arXiv

Consistency of spectral clustering in stochastic block models

December 7, 2013

90% Match
Jing Lei, Alessandro Rinaldo
Statistics Theory
Machine Learning
Statistics Theory

We analyze the performance of spectral clustering for community extraction in stochastic block models. We show that, under mild conditions, spectral clustering applied to the adjacency matrix of the network can consistently recover hidden communities even when the order of the maximum expected degree is as small as $\log n$, with $n$ the number of nodes. This result applies to some popular polynomial time spectral clustering algorithms and is further extended to degree correc...

Find SimilarView on arXiv

Provable Estimation of the Number of Blocks in Block Models

May 24, 2017

90% Match
Bowei Yan, Purnamrita Sarkar, Xiuyuan Cheng
Machine Learning
Methodology

Community detection is a fundamental unsupervised learning problem for unlabeled networks which has a broad range of applications. Many community detection algorithms assume that the number of clusters $r$ is known apriori. In this paper, we propose an approach based on semi-definite relaxations, which does not require prior knowledge of model parameters like many existing convex relaxation methods and recovers the number of clusters and the clustering matrix exactly under a ...

Find SimilarView on arXiv

Optimal Recovery of Block Models with $q$ Communities

October 21, 2020

90% Match
Byron Chin, Allan Sly
Probability
Social and Information Netwo...

This paper is motivated by the reconstruction problem on the sparse stochastic block model. The paper "Belief Propagation, robust reconstruction and optimal recovery of block models" by Mossel, Neeman, and Sly provided and proved a reconstruction algorithm that recovers an optimal fraction of the communities in the 2 community case. The main step in their proof was to show that when the signal to noise ratio is sufficiently large, in particular $\theta^2d > C$, the reconstruc...

Find SimilarView on arXiv

Optimal Cluster Recovery in the Labeled Stochastic Block Model

October 20, 2015

90% Match
Se-Young Yun, Alexandre Proutiere
Probability
Machine Learning
Social and Information Netwo...
Machine Learning

We consider the problem of community detection or clustering in the labeled Stochastic Block Model (LSBM) with a finite number $K$ of clusters of sizes linearly growing with the global population of items $n$. Every pair of items is labeled independently at random, and label $\ell$ appears with probability $p(i,j,\ell)$ between two items in clusters indexed by $i$ and $j$, respectively. The objective is to reconstruct the clusters from the observation of these random labels. ...

Find SimilarView on arXiv

Detecting Overlapping Communities in Networks Using Spectral Methods

December 10, 2014

90% Match
Yuan Zhang, Elizaveta Levina, Ji Zhu
Machine Learning

Community detection is a fundamental problem in network analysis which is made more challenging by overlaps between communities which often occur in practice. Here we propose a general, flexible, and interpretable generative model for overlapping communities, which can be thought of as a generalization of the degree-corrected stochastic block model. We develop an efficient spectral algorithm for estimating the community memberships, which deals with the overlaps by employing ...

Find SimilarView on arXiv

Robustness of spectral methods for community detection

November 14, 2018

90% Match
Ludovic Stephan, Laurent Massoulié
Data Structures and Algorith...
Probability

The present work is concerned with community detection. Specifically, we consider a random graph drawn according to the stochastic block model~: its vertex set is partitioned into blocks, or communities, and edges are placed randomly and independently of each other with probability depending only on the communities of their two endpoints. In this context, our aim is to recover the community labels better than by random guess, based only on the observation of the graph. In t...

Find SimilarView on arXiv

Community Detection in the Hypergraph SBM: Exact Recovery Given the Similarity Matrix

August 23, 2022

90% Match
Julia Gaudio, Nirmit Joshi
Social and Information Netwo...
Machine Learning

Community detection is a fundamental problem in network science. In this paper, we consider community detection in hypergraphs drawn from the $hypergraph$ $stochastic$ $block$ $model$ (HSBM), with a focus on exact community recovery. We study the performance of polynomial-time algorithms which operate on the $similarity$ $matrix$ $W$, where $W_{ij}$ reports the number of hyperedges containing both $i$ and $j$. Under this information model, while the precise information-theore...

Find SimilarView on arXiv