ID: 1311.2694

Hypothesis Testing for Automated Community Detection in Networks

November 12, 2013

View on ArXiv

Similar papers 2

Community detection and graph partitioning

May 21, 2013

91% Match
M. E. J. Newman
Social and Information Netwo...
Data Analysis, Statistics an...
Physics and Society

Many methods have been proposed for community detection in networks. Some of the most promising are methods based on statistical inference, which rest on solid mathematical foundations and return excellent results in practice. In this paper we show that two of the most widely used inference methods can be mapped directly onto versions of the standard minimum-cut graph partitioning problem, which allows us to apply any of the many well-understood partitioning algorithms to the...

Find SimilarView on arXiv

Multiple Hypothesis Testing To Estimate The Number of Communities in Sparse Stochastic Block Models

January 12, 2022

91% Match
Chetkar Jha, Mingyao Li, Ian Barnett
Methodology

Network-based clustering methods frequently require the number of communities to be specified \emph{a priori}. Moreover, most of the existing methods for estimating the number of communities assume the number of communities to be fixed and not scale with the network size $n$. The few methods that assume the number of communities to increase with the network size $n$ are only valid when the average degree $d$ of a network grows at least as fast as $O(n)$ (i.e., the dense case)...

Find SimilarView on arXiv

A Survey on Theoretical Advances of Community Detection in Networks

August 26, 2018

91% Match
Yunpeng Zhao
Social and Information Netwo...
Machine Learning
Machine Learning

Real-world networks usually have community structure, that is, nodes are grouped into densely connected communities. Community detection is one of the most popular and best-studied research topics in network science and has attracted attention in many different fields, including computer science, statistics, social sciences, among others. Numerous approaches for community detection have been proposed in literature, from ad-hoc algorithms to systematic model-based approaches. ...

Find SimilarView on arXiv

A spectral based goodness-of-fit test for stochastic block models

March 25, 2023

91% Match
Qianyong Wu, Jiang Hu
Methodology

Community detection in complex networks has attracted considerable attention, however, most existing methods need the number of communities to be specified beforehand. In this paper, a goodness-of-fit test based on the linear spectral statistic of the centered and rescaled adjacency matrix for the stochastic block model is proposed. We prove that the proposed test statistic converges in distribution to the standard Gaussian distribution under the null hypothesis. The proof us...

Find SimilarView on arXiv

A goodness-of-fit test for stochastic block models

December 16, 2014

91% Match
Jing Lei
Statistics Theory
Methodology
Statistics Theory

The stochastic block model is a popular tool for studying community structures in network data. We develop a goodness-of-fit test for the stochastic block model. The test statistic is based on the largest singular value of a residual matrix obtained by subtracting the estimated block mean effect from the adjacency matrix. Asymptotic null distribution is obtained using recent advances in random matrix theory. The test is proved to have full power against alternative models wit...

Find SimilarView on arXiv

On community structure validation in real networks

October 18, 2017

90% Match
Mirko Signorelli, Luisa Cutillo
Applications
Social and Information Netwo...
Physics and Society
Methodology

Community structure is a commonly observed feature of real networks. The term refers to the presence in a network of groups of nodes (communities) that feature high internal connectivity, but are poorly connected between each other. Whereas the issue of community detection has been addressed in several works, the problem of validating a partition of nodes as a good community structure for a real network has received considerably less attention and remains an open issue. We pr...

Find SimilarView on arXiv

Testing Changes in Communities for the Stochastic Block Model

November 29, 2018

90% Match
Aditya Gangrade, Praveen Venkatesh, ... , Saligrama Venkatesh
cs.IT
cs.LG
cs.SI
math.IT
math.ST
stat.TH

We propose and analyze the problems of \textit{community goodness-of-fit and two-sample testing} for stochastic block models (SBM), where changes arise due to modification in community memberships of nodes. Motivated by practical applications, we consider the challenging sparse regime, where expected node degrees are constant, and the inter-community mean degree ($b$) scales proportionally to intra-community mean degree ($a$). Prior work has sharply characterized partial or f...

Find SimilarView on arXiv

A generalized hypothesis test for community structure in networks

July 8, 2021

90% Match
Eric Yanchenko, Srijan Sengupta
Social and Information Netwo...
Methodology

Researchers theorize that many real-world networks exhibit community structure where within-community edges are more likely than between-community edges. While numerous methods exist to cluster nodes into different communities, less work has addressed this question: given some network, does it exhibit statistically meaningful community structure? We answer this question in a principled manner by framing it as a statistical hypothesis test in terms of a general and model-agnos...

Find SimilarView on arXiv

Hierarchical community detection by recursive partitioning

October 2, 2018

90% Match
Tianxi Li, Lihua Lei, Sharmodeep Bhattacharyya, Koen Van den Berge, Purnamrita Sarkar, ... , Levina Elizaveta
Methodology
Statistics Theory
Machine Learning
Statistics Theory

The problem of community detection in networks is usually formulated as finding a single partition of the network into some "correct" number of communities. We argue that it is more interpretable and in some regimes more accurate to construct a hierarchical tree of communities instead. This can be done with a simple top-down recursive partitioning algorithm, starting with a single community and separating the nodes into two communities by spectral clustering repeatedly, until...

Find SimilarView on arXiv

Scalable and Robust Community Detection with Randomized Sketching

May 25, 2018

90% Match
Mostafa Rahmani, Andre Beckus, ... , Atia George
Social and Information Netwo...
Machine Learning
Machine Learning

This article explores and analyzes the unsupervised clustering of large partially observed graphs. We propose a scalable and provable randomized framework for clustering graphs generated from the stochastic block model. The clustering is first applied to a sub-matrix of the graph's adjacency matrix associated with a reduced graph sketch constructed using random sampling. Then, the clusters of the full graph are inferred based on the clusters extracted from the sketch using a ...

Find SimilarView on arXiv