ID: 1711.11071

HSC: A Novel Method for Clustering Hierarchies of Networked Data

November 29, 2017

View on ArXiv

Similar papers 2

Extracting the hierarchical organization of complex systems

May 11, 2007

88% Match
M. Sales-Pardo, R. Guimera, ... , Amaral L.
Physics and Society

Extracting understanding from the growing ``sea'' of biological and socio-economic data is one of the most pressing scientific challenges facing us. Here, we introduce and validate an unsupervised method that is able to accurately extract the hierarchical organization of complex biological, social, and technological networks. We define an ensemble of hierarchically nested random graphs, which we use to validate the method. We then apply our method to real-world networks, incl...

Find SimilarView on arXiv

Linear-time Hierarchical Community Detection

June 15, 2019

87% Match
Ryan A. Rossi, Nesreen K. Ahmed, ... , Kim Sungchul
Social and Information Netwo...
Distributed, Parallel, and C...
Data Structures and Algorith...

Community detection in graphs has many important and fundamental applications including in distributed systems, compression, image segmentation, divide-and-conquer graph algorithms such as nested dissection, document and word clustering, circuit design, among many others. Finding these densely connected regions of graphs remains an important and challenging problem. Most work has focused on scaling up existing methods to handle large graphs. These methods often partition the ...

Find SimilarView on arXiv

Model-based clustering of multiple networks with a hierarchical algorithm

November 4, 2022

87% Match
Tabea LPSM Rebafka
Statistics Theory
Statistics Theory

The paper tackles the problem of clustering multiple networks, directed or not, that do not share the same set of vertices, into groups of networks with similar topology. A statistical model-based approach based on a finite mixture of stochastic block models is proposed. A clustering is obtained by maximizing the integrated classification likelihood criterion. This is done by a hierarchical agglomerative algorithm, that starts from singleton clusters and successively merges c...

Find SimilarView on arXiv

Community Detection and Classification in Hierarchical Stochastic Blockmodels

March 7, 2015

87% Match
Vince Lyzinski, Minh Tang, Avanti Athreya, ... , Priebe Carey E.
Machine Learning
Applications

We propose a robust, scalable, integrated methodology for community detection and community comparison in graphs. In our procedure, we first embed a graph into an appropriate Euclidean space to obtain a low-dimensional representation, and then cluster the vertices into communities. We next employ nonparametric graph inference techniques to identify structural similarity among these communities. These two steps are then applied recursively on the communities, allowing us to de...

Find SimilarView on arXiv

Finding community structure in very large networks

August 9, 2004

87% Match
Aaron Clauset, M. E. J. Newman, Cristopher Moore
Statistical Mechanics
Disordered Systems and Neura...

The discovery and analysis of community structure in networks is a topic of considerable recent interest within the physics community, but most methods proposed so far are unsuitable for very large networks because of their computational cost. Here we present a hierarchical agglomeration algorithm for detecting community structure which is faster than many competing algorithms: its running time on a network with n vertices and m edges is O(m d log n) where d is the depth of t...

Find SimilarView on arXiv

HiPart: Hierarchical Divisive Clustering Toolbox

September 19, 2022

87% Match
Panagiotis Anagnostou, Sotiris Tasoulis, ... , Tasoulis Dimitris
Machine Learning
Artificial Intelligence
Machine Learning

This paper presents the HiPart package, an open-source native python library that provides efficient and interpret-able implementations of divisive hierarchical clustering algorithms. HiPart supports interactive visualizations for the manipulation of the execution steps allowing the direct intervention of the clustering outcome. This package is highly suited for Big Data applications as the focus has been given to the computational efficiency of the implemented clustering met...

Find SimilarView on arXiv

Hierarchical stochastic graphlet embedding for graph-based pattern recognition

July 8, 2018

87% Match
Anjan Dutta, Pau Riba, ... , Fornés Alicia
Computer Vision and Pattern ...
Machine Learning
Machine Learning

Despite being very successful within the pattern recognition and machine learning community, graph-based methods are often unusable because of the lack of mathematical operations defined in graph domain. Graph embedding, which maps graphs to a vectorial space, has been proposed as a way to tackle these difficulties enabling the use of standard machine learning techniques. However, it is well known that graph embedding functions usually suffer from the loss of structural infor...

Find SimilarView on arXiv

Scaling Hierarchical Agglomerative Clustering to Billion-sized Datasets

May 25, 2021

87% Match
Baris Google Research Sumengen, Anand Google Research Rajagopalan, Gui Google Research Citovsky, David Google Research Simcha, Olivier Google Research Bachem, Pradipta Google Research Mitra, Sam Google Research Blasiak, ... , Kumar Sanjiv Google Research
Machine Learning

Hierarchical Agglomerative Clustering (HAC) is one of the oldest but still most widely used clustering methods. However, HAC is notoriously hard to scale to large data sets as the underlying complexity is at least quadratic in the number of data points and many algorithms to solve HAC are inherently sequential. In this paper, we propose {Reciprocal Agglomerative Clustering (RAC)}, a distributed algorithm for HAC, that uses a novel strategy to efficiently merge clusters in par...

Find SimilarView on arXiv

Hierarchical structure and the prediction of missing links in networks

November 4, 2008

87% Match
Aaron Clauset, Cristopher Moore, M. E. J. Newman
Machine Learning
Physics and Society
Molecular Networks

Networks have in recent years emerged as an invaluable tool for describing and quantifying complex systems in many branches of science. Recent studies suggest that networks often exhibit hierarchical organization, where vertices divide into groups that further subdivide into groups of groups, and so forth over multiple scales. In many cases these groups are found to correspond to known functional units, such as ecological niches in food webs, modules in biochemical networks (...

Find SimilarView on arXiv

A k-core Decomposition Framework for Graph Clustering

July 7, 2016

87% Match
Christos Giatsidis, Fragkiskos D. Malliaros, Nikolaos Tziortziotis, Charanpal Dhanjal, Emmanouil Kiagias, ... , Vazirgiannis Michalis
Social and Information Netwo...
Data Structures and Algorith...

Graph clustering or community detection constitutes an important task for investigating the internal structure of graphs, with a plethora of applications in several domains. Traditional techniques for graph clustering, such as spectral methods, typically suffer from high time and space complexity. In this article, we present CoreCluster, an efficient graph clustering framework based on the concept of graph degeneracy, that can be used along with any known graph clustering alg...

Find SimilarView on arXiv