What you see is not what you get: how sa...

Estimating network degree distributions under sampling: An inverse problem, with applications to monitoring social media networks

May 21, 2013

88% Match

Yaonan Zhang, Eric D. Kolaczyk, Bruce D. Spencer

Methodology

Networks are a popular tool for representing elements in a system and their interconnectedness. Many observed networks can be viewed as only samples of some true underlying network. Such is frequently the case, for example, in the monitoring and study of massive, online social networks. We study the problem of how to estimate the degree distribution - an object of fundamental interest - of a true underlying network from its sampled network. In particular, we show that this pr...

Find SimilarView on arXiv

Inferring Degrees from Incomplete Networks and Nonlinear Dynamics

April 21, 2020

88% Match

Chunheng Jiang, Jianxi Gao, Malik Magdon-Ismail

Multiagent Systems

Physics and Society

Inferring topological characteristics of complex networks from observed data is critical to understand the dynamical behavior of networked systems, ranging from the Internet and the World Wide Web to biological networks and social networks. Prior studies usually focus on the structure-based estimation to infer network sizes, degree distributions, average degrees, and more. Little effort attempted to estimate the specific degree of each vertex from a sampled induced graph, whi...

Find SimilarView on arXiv

Sampling properties of directed networks

January 6, 2012

88% Match

Seung-Woo Son, Claire Christensen, Golnoosh Bizhani, David V. Foster, ... , Paczuski Maya

Physics and Society

Social and Information Netwo...

Data Analysis, Statistics an...

For many real-world networks only a small "sampled" version of the original network may be investigated; those results are then used to draw conclusions about the actual system. Variants of breadth-first search (BFS) sampling, which are based on epidemic processes, are widely used. Although it is well established that BFS sampling fails, in most cases, to capture the IN-component(s) of directed networks, a description of the effects of BFS sampling on other topological proper...

Find SimilarView on arXiv

Network Sampling: From Static to Streaming Graphs

November 14, 2012

88% Match

Nesreen K. Ahmed, Jennifer Neville, Ramana Kompella

Social and Information Netwo...

Data Structures and Algorith...

Machine Learning

Physics and Society

Machine Learning

Network sampling is integral to the analysis of social, information, and biological networks. Since many real-world networks are massive in size, continuously evolving, and/or distributed in nature, the network structure is often sampled in order to facilitate study. For these reasons, a more thorough and complete understanding of network sampling is critical to support the field of network science. In this paper, we outline a framework for the general problem of network samp...

Find SimilarView on arXiv

Network Sparsification via Degree- and Subgraph-based Edge Sampling

January 8, 2023

88% Match

Zhen Su, Jürgen Kurths, Henning Meyerhenke

Social and Information Netwo...

Network (or graph) sparsification compresses a graph by removing inessential edges. By reducing the data volume, it accelerates or even facilitates many downstream analyses. Still, the accuracy of many sparsification methods, with filtering-based edge sampling being the most typical one, heavily relies on an appropriate definition of edge importance. Instead, we propose a different perspective with a generalized local-property-based sampling method, which preserves (scaled) l...

Find SimilarView on arXiv

Network Sampling Based on NN Representatives

February 7, 2014

88% Match

Milos Kudelka, Sarka Zehnalova, Jan Platos

Social and Information Netwo...

Physics and Society

The amount of large-scale real data around us increase in size very quickly and so does the necessity to reduce its size by obtaining a representative sample. Such sample allows us to use a great variety of analytical methods, whose direct application on original data would be infeasible. There are many methods used for different purposes and with different results. In this paper we outline a simple and straightforward approach based on analyzing the nearest neighbors (NN) th...

Find SimilarView on arXiv

False Discovery Rates in Biological Networks

July 8, 2019

88% Match

Lu Yu, Tobias Kaufmann, Johannes Lederer

Methodology

Quantitative Methods

Applications

The increasing availability of data has generated unprecedented prospects for network analyses in many biological fields, such as neuroscience (e.g., brain networks), genomics (e.g., gene-gene interaction networks), and ecology (e.g., species interaction networks). A powerful statistical framework for estimating such networks is Gaussian graphical models, but standard estimators for the corresponding graphs are prone to large numbers of false discoveries. In this paper, we in...

Find SimilarView on arXiv

A comprehensive statistical study of metabolic and protein-protein interaction network properties

December 20, 2017

88% Match

D. Gamermann, J. Triana, R. Jaime

Molecular Networks

Applications

Understanding the mathematical properties of graphs underling biological systems could give hints on the evolutionary mechanisms behind these structures. In this article we perform a complete statistical analysis over thousands of graphs representing metabolic and protein-protein interaction (PPI) networks. First, we investigate the quality of fits obtained for the nodes degree distributions to power-law functions. This analysis suggests that a power-law distribution poorly d...

Find SimilarView on arXiv

Connectedness matters: Construction and exact random sampling of connected graphs

September 8, 2020

88% Match

Szabolcs Horvát, Carl D. Modes

Physics and Society

Discrete Mathematics

Combinatorics

We describe a new method for the random sampling of connected networks with a specified degree sequence. We consider both the case of simple graphs and that of loopless multigraphs. The constraints of fixed degrees and of connectedness are two of the most commonly needed ones when constructing null models for the practical analysis of physical or biological networks. Yet handling these constraints, let alone combining them, is non-trivial. Our method builds on a recently intr...

Find SimilarView on arXiv

Estimation of Vertex Degrees in a Sampled Network

January 25, 2017

88% Match

Apratim Ganguly, Eric Kolaczyk

Applications

The need to produce accurate estimates of vertex degree in a large network, based on observation of a subnetwork, arises in a number of practical settings. We study a formalized version of this problem, wherein the goal is, given a randomly sampled subnetwork from a large parent network, to estimate the actual degree of the sampled nodes. Depending on the sampling scheme, trivial method of moments estimators (MMEs) can be used. However, the MME is not expected, in general, to...

Find SimilarView on arXiv

What you see is not what you get: how sampling affects macroscopic features of biological networks

Estimating network degree distributions under sampling: An inverse problem, with applications to monitoring social media networks

Inferring Degrees from Incomplete Networks and Nonlinear Dynamics

Sampling properties of directed networks

Network Sampling: From Static to Streaming Graphs

Network Sparsification via Degree- and Subgraph-based Edge Sampling

Network Sampling Based on NN Representatives

False Discovery Rates in Biological Networks

A comprehensive statistical study of metabolic and protein-protein interaction network properties

Connectedness matters: Construction and exact random sampling of connected graphs

Estimation of Vertex Degrees in a Sampled Network