Statistical Mechanics of High-Dimensiona...

Inferring a property of a large system from a small number of samples

December 1, 2021

87% Match

Damián G. Hernández, Inés Samengo

Data Analysis, Statistics an...

Statistical Mechanics

Mathematical Physics

Inferring the value of a property of a large stochastic system is a difficult task when the number of samples is insufficient to reliably estimate the probability distribution. The Bayesian estimator of the property of interest requires the knowledge of the prior distribution, and in many situations, it is not clear which prior should be used. Several estimators have been developed so far, in which the proposed prior was individually tailored for each property of interest; su...

Find SimilarView on arXiv

On polynomial-time computation of high-dimensional posterior measures by Langevin-type algorithms

September 11, 2020

87% Match

Richard Nickl, Sven Wang

math.ST

cs.NA

math.AP

math.NA

math.PR

stat.CO

stat.TH

The problem of generating random samples of high-dimensional posterior distributions is considered. The main results consist of non-asymptotic computational guarantees for Langevin-type MCMC algorithms which scale polynomially in key quantities such as the dimension of the model, the desired precision level, and the number of available statistical measurements. As a direct consequence, it is shown that posterior mean vectors as well as optimisation based maximum a posteriori ...

Find SimilarView on arXiv

Gradient-based data and parameter dimension reduction for Bayesian models: an information theoretic perspective

July 18, 2022

87% Match

Ricardo Baptista, Youssef Marzouk, Olivier Zahm

Computation

Probability

Statistics Theory

We consider the problem of reducing the dimensions of parameters and data in non-Gaussian Bayesian inference problems. Our goal is to identify an "informed" subspace of the parameters and an "informative" subspace of the data so that a high-dimensional inference problem can be approximately reformulated in low-to-moderate dimensions, thereby improving the computational efficiency of many inference techniques. To do so, we exploit gradient evaluations of the log-likelihood fun...

Find SimilarView on arXiv

Statistical mechanics of complex neural systems and high dimensional data

January 30, 2013

87% Match

Madhu Advani, Subhaneil Lahiri, Surya Ganguli

Neurons and Cognition

Disordered Systems and Neura...

Machine Learning

Recent experimental advances in neuroscience have opened new vistas into the immense complexity of neuronal networks. This proliferation of data challenges us on two parallel fronts. First, how can we form adequate theoretical frameworks for understanding how dynamical network processes cooperate across widely disparate spatiotemporal scales to solve important computational problems? And second, how can we extract meaningful models of neuronal systems from high dimensional da...

Find SimilarView on arXiv

Universality laws for randomized dimension reduction, with applications

November 30, 2015

87% Match

Samet Oymak, Joel A. Tropp

math.PR

cs.DS

cs.IT

math.IT

math.ST

stat.ML

stat.TH

Dimension reduction is the process of embedding high-dimensional data into a lower dimensional space to facilitate its analysis. In the Euclidean setting, one fundamental technique for dimension reduction is to apply a random linear map to the data. This dimension reduction procedure succeeds when it preserves certain geometric features of the set. The question is how large the embedding dimension must be to ensure that randomized dimension reduction succeeds with high prob...

Find SimilarView on arXiv

Physically optimizing inference

May 19, 2018

87% Match

Audrey Huang, Benjamin Sheldan, ... , Thomson Matt

Statistical Mechanics

Disordered Systems and Neura...

Data Analysis, Statistics an...

Molecular Networks

Quantitative Methods

Data is scaling exponentially in fields ranging from genomics to neuroscience to economics. A central question is: can modern machine learning methods be applied to construct predictive models of natural systems like cells and brains based on large data sets? In this paper, we examine how inference is impacted when training data is generated by the statistical behavior of a physical system, and hence outside direct control by the experimentalist. We develop an information-the...

Find SimilarView on arXiv

High-Dimensional Non-Convex Landscapes and Gradient Descent Dynamics

August 7, 2023

87% Match

Tony Bonnaire, Davide Ghio, Kamesh Krishnamurthy, Francesca Mignacco, ... , Biroli Giulio

Disordered Systems and Neura...

Statistical Mechanics

In these lecture notes we present different methods and concepts developed in statistical physics to analyze gradient descent dynamics in high-dimensional non-convex landscapes. Our aim is to show how approaches developed in physics, mainly statistical physics of disordered systems, can be used to tackle open questions on high-dimensional dynamics in Machine Learning.

Find SimilarView on arXiv

Observed Universality of Phase Transitions in High-Dimensional Geometry, with Implications for Modern Data Analysis and Signal Processing

June 14, 2009

87% Match

David L. Donoho, Jared Tanner

math.ST

cs.IT

math.IT

physics.data-an

stat.CO

stat.TH

We review connections between phase transitions in high-dimensional combinatorial geometry and phase transitions occurring in modern high-dimensional data analysis and signal processing. In data analysis, such transitions arise as abrupt breakdown of linear model selection, robust data fitting or compressed sensing reconstructions, when the complexity of the model or the number of outliers increases beyond a threshold. In combinatorial geometry these transitions appear as abr...

Find SimilarView on arXiv

Fast global convergence of gradient methods for high-dimensional statistical recovery

April 26, 2011

87% Match

Alekh Agarwal, Sahand N. Negahban, Martin J. Wainwright

Machine Learning

Information Theory

Many statistical $M$-estimators are based on convex optimization problems formed by the combination of a data-dependent loss function with a norm-based regularizer. We analyze the convergence rates of projected gradient and composite gradient methods for solving such problems, working within a high-dimensional framework that allows the data dimension $\pdim$ to grow with (and possibly exceed) the sample size $\numobs$. This high-dimensional structure precludes the usual globa...

Find SimilarView on arXiv

High Dimensional Inference with Random Maximum A-Posteriori Perturbations

February 10, 2016

87% Match

Tamir Hazan, Francesco Orabona, Anand D. Sarwate, ... , Jaakkola Tommi

Machine Learning

Information Theory

Machine Learning

This paper presents a new approach, called perturb-max, for high-dimensional statistical inference that is based on applying random perturbations followed by optimization. This framework injects randomness to maximum a-posteriori (MAP) predictors by randomly perturbing the potential function for the input. A classic result from extreme value statistics asserts that perturb-max operations generate unbiased samples from the Gibbs distribution using high-dimensional perturbation...

Find SimilarView on arXiv

Statistical Mechanics of High-Dimensional Inference

Inferring a property of a large system from a small number of samples

On polynomial-time computation of high-dimensional posterior measures by Langevin-type algorithms

Gradient-based data and parameter dimension reduction for Bayesian models: an information theoretic perspective

Statistical mechanics of complex neural systems and high dimensional data

Universality laws for randomized dimension reduction, with applications

Physically optimizing inference

High-Dimensional Non-Convex Landscapes and Gradient Descent Dynamics

Observed Universality of Phase Transitions in High-Dimensional Geometry, with Implications for Modern Data Analysis and Signal Processing

Fast global convergence of gradient methods for high-dimensional statistical recovery

High Dimensional Inference with Random Maximum A-Posteriori Perturbations