July 9, 2016
Consider the problem of estimating parameters $X^n \in \mathbb{R}^n$, generated by a stationary process, from $m$ response variables $Y^m = A X^n + Z^m$, under the assumption that the distribution of $X^n$ is known. This is the most general version of the Bayesian linear regression problem. The lack of computationally feasible algorithms that can employ generic prior distributions and provide a good estimate of $X^n$ has limited the set of distributions researchers use to model...
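The Gaussian case makes the setup concrete: when the prior on $X^n$ is i.i.d. Gaussian, the Bayes estimate has a closed form, which is exactly what fails for the generic priors the abstract has in mind. A minimal numpy sketch, with all dimensions and variances chosen for illustration:

```python
import numpy as np

# Minimal sketch of the model Y = A X + Z with a Gaussian prior on X.
# With X ~ N(0, s2*I) and Z ~ N(0, n2*I), the Bayes-optimal (MMSE)
# estimate is available in closed form; generic priors are what make
# the general problem computationally hard.
rng = np.random.default_rng(0)
n, m = 200, 100          # signal dimension and number of responses
s2, n2 = 1.0, 0.1        # prior and noise variances (assumed values)

A = rng.standard_normal((m, n)) / np.sqrt(n)
x = rng.standard_normal(n) * np.sqrt(s2)        # stand-in for the stationary prior
y = A @ x + rng.standard_normal(m) * np.sqrt(n2)

# Posterior mean: E[X|Y] = s2 * A^T (s2 * A A^T + n2 * I)^{-1} y
x_hat = s2 * A.T @ np.linalg.solve(s2 * (A @ A.T) + n2 * np.eye(m), y)
print("normalized MSE:", np.sum((x - x_hat) ** 2) / np.sum(x ** 2))
```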
February 24, 2010
In this paper, we study inference for high-dimensional data characterized by small sample sizes relative to the dimension of the data. In particular, we provide an infinite-dimensional framework to study statistical models that involve situations in which (i) the number of parameters increases with the sample size (and is itself allowed to be random) and (ii) there is a possibility of missing data. Under a variety of tail conditions on the components of the data, we provide precise...
August 5, 2015
Markov chain Monte Carlo (MCMC) lies at the core of modern Bayesian methodology, much of which would be impossible without it. Thus, the convergence properties of MCMC algorithms have received significant attention; in particular, proving (geometric) ergodicity is of critical interest. Trust in the ability of MCMC to sample from modern-day high-dimensional posteriors, however, has been limited by a widespread perception that these chains typically experience serious convergence pr...
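As a reference point for the kind of chain whose (geometric) ergodicity is studied, here is a minimal random-walk Metropolis sampler on a Gaussian target; the target and step size are illustrative, not taken from the paper:

```python
import numpy as np

# Random-walk Metropolis on a standard Gaussian target: one of the
# simplest chains for which geometric ergodicity is well understood.
rng = np.random.default_rng(1)

def log_target(x):
    return -0.5 * np.dot(x, x)   # log density of N(0, I), up to a constant

d, steps, step_size = 10, 5000, 0.5
x = np.zeros(d)
samples = np.empty((steps, d))
for t in range(steps):
    prop = x + step_size * rng.standard_normal(d)
    # Accept with probability min(1, pi(prop) / pi(x))
    if np.log(rng.uniform()) < log_target(prop) - log_target(x):
        x = prop
    samples[t] = x
print("sample mean norm:", np.linalg.norm(samples[2500:].mean(axis=0)))
```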
March 30, 2020
We study general singular value shrinkage estimators in high-dimensional regression and classification, when the number of features and the sample size both grow proportionally to infinity. We allow models with general covariance matrices, which include a large class of data-generating distributions. As far as the implications of our results are concerned, we find exact asymptotic formulas for both the training and test errors in regression models fitted by gradient descent, wh...
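A hedged sketch of what a singular value shrinkage estimator does in the simplest denoising setting: take the SVD of the observation and shrink the spectrum. The soft-threshold rule below is one common choice, not necessarily the shrinker the paper analyzes:

```python
import numpy as np

# Observe a low-rank matrix plus noise, shrink the singular values of
# the observation, and reconstruct. Sizes and noise level are illustrative.
rng = np.random.default_rng(2)
p, q, r, sigma = 100, 80, 3, 0.5

signal = rng.standard_normal((p, r)) @ rng.standard_normal((r, q))
Y = signal + sigma * rng.standard_normal((p, q))

U, s, Vt = np.linalg.svd(Y, full_matrices=False)
tau = sigma * (np.sqrt(p) + np.sqrt(q))      # threshold at the noise bulk edge
s_shrunk = np.maximum(s - tau, 0.0)          # soft-threshold the spectrum
estimate = (U * s_shrunk) @ Vt

print("relative error:", np.linalg.norm(estimate - signal) / np.linalg.norm(signal))
```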
February 6, 2014
We analyse the matrix factorization problem. Given a noisy measurement of a product of two matrices, the problem is to recover the two original factors. It arises in many applications such as dictionary learning, blind matrix calibration, sparse principal component analysis, blind source separation, low-rank matrix completion, robust principal component analysis, or factor analysis. It is also important in machine learning: unsupervised representation learning can often be ...
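One standard way to attack this estimation problem is alternating least squares: fix one factor, solve for the other in closed form, then swap. A small numpy sketch, with dimensions, rank, and ridge level chosen for illustration:

```python
import numpy as np

# Alternating least squares for the noisy factorization Y ~ U V:
# each update is a ridge-regularized least-squares solve.
rng = np.random.default_rng(3)
n, m, r, lam = 60, 50, 4, 1e-3

U_true = rng.standard_normal((n, r))
V_true = rng.standard_normal((r, m))
Y = U_true @ V_true + 0.1 * rng.standard_normal((n, m))

U, V = rng.standard_normal((n, r)), rng.standard_normal((r, m))
for _ in range(50):
    # Update U with V fixed, then V with U fixed.
    U = Y @ V.T @ np.linalg.inv(V @ V.T + lam * np.eye(r))
    V = np.linalg.inv(U.T @ U + lam * np.eye(r)) @ U.T @ Y

print("fit error:", np.linalg.norm(Y - U @ V) / np.linalg.norm(Y))
```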
November 8, 2019
Deep generative priors offer powerful models for complex-structured data, such as images, audio, and text. Using these priors in inverse problems typically requires estimating the input and/or hidden signals in a multi-layer deep neural network from observation of its output. While these approaches have been successful in practice, rigorous performance analysis is complicated by the non-convex nature of the underlying optimization problems. This paper presents a novel algorit...
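The underlying optimization problem can be made concrete with a toy version: recover the input $x$ of a two-layer network $f(x) = W_2\,\mathrm{relu}(W_1 x)$ from its output by gradient descent on the squared residual. The architecture and step size below are illustrative, and the objective is non-convex in $x$:

```python
import numpy as np

# Invert a small two-layer network: observe y = f(x_true) and run
# gradient descent on ||y - f(x)||^2 over the input x.
rng = np.random.default_rng(4)
d_in, d_hid, d_out = 20, 50, 40
W1 = rng.standard_normal((d_hid, d_in)) / np.sqrt(d_in)
W2 = rng.standard_normal((d_out, d_hid)) / np.sqrt(d_hid)

def forward(x):
    return W2 @ np.maximum(W1 @ x, 0.0)

x_true = rng.standard_normal(d_in)
y = forward(x_true)

x = rng.standard_normal(d_in)            # random start (zero init has zero gradient)
for _ in range(2000):
    h = W1 @ x
    resid = forward(x) - y
    # Chain rule through the ReLU: J = W2 diag(1[h > 0]) W1
    grad = W1.T @ ((W2.T @ resid) * (h > 0))
    x -= 0.1 * grad
print("output residual:", np.linalg.norm(forward(x) - y))
```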
October 10, 2017
We perform an average-case analysis of the generalization dynamics of large neural networks trained using gradient descent. We study the practically relevant "high-dimensional" regime where the number of free parameters in the network is on the order of, or even larger than, the number of examples in the dataset. Using random matrix theory and exact solutions in linear models, we derive the generalization error and training error dynamics of learning and analyze how they depend...
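The linear-model case can be simulated directly: run full-batch gradient descent on a linear regression with roughly as many parameters as examples and track both errors over time. Sizes, noise level, and learning rate below are illustrative:

```python
import numpy as np

# Train/test error dynamics of gradient descent on a linear model in
# the high-dimensional regime (parameters ~ examples).
rng = np.random.default_rng(5)
n_train, n_test, p, lr, sigma = 100, 1000, 120, 0.05, 0.5

w_true = rng.standard_normal(p) / np.sqrt(p)
X, Xt = rng.standard_normal((n_train, p)), rng.standard_normal((n_test, p))
y = X @ w_true + sigma * rng.standard_normal(n_train)
yt = Xt @ w_true + sigma * rng.standard_normal(n_test)

w = np.zeros(p)
for step in range(1, 2001):
    w -= lr * X.T @ (X @ w - y) / n_train     # full-batch gradient step
    if step in (10, 100, 1000, 2000):
        tr = np.mean((X @ w - y) ** 2)
        te = np.mean((Xt @ w - yt) ** 2)
        print(f"step {step}: train {tr:.3f}, test {te:.3f}")
```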
May 20, 2014
This tutorial provides an exposition of a flexible geometric framework for high dimensional estimation problems with constraints. The tutorial develops geometric intuition about high dimensional sets, justifies it with some results of asymptotic convex geometry, and demonstrates connections between geometric results and estimation problems. The theory is illustrated with applications to sparse recovery, matrix completion, quantization, linear and logistic regression and gener...
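As a concrete instance of the tutorial's sparse-recovery application, the sketch below recovers a $k$-sparse vector from $m < n$ random linear measurements using orthogonal matching pursuit; the geometric theory concerns when such recovery is possible at all. All sizes are illustrative:

```python
import numpy as np

# Orthogonal matching pursuit: greedily pick the column most correlated
# with the residual, then refit by least squares on the chosen support.
rng = np.random.default_rng(6)
n, m, k = 200, 60, 5

A = rng.standard_normal((m, n)) / np.sqrt(m)
x = np.zeros(n)
x[rng.choice(n, k, replace=False)] = rng.standard_normal(k)
y = A @ x

support, resid = [], y.copy()
for _ in range(k):
    support.append(int(np.argmax(np.abs(A.T @ resid))))   # most correlated column
    coef, *_ = np.linalg.lstsq(A[:, support], y, rcond=None)
    resid = y - A[:, support] @ coef

x_hat = np.zeros(n)
x_hat[support] = coef
print("recovery error:", np.linalg.norm(x_hat - x))
```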
May 6, 2022
Inference from limited data requires a notion of measure on parameter space, most explicit in the Bayesian framework as a prior. Here we demonstrate that Jeffreys prior, the best-known uninformative choice, introduces enormous bias when applied to typical scientific models. Such models have a relevant effective dimensionality much smaller than the number of microscopic parameters. Because Jeffreys prior treats all microscopic parameters equally, it is far from uniform when projec...
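The mechanism can be seen in a toy "sloppy" model: for a two-exponential fit, the Fisher determinant, and hence the Jeffreys density $\propto \sqrt{\det g}$, collapses along parameter directions the data cannot resolve. Everything below is an illustrative computation, not the paper's:

```python
import numpy as np

# Jeffreys prior sqrt(det g) for the model y(t) = exp(-a t) + exp(-b t)
# observed at a few times with unit Gaussian noise.
t = np.linspace(0.5, 3.0, 6)

def jeffreys_density(a, b):
    # Fisher metric g = J^T J, with J the sensitivity of the predictions
    # to the parameters.
    J = np.stack([-t * np.exp(-a * t), -t * np.exp(-b * t)], axis=1)
    return np.sqrt(np.linalg.det(J.T @ J))

# The density collapses as the two rates become indistinguishable:
for a, b in [(0.5, 2.0), (1.0, 1.2), (1.0, 1.01)]:
    print(f"a={a}, b={b}: jeffreys ~ {jeffreys_density(a, b):.2e}")
```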
June 17, 2009
The replica method is a non-rigorous but well-known technique from statistical physics used in the asymptotic analysis of large, random, nonlinear problems. This paper applies the replica method, under the assumption of replica symmetry, to study estimators that are maximum a posteriori (MAP) under a postulated prior distribution. It is shown that with random linear measurements and Gaussian noise, the replica-symmetric prediction of the asymptotic behavior of the postulated ...
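The canonical instance of this setting is MAP estimation under a postulated Laplace prior with Gaussian noise and random linear measurements, i.e., the lasso. A minimal sketch solving it by ISTA, with problem sizes and regularization chosen for illustration:

```python
import numpy as np

# Lasso as a MAP estimator: y = A x + z with Gaussian noise and a
# postulated Laplace prior on x, solved by proximal gradient (ISTA).
rng = np.random.default_rng(7)
n, m, lam = 200, 100, 0.05

A = rng.standard_normal((m, n)) / np.sqrt(m)
x = np.zeros(n)
x[rng.choice(n, 10, replace=False)] = rng.standard_normal(10)
y = A @ x + 0.05 * rng.standard_normal(m)

step = 1.0 / np.linalg.norm(A, 2) ** 2      # step size from the Lipschitz constant
w = np.zeros(n)
for _ in range(500):
    g = w - step * A.T @ (A @ w - y)        # gradient step on the quadratic term
    w = np.sign(g) * np.maximum(np.abs(g) - step * lam, 0.0)  # prox of the L1 term

print("MAP estimate error:", np.linalg.norm(w - x))
```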