Similar papers
February 11, 2020
We consider the problem of learning a coefficient vector $x_0 \in \mathbb{R}^N$ from noisy linear observations $y = F x_0 + w \in \mathbb{R}^M$ in the high-dimensional limit $M, N \to \infty$ with $\alpha = M/N$ fixed. We provide a rigorous derivation of an explicit formula -- first conjectured using heuristic methods from statistical physics -- for the asymptotic mean squared error obtained by penalized convex regression estimators such as the LASSO or the elastic net, for a class of very ...
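A minimal simulation sketch of the setup above, not the paper's analytical formula: draw an i.i.d. Gaussian design $F$, a sparse $x_0$, and noise $w$, then measure the per-coordinate MSE of the LASSO. The sparsity level, noise scale, and penalty below are illustrative choices.

```python
# Hedged sketch: empirical LASSO MSE for y = F x0 + w at aspect ratio alpha = M/N.
# Sparsity, noise scale, and penalty are illustrative, not taken from the paper.
import numpy as np
from sklearn.linear_model import Lasso

rng = np.random.default_rng(0)
N, alpha = 1000, 0.5
M = int(alpha * N)

x0 = np.zeros(N)
support = rng.choice(N, size=N // 10, replace=False)   # 10% sparse signal
x0[support] = rng.normal(size=support.size)

F = rng.normal(scale=1 / np.sqrt(N), size=(M, N))      # i.i.d. Gaussian design
w = rng.normal(scale=0.1, size=M)                      # observation noise
y = F @ x0 + w

lasso = Lasso(alpha=0.01, fit_intercept=False, max_iter=10_000)
lasso.fit(F, y)

mse = np.mean((lasso.coef_ - x0) ** 2)
print(f"empirical per-coordinate MSE: {mse:.4f}")
```

Averaging this MSE over repeated draws is the quantity whose $M, N \to \infty$ limit the paper's formula describes.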
November 14, 2023
We study mean-field variational inference in a Bayesian linear model when the sample size $n$ is comparable to the dimension $p$. In high dimensions, the common approach of minimizing a Kullback-Leibler divergence from the posterior distribution, or maximizing an evidence lower bound, may deviate from the true posterior mean and underestimate posterior uncertainty. We study instead minimization of the TAP free energy, showing in a high-dimensional asymptotic framework that it has...
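For context on what the abstract calls the common approach, here is a minimal coordinate-ascent (CAVI) sketch of naive KL-based mean-field VI in a conjugate Gaussian linear model; this is the baseline being critiqued, not the paper's TAP free-energy method. Problem sizes, variances, and sweep count are illustrative.

```python
# Hedged sketch: naive mean-field VI (CAVI) for y = X theta + eps,
# theta_j ~ N(0, tau2), eps ~ N(0, sigma2 I). Not the TAP approach.
import numpy as np

rng = np.random.default_rng(1)
n, p = 200, 150                                   # n comparable to p
sigma2, tau2 = 0.25, 1.0

X = rng.normal(size=(n, p)) / np.sqrt(n)
theta = rng.normal(scale=np.sqrt(tau2), size=p)
y = X @ theta + rng.normal(scale=np.sqrt(sigma2), size=n)

m = np.zeros(p)                                   # variational means
col_norm2 = (X ** 2).sum(axis=0)
s2 = 1.0 / (col_norm2 / sigma2 + 1.0 / tau2)      # variational variances (fixed point)

for _ in range(50):                               # CAVI sweeps
    for j in range(p):
        r = y - X @ m + X[:, j] * m[j]            # residual excluding coordinate j
        m[j] = s2[j] * (X[:, j] @ r) / sigma2

print("posterior-mean MSE:", np.mean((m - theta) ** 2))
print("mean variational variance:", s2.mean())
```

In the proportional regime $n \asymp p$, the product-form variances `s2` are where this approximation can understate posterior uncertainty, which is the failure mode the abstract addresses.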
June 7, 2021
Generalised linear models for multi-class classification problems are one of the fundamental building blocks of modern machine learning tasks. In this manuscript, we characterise the learning of a mixture of $K$ Gaussians with generic means and covariances via empirical risk minimisation (ERM) with any convex loss and regularisation. In particular, we prove exact asymptotics characterising the ERM estimator in high-dimensions, extending several previous results about Gaussian...
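A minimal sketch of this ERM setup, with illustrative stand-ins for the generic means and covariances: sample a mixture of $K$ Gaussians and fit a linear classifier by minimizing a convex loss (multinomial logistic) with ridge regularization.

```python
# Hedged sketch: convex ERM on a K-component Gaussian mixture.
# Means, (identity) covariances, and penalty strength are illustrative.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(2)
K, d, n_per = 3, 50, 300

means = rng.normal(scale=2.0, size=(K, d))
X = np.vstack([means[k] + rng.normal(size=(n_per, d)) for k in range(K)])
y = np.repeat(np.arange(K), n_per)

# Convex loss (cross-entropy) + L2 regularization; C controls the penalty.
clf = LogisticRegression(C=1.0, max_iter=2000).fit(X, y)
print("training accuracy:", clf.score(X, y))
```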
March 3, 2019
In this paper we adopt the familiar sparse, high-dimensional linear regression model and focus on the important but often overlooked task of prediction. In particular, we consider a new empirical Bayes framework that incorporates data in the prior in two ways: one is to center the prior for the non-zero regression coefficients and the other is to provide some additional regularization. We show that, in certain settings, the asymptotic concentration of the proposed empirical B...
April 27, 2024
This paper investigates the iterates $\hat{b}^1,\dots,\hat{b}^T$ obtained from iterative algorithms in high-dimensional linear regression problems, in the regime where the feature dimension $p$ is comparable with the sample size $n$, i.e., $p \asymp n$. The analysis and proposed estimators are applicable to Gradient Descent (GD), proximal GD and their accelerated variants such as Fast Iterative Soft-Thresholding (FISTA). The paper proposes novel estimators for the generalization er...
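A minimal FISTA sketch producing the iterates $\hat{b}^1,\dots,\hat{b}^T$ for the LASSO objective in the $p \asymp n$ regime; step size, penalty, and problem sizes are illustrative, and the paper's generalization-error estimators are not reproduced here.

```python
# Hedged sketch: FISTA for (1/2)||y - X b||^2 + lam * ||b||_1, recording
# each iterate hat{b}^t. Tuning choices below are illustrative.
import numpy as np

def soft_threshold(v, t):
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def fista(X, y, lam, T):
    L = np.linalg.norm(X, 2) ** 2               # Lipschitz constant of the gradient
    b = np.zeros(X.shape[1]); z = b.copy(); s = 1.0
    iterates = []
    for _ in range(T):
        g = X.T @ (X @ z - y)                   # gradient of the smooth part at z
        b_next = soft_threshold(z - g / L, lam / L)
        s_next = (1 + np.sqrt(1 + 4 * s ** 2)) / 2
        z = b_next + ((s - 1) / s_next) * (b_next - b)
        b, s = b_next, s_next
        iterates.append(b.copy())
    return iterates

rng = np.random.default_rng(3)
n, p = 300, 400                                 # p comparable with n
X = rng.normal(size=(n, p)) / np.sqrt(n)
b0 = np.zeros(p); b0[:20] = 1.0
y = X @ b0 + 0.1 * rng.normal(size=n)

iterates = fista(X, y, lam=0.05, T=50)
print("final in-sample residual norm:", np.linalg.norm(y - X @ iterates[-1]))
```

Setting the momentum step to zero (`z = b_next`) recovers plain proximal GD (ISTA), the other algorithm named above.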
November 4, 2020
This paper reviews concentration inequalities, which are widely employed in non-asymptotic analyses in mathematical statistics, across a range of settings: from distribution-free to distribution-dependent, from sub-Gaussian to sub-exponential, sub-Gamma, and sub-Weibull random variables, and from concentration of the mean to concentration of the maximum. The review presents results in these settings together with some new ones. Given the increasing popularity of high-dimensional data...
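As one representative instance of the sub-Gaussian regime such reviews cover, a standard tail bound and the maximal inequality it implies can be stated as follows.

```latex
% If X_1, \dots, X_n are independent, mean-zero, and sigma^2-sub-Gaussian,
% then for every t > 0,
\[
  \mathbb{P}\!\left( \frac{1}{n}\sum_{i=1}^{n} X_i \ge t \right)
  \le \exp\!\left( -\frac{n t^2}{2\sigma^2} \right),
\]
% and the maximum of m such sample means concentrates as
\[
  \mathbb{E}\Big[ \max_{1 \le j \le m} \bar{X}_j \Big]
  \le \sigma \sqrt{\frac{2\log m}{n}} .
\]
```

The sub-exponential, sub-Gamma, and sub-Weibull settings mentioned above relax the tail condition and correspondingly weaken the exponent in such bounds.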
June 2, 2022
We propose a new approach to deriving quantitative mean field approximations for any probability measure $P$ on $\mathbb{R}^n$ with density proportional to $e^{f(x)}$, for $f$ strongly concave. We bound the mean field approximation for the log partition function $\log \int e^{f(x)}dx$ in terms of $\sum_{i \neq j}\mathbb{E}_{Q^*}|\partial_{ij}f|^2$, for a semi-explicit probability measure $Q^*$ characterized as the unique mean field optimizer, or equivalently as the minimizer ...
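Concretely, the mean field approximation in question is the restriction of the Gibbs variational principle to product measures; schematically:

```latex
% Gibbs variational principle restricted to products Q = Q_1 \otimes \cdots \otimes Q_n:
\[
  \log \int_{\mathbb{R}^n} e^{f(x)}\,dx
  \;\ge\; \sup_{Q = \otimes_i Q_i} \Big\{ \mathbb{E}_{Q}[f] + H(Q) \Big\},
\]
% where H(Q) is the differential entropy. The abstract's result controls the
% gap in this inequality by \sum_{i \neq j} \mathbb{E}_{Q^*}|\partial_{ij} f|^2,
% evaluated at the mean field optimizer Q^*.
```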
August 1, 2020
Gaussian processes are distributions over functions that are versatile and mathematically convenient priors in Bayesian modelling. However, their use is often impeded for data with large numbers of observations, $N$, due to the cubic (in $N$) cost of matrix operations used in exact inference. Many solutions have been proposed that rely on $M \ll N$ inducing variables to form an approximation at a cost of $\mathcal{O}(NM^2)$. While the computational cost appears linear in $N$,...
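A minimal sketch of the inducing-variable idea behind the $\mathcal{O}(NM^2)$ cost: a subset-of-regressors (Nyström-style) predictive mean with $M \ll N$ inducing inputs. The kernel, jitter, and inducing-point selection are illustrative, not the specific sparse-GP construction analyzed above.

```python
# Hedged sketch: subset-of-regressors GP regression with M inducing inputs.
# Only M x M and M x N kernel blocks are formed; no N x N matrix appears.
import numpy as np

def rbf(A, B, ls=0.5):
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-0.5 * d2 / ls ** 2)

rng = np.random.default_rng(4)
N, M, noise = 2000, 50, 0.1

X = rng.uniform(-3, 3, size=(N, 1))
y = np.sin(2 * X[:, 0]) + noise * rng.normal(size=N)
Z = X[rng.choice(N, size=M, replace=False)]        # M << N inducing inputs

Kmm = rbf(Z, Z) + 1e-6 * np.eye(M)                 # M x M (jitter for stability)
Kmn = rbf(Z, X)                                    # M x N

# Forming Kmn @ Kmn.T costs O(N M^2); everything after is O(M^3) or less.
A = Kmm + (Kmn @ Kmn.T) / noise ** 2
alpha = np.linalg.solve(A, Kmn @ y) / noise ** 2

Xs = np.linspace(-3, 3, 5)[:, None]                # a few test inputs
mean_s = rbf(Xs, Z) @ alpha                        # approximate predictive mean
print(np.c_[Xs[:, 0], mean_s, np.sin(2 * Xs[:, 0])])
```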
December 7, 2017
This paper focuses on dimension-free PAC-Bayesian bounds under weak polynomial moment assumptions, allowing for heavy-tailed sample distributions. It covers the estimation of the mean of a vector or a matrix, with applications to least squares linear regression. Special attention is devoted to the estimation of Gram matrices, due to their prominent role in high-dimensional data analysis.
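The paper's PAC-Bayesian estimators are not reproduced here; purely as a point of comparison, the sketch below implements median-of-means, a standard baseline for heavy-tailed mean estimation under weak moment assumptions. The block count is an illustrative tuning choice.

```python
# Hedged sketch: coordinate-wise median-of-means for a heavy-tailed sample.
# This is a generic baseline, not the paper's PAC-Bayesian construction.
import numpy as np

def median_of_means(X, k=10):
    # Split the n samples into k blocks, average within each block,
    # then take the coordinate-wise median of the block means.
    blocks = np.array_split(X, k)
    block_means = np.stack([b.mean(axis=0) for b in blocks])
    return np.median(block_means, axis=0)

rng = np.random.default_rng(5)
X = rng.standard_t(df=2.5, size=(5000, 20))        # heavy tails, true mean 0
print("empirical mean   max |error|:", np.abs(X.mean(axis=0)).max())
print("median-of-means  max |error|:", np.abs(median_of_means(X, k=25)).max())
```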
November 4, 2019
We conduct a non-asymptotic analysis of mean-field variational inference for approximating posterior distributions in complex Bayesian models that may involve latent variables. We show that the mean-field approximation to the posterior is well-approximated, in Kullback-Leibler divergence, by a normal distribution whose center is the maximum likelihood estimator (MLE). In particular, our results imply that the center of the mean-field appro...