July 3, 2019
Similar papers 4
May 23, 2019
Resampling techniques are widely used in statistical inference and ensemble learning, where the statistical properties of estimators are essential. However, existing methods are computationally demanding, because estimation/learning must be repeated, via numerical optimization or integration, for each resampled dataset. In this study, we introduce a computationally efficient method to resolve this problem: replicated vector approximate message passing. This is based on a combina...
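The computational burden this abstract refers to is easy to see in the naive baseline it aims to replace: the estimator must be recomputed from scratch on every resampled dataset. A minimal sketch of that baseline (plain bootstrap of a median's standard error; the data, sizes, and estimator here are illustrative, not from the paper):

```python
import numpy as np

rng = np.random.default_rng(0)
data = rng.normal(loc=1.0, scale=2.0, size=500)  # toy sample

# Naive bootstrap: repeat the estimation for each resampled dataset.
B = 2000
medians = np.empty(B)
for b in range(B):
    resample = rng.choice(data, size=data.size, replace=True)
    medians[b] = np.median(resample)  # one full re-estimation per resample

# Bootstrap estimate of the median's standard error.
se = medians.std(ddof=1)
print(f"bootstrap SE of the median: {se:.3f}")
```

When each re-estimation involves an expensive optimization or integral rather than a median, this B-fold loop is exactly the cost that a message-passing approach such as the paper's would avoid.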
May 26, 2017
Applying standard statistical methods after model selection may yield inefficient estimators and hypothesis tests that fail to achieve nominal type-I error rates. The main issue is the fact that the post-selection distribution of the data differs from the original distribution. In particular, the observed data is constrained to lie in a subset of the original sample space that is determined by the selected model. This often makes the post-selection likelihood of the observed ...
November 17, 2009
Continuing a recent work on the statistical-mechanical analysis of minimum mean square error (MMSE) estimation in Gaussian noise via its relation to the mutual information (the I-MMSE relation), here we propose a simple and more direct relationship between optimum estimation and certain information measures (e.g., the information density and the Fisher information), which can be viewed as partition functions and hence are amenable to analysis using statistical-mecha...
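The MMSE quantity at the heart of this abstract is concrete and easy to probe numerically. For a binary signal in Gaussian noise, the optimum (conditional-mean) estimator has a closed form, and the MMSE can be estimated by Monte Carlo; a sketch (the BPSK setup and sample sizes are illustrative choices, not the paper's):

```python
import numpy as np

rng = np.random.default_rng(1)

def mmse_bpsk(snr, n=200_000):
    # X = +/-1 equiprobable, Y = sqrt(snr) * X + Z with Z ~ N(0, 1).
    x = rng.choice([-1.0, 1.0], size=n)
    y = np.sqrt(snr) * x + rng.normal(size=n)
    # Optimum estimator is the conditional mean E[X|Y] = tanh(sqrt(snr) * Y).
    x_hat = np.tanh(np.sqrt(snr) * y)
    return np.mean((x - x_hat) ** 2)

for snr in (0.1, 1.0, 10.0):
    print(f"snr={snr}: MMSE ~ {mmse_bpsk(snr):.4f}")
```

The MMSE decreases monotonically in SNR, from 1 (the prior variance) toward 0, which is the behavior the I-MMSE relation ties to the slope of the mutual information.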
January 27, 2021
Identifying the relevant coarse-grained degrees of freedom in a complex physical system is a key stage in developing powerful effective theories in and out of equilibrium. The celebrated renormalization group provides a framework for this task, but its practical execution in unfamiliar systems is fraught with ad hoc choices, whereas machine learning approaches, though promising, often lack formal interpretability. Recently, the optimal coarse-graining in a statistical system ...
July 20, 2020
This two-part work considers the minimum mean square error (MMSE) estimation problem for a high dimensional multi-layer generalized linear model (ML-GLM), which resembles a feed-forward fully connected deep learning network in that each of its layers mixes the random input with a known weighting matrix and activates the result via non-linear functions, except that the activation here is stochastic, following some random distribution. Part I of the work focuses on the e...
June 28, 2023
In recent years, statistical physics has proven to be a valuable tool for probing large-dimensional inference problems such as those occurring in machine learning. Statistical physics provides analytical tools to study the fundamental limits of their solutions and proposes algorithms to solve individual instances. In these notes, based on the lectures by Marc Mézard in 2022 at the summer school in Les Houches, we will present a general framework that can be used in a l...
April 25, 2018
This paper rigorously establishes that the existence of the maximum likelihood estimate (MLE) in high-dimensional logistic regression models with Gaussian covariates undergoes a sharp `phase transition'. We introduce an explicit boundary curve $h_{\text{MLE}}$, parameterized by two scalars measuring the overall magnitude of the unknown sequence of regression coefficients, with the following property: in the limit of large sample sizes $n$ and number of features $p$ proportion...
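The phase transition this abstract describes has a simple geometric counterpart: the logistic MLE exists if and only if the two label classes are not linearly separable, and separability can be checked as a linear feasibility problem. A sketch for the null-coefficient case (labels independent of Gaussian covariates, where the transition sits at p/n = 1/2 by Cover's classical result); the sample sizes and the helper `mle_exists` are illustrative, not the paper's construction:

```python
import numpy as np
from scipy.optimize import linprog

rng = np.random.default_rng(2)

def mle_exists(n, p):
    """Logistic MLE exists iff the labels are NOT linearly separable.

    Separability reduces to finding w with y_i * <x_i, w> >= 1 for all i
    (scale invariance lets us replace '> 0' by '>= 1').
    """
    X = rng.normal(size=(n, p))
    y = rng.choice([-1.0, 1.0], size=n)   # null signal: labels independent of X
    A_ub = -y[:, None] * X                # encodes -y_i <x_i, w> <= -1
    b_ub = -np.ones(n)
    res = linprog(np.zeros(p), A_ub=A_ub, b_ub=b_ub,
                  bounds=[(None, None)] * p, method="highs")
    return res.status != 0  # infeasible => not separable => MLE exists

print(mle_exists(400, 40))    # p/n = 0.1, well below 1/2: MLE exists w.h.p.
print(mle_exists(400, 320))   # p/n = 0.8, above 1/2: separable, no MLE w.h.p.
```

With nonzero signal strength the boundary moves below 1/2, which is what the paper's curve $h_{\text{MLE}}$ quantifies.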
January 27, 2021
We establish exact asymptotic expressions for the normalized mutual information and minimum mean-square-error (MMSE) of sparse linear regression in the sub-linear sparsity regime. Our result is achieved by a generalization of the adaptive interpolation method in Bayesian inference from linear regimes to sub-linear ones. A modification of the well-known approximate message passing algorithm to approach the MMSE fundamental limit is also proposed, and its state evolution is rigo...
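For readers unfamiliar with approximate message passing (AMP), the basic iteration mentioned here is short: a matched-filter step, a scalar denoiser, and a residual update with an Onsager correction term. A toy sketch for sparse linear regression, using a soft-thresholding denoiser (the LASSO-style variant; the paper's Bayes-optimal version would use the posterior-mean denoiser, and the problem sizes below are illustrative):

```python
import numpy as np

rng = np.random.default_rng(3)

# Toy instance y = A x0 + w with sparse x0 (sizes chosen for illustration).
p, n, k, sigma = 1000, 600, 40, 0.01
A = rng.normal(size=(n, p)) / np.sqrt(n)
x0 = np.zeros(p)
x0[rng.choice(p, size=k, replace=False)] = rng.choice([-1.0, 1.0], size=k)
y = A @ x0 + sigma * rng.normal(size=n)

def soft(r, t):
    # Soft-thresholding denoiser eta(r; t).
    return np.sign(r) * np.maximum(np.abs(r) - t, 0.0)

x = np.zeros(p)
z = y.copy()
alpha = 1.5  # threshold tuning parameter (an assumed, hand-picked value)
for _ in range(30):
    tau = alpha * np.linalg.norm(z) / np.sqrt(n)    # effective noise level
    r = x + A.T @ z                                 # matched-filter step
    x_new = soft(r, tau)                            # denoise
    # Onsager correction: (p/n) * average derivative of the denoiser.
    onsager = (p / n) * np.mean(np.abs(x_new) > 0)
    z = y - A @ x_new + onsager * z                 # corrected residual
    x = x_new

err = np.linalg.norm(x - x0) ** 2 / np.linalg.norm(x0) ** 2
print(f"relative squared error: {err:.4f}")
```

The Onsager term is what makes the effective noise at each iteration Gaussian, so the algorithm's progress is tracked exactly by the scalar state-evolution recursion the abstract refers to.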
January 24, 2013
We consider dictionary learning and blind calibration for signals and matrices created from a random ensemble. We study the mean-squared error in the limit of large signal dimension using the replica method and unveil the appearance of phase transitions delimiting impossible, possible-but-hard and possible inference regions. We also introduce an approximate message passing algorithm that asymptotically matches the theoretical performance, and show through numerical tests that...
January 31, 2024
A common goal in statistics and machine learning is estimation of unknowns. Point estimates alone are of little value without an accompanying measure of uncertainty, but traditional uncertainty quantification methods, such as confidence sets and p-values, often require strong distributional or structural assumptions that may not be justified in modern problems. The present paper considers a very common case in machine learning, where the quantity of interest is the minimizer ...
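As a point of contrast for the setting this abstract targets, the generic assumption-light baseline for a risk minimizer is a resampling interval around the empirical minimizer. A sketch using a Huber-risk minimizer with a percentile-bootstrap interval (the loss, grid minimization, and helper names `huber_risk`/`erm` are illustrative choices, not the paper's inferential-model construction):

```python
import numpy as np

rng = np.random.default_rng(4)
x = rng.standard_t(df=3, size=400) + 2.0   # heavy-tailed toy sample, center 2

def huber_risk(theta, data, delta=1.0):
    # Empirical Huber risk: quadratic near theta, linear in the tails.
    r = np.abs(data - theta)
    return np.mean(np.where(r <= delta, 0.5 * r**2, delta * (r - 0.5 * delta)))

def erm(data, grid=np.linspace(-2.0, 6.0, 401)):
    # Empirical risk minimizer by grid search (fine enough for a 1-d sketch).
    risks = np.array([huber_risk(t, data) for t in grid])
    return grid[np.argmin(risks)]

theta_hat = erm(x)

# Percentile-bootstrap interval for the minimizer.
B = 300
boot = np.array([erm(rng.choice(x, size=x.size, replace=True))
                 for _ in range(B)])
lo, hi = np.quantile(boot, [0.025, 0.975])
print(f"point estimate {theta_hat:.3f}, 95% interval ({lo:.3f}, {hi:.3f})")
```

The interval here inherits the usual caveats of the bootstrap; the abstract's point is precisely that one can aim for uncertainty quantification for such minimizers with weaker distributional assumptions than classical methods require.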