Understanding Phase Transitions via Mutu...

Exact results on high-dimensional linear regression via statistical physics

September 28, 2020

89% Match

Alexander Mozeika, Mansoor Sheikh, Fabian Aguirre-Lopez, ... , Coolen Anthony CC

Statistics Theory

Disordered Systems and Neura...

Statistics Theory

It is clear that conventional statistical inference protocols need to be revised to deal correctly with the high-dimensional data that are now common. Most recent studies aimed at achieving this revision rely on powerful approximation techniques, that call for rigorous results against which they can be tested. In this context, the simplest case of high-dimensional linear regression has acquired significant new relevance and attention. In this paper we use the statistical phys...

Find Similar View on arXiv

Asymptotics of Non-Convex Generalized Linear Models in High-Dimensions: A proof of the replica formula

February 27, 2025

89% Match

Matteo Vilucchio, Yatin Dandi, ... , Krzakala Florent

Machine Learning

The analytic characterization of the high-dimensional behavior of optimization for Generalized Linear Models (GLMs) with Gaussian data has been a central focus in statistics and probability in recent years. While convex cases, such as the LASSO, ridge regression, and logistic regression, have been extensively studied using a variety of techniques, the non-convex case remains far less understood despite its significance. A non-rigorous statistical physics framework has provide...

Find SimilarView on arXiv

Isolating the hard core of phaseless inference: the Phase selection formulation

February 6, 2025

89% Match

Davide Straziota, Luca Saglietti

Disordered Systems and Neura...

Real-valued Phase retrieval is a non-convex continuous inference problem, where a high-dimensional signal is to be reconstructed from a dataset of signless linear measurements. Focusing on the noiseless case, we aim to disentangle the two distinct sub-tasks entailed in the Phase retrieval problem: the hard combinatorial problem of retrieving the missing signs of the measurements, and the nested convex problem of regressing the input-output observations to recover the hidden s...

Find SimilarView on arXiv

High-dimensional inference: a statistical mechanics perspective

October 28, 2020

89% Match

Jean Barbier

Disordered Systems and Neura...

Statistical Mechanics

Information Theory

Machine Learning

Information Theory

Statistical inference is the science of drawing conclusions about some system from data. In modern signal processing and machine learning, inference is done in very high dimension: very many unknown characteristics about the system have to be deduced from a lot of high-dimensional noisy data. This "high-dimensional regime" is reminiscent of statistical mechanics, which aims at describing the macroscopic behavior of a complex system based on the knowledge of its microscopic in...

Find SimilarView on arXiv

Mutual information for symmetric rank-one matrix estimation: A proof of the replica formula

June 13, 2016

89% Match

Jean Barbier, Mohamad Dia, Nicolas Macris, Florent Krzakala, ... , Zdeborova Lenka

Information Theory

Disordered Systems and Neura...

Machine Learning

Information Theory

Mathematical Physics

Factorizing low-rank matrices has many applications in machine learning and statistics. For probabilistic models in the Bayes optimal setting, a general expression for the mutual information has been proposed using heuristic statistical physics computations, and proven in few specific cases. Here, we show how to rigorously prove the conjectured formula for the symmetric rank-one case. This allows to express the minimal mean-square-error and to characterize the detectability p...

Find SimilarView on arXiv

Optimal Errors and Phase Transitions in High-Dimensional Generalized Linear Models

August 10, 2017

89% Match

Jean Barbier, Florent Krzakala, Nicolas Macris, ... , Zdeborová Lenka

cs.IT

cond-mat.dis-nn

cs.AI

cs.LG

math.IT

math.MP

Generalized linear models (GLMs) arise in high-dimensional machine learning, statistics, communications and signal processing. In this paper we analyze GLMs when the data matrix is random, as relevant in problems such as compressed sensing, error-correcting codes or benchmark models in neural networks. We evaluate the mutual information (or "free entropy") from which we deduce the Bayes-optimal estimation and generalization errors. Our analysis applies to the high-dimensional...

Find SimilarView on arXiv

Fundamental Limits of PhaseMax for Phase Retrieval: A Replica Analysis

August 10, 2017

88% Match

Oussama Dhifallah, Yue M. Lu

Information Theory

We consider a recently proposed convex formulation, known as the PhaseMax method, for solving the phase retrieval problem. Using the replica method from statistical mechanics, we analyze the performance of PhaseMax in the high-dimensional limit. Our analysis predicts the \emph{exact} asymptotic performance of PhaseMax. In particular, we show that a sharp phase transition phenomenon takes place, with a simple analytical formula characterizing the phase transition boundary. Thi...

Find SimilarView on arXiv

Statistical Mechanics of High-Dimensional Inference

January 18, 2016

88% Match

Madhu Advani, Surya Ganguli

stat.ML

cond-mat.dis-nn

cond-mat.stat-mech

math.ST

q-bio.QM

stat.TH

To model modern large-scale datasets, we need efficient algorithms to infer a set of $P$ unknown model parameters from $N$ noisy measurements. What are fundamental limits on the accuracy of parameter inference, given finite signal-to-noise ratios, limited measurements, prior information, and computational tractability requirements? How can we combine prior information with measurements to achieve these limits? Classical statistics gives incisive answers to these questions as ...

Find SimilarView on arXiv

Replica Analysis for Generalized Linear Regression with IID Row Prior

September 25, 2021

88% Match

Qiuyun Zou, Hongwen Yang

Information Theory

Different from a typical independent identically distributed (IID) element assumption, this paper studies the estimation of IID row random matrix for the generalized linear model constructed by a linear mixing space and a row-wise mapping channel. The objective inference problem arises in many engineering fields, such as wireless communications, compressed sensing, and phase retrieval. We apply the replica method from statistical mechanics to analyze the exact minimum mean sq...

Find SimilarView on arXiv

Statistical Physics of Signal Estimation in Gaussian Noise: Theory and Examples of Phase Transitions

December 29, 2008

87% Match

Neri Shitz Merhav, Dongning Shitz Guo, Shlomo Shitz Shamai

Information Theory

We consider the problem of signal estimation (denoising) from a statistical mechanical perspective, using a relationship between the minimum mean square error (MMSE), of estimating a signal, and the mutual information between this signal and its noisy version. The paper consists of essentially two parts. In the first, we derive several statistical-mechanical relationships between a few important quantities in this problem area, such as the MMSE, the differential entropy, the ...

Find SimilarView on arXiv

Understanding Phase Transitions via Mutual Information and MMSE

Exact results on high-dimensional linear regression via statistical physics

Asymptotics of Non-Convex Generalized Linear Models in High-Dimensions: A proof of the replica formula

Isolating the hard core of phaseless inference: the Phase selection formulation

High-dimensional inference: a statistical mechanics perspective

Mutual information for symmetric rank-one matrix estimation: A proof of the replica formula

Optimal Errors and Phase Transitions in High-Dimensional Generalized Linear Models

Fundamental Limits of PhaseMax for Phase Retrieval: A Replica Analysis

Statistical Mechanics of High-Dimensional Inference

Replica Analysis for Generalized Linear Regression with IID Row Prior

Statistical Physics of Signal Estimation in Gaussian Noise: Theory and Examples of Phase Transitions