ID: 1907.02095

Understanding Phase Transitions via Mutual Information and MMSE

July 3, 2019

View on ArXiv
Galen Reeves, Henry Pfister
Computer Science
Mathematics
Statistics
Information Theory
Information Theory
Statistics Theory
Statistics Theory

The ability to understand and solve high-dimensional inference problems is essential for modern data science. This article examines high-dimensional inference problems through the lens of information theory and focuses on the standard linear model as a canonical example that is both rich enough to be practically useful and simple enough to be studied rigorously. In particular, this model can exhibit phase transitions where an arbitrarily small change in the model parameters can induce large changes in the quality of estimates. For this model, the performance of optimal inference can be studied using the replica method from statistical physics but, until recently, it was not known if the resulting formulas were actually correct. In this chapter, we present a tutorial description of the standard linear model and its connection to information theory. We also describe the replica prediction for this model and outline the authors' recent proof that it is exact.

Similar papers 1

Alexander Mozeika, Mansoor Sheikh, Fabian Aguirre-Lopez, ... , Coolen Anthony CC
Statistics Theory
Disordered Systems and Neura...
Statistics Theory

It is clear that conventional statistical inference protocols need to be revised to deal correctly with the high-dimensional data that are now common. Most recent studies aimed at achieving this revision rely on powerful approximation techniques, that call for rigorous results against which they can be tested. In this context, the simplest case of high-dimensional linear regression has acquired significant new relevance and attention. In this paper we use the statistical phys...

High-dimensional inference: a statistical mechanics perspective

October 28, 2020

89% Match
Jean Barbier
Disordered Systems and Neura...
Statistical Mechanics
Information Theory
Machine Learning
Information Theory

Statistical inference is the science of drawing conclusions about some system from data. In modern signal processing and machine learning, inference is done in very high dimension: very many unknown characteristics about the system have to be deduced from a lot of high-dimensional noisy data. This "high-dimensional regime" is reminiscent of statistical mechanics, which aims at describing the macroscopic behavior of a complex system based on the knowledge of its microscopic in...

Find SimilarView on arXiv

Optimal Errors and Phase Transitions in High-Dimensional Generalized Linear Models

August 10, 2017

89% Match
Jean Barbier, Florent Krzakala, Nicolas Macris, ... , Zdeborová Lenka
cs.IT
cond-mat.dis-nn
cs.AI
cs.LG
math.IT
math.MP

Generalized linear models (GLMs) arise in high-dimensional machine learning, statistics, communications and signal processing. In this paper we analyze GLMs when the data matrix is random, as relevant in problems such as compressed sensing, error-correcting codes or benchmark models in neural networks. We evaluate the mutual information (or "free entropy") from which we deduce the Bayes-optimal estimation and generalization errors. Our analysis applies to the high-dimensional...

Find SimilarView on arXiv

Mutual information for symmetric rank-one matrix estimation: A proof of the replica formula

June 13, 2016

89% Match
Jean Barbier, Mohamad Dia, Nicolas Macris, Florent Krzakala, ... , Zdeborova Lenka
Information Theory
Disordered Systems and Neura...
Machine Learning
Information Theory
Mathematical Physics

Factorizing low-rank matrices has many applications in machine learning and statistics. For probabilistic models in the Bayes optimal setting, a general expression for the mutual information has been proposed using heuristic statistical physics computations, and proven in few specific cases. Here, we show how to rigorously prove the conjectured formula for the symmetric rank-one case. This allows to express the minimal mean-square-error and to characterize the detectability p...

Find SimilarView on arXiv

Fundamental Limits of PhaseMax for Phase Retrieval: A Replica Analysis

August 10, 2017

88% Match
Oussama Dhifallah, Yue M. Lu
Information Theory
Information Theory

We consider a recently proposed convex formulation, known as the PhaseMax method, for solving the phase retrieval problem. Using the replica method from statistical mechanics, we analyze the performance of PhaseMax in the high-dimensional limit. Our analysis predicts the \emph{exact} asymptotic performance of PhaseMax. In particular, we show that a sharp phase transition phenomenon takes place, with a simple analytical formula characterizing the phase transition boundary. Thi...

Find SimilarView on arXiv

Statistical Mechanics of High-Dimensional Inference

January 18, 2016

88% Match
Madhu Advani, Surya Ganguli
stat.ML
cond-mat.dis-nn
cond-mat.stat-mech
math.ST
q-bio.QM
stat.TH

To model modern large-scale datasets, we need efficient algorithms to infer a set of $P$ unknown model parameters from $N$ noisy measurements. What are fundamental limits on the accuracy of parameter inference, given finite signal-to-noise ratios, limited measurements, prior information, and computational tractability requirements? How can we combine prior information with measurements to achieve these limits? Classical statistics gives incisive answers to these questions as ...

Find SimilarView on arXiv

Replica Analysis for Generalized Linear Regression with IID Row Prior

September 25, 2021

88% Match
Qiuyun Zou, Hongwen Yang
Information Theory
Information Theory

Different from a typical independent identically distributed (IID) element assumption, this paper studies the estimation of IID row random matrix for the generalized linear model constructed by a linear mixing space and a row-wise mapping channel. The objective inference problem arises in many engineering fields, such as wireless communications, compressed sensing, and phase retrieval. We apply the replica method from statistical mechanics to analyze the exact minimum mean sq...

Find SimilarView on arXiv

Statistical Physics of Signal Estimation in Gaussian Noise: Theory and Examples of Phase Transitions

December 29, 2008

87% Match
Neri Shitz Merhav, Dongning Shitz Guo, Shlomo Shitz Shamai
Information Theory
Information Theory

We consider the problem of signal estimation (denoising) from a statistical mechanical perspective, using a relationship between the minimum mean square error (MMSE), of estimating a signal, and the mutual information between this signal and its noisy version. The paper consists of essentially two parts. In the first, we derive several statistical-mechanical relationships between a few important quantities in this problem area, such as the MMSE, the differential entropy, the ...

Find SimilarView on arXiv

Mean-field methods and algorithmic perspectives for high-dimensional machine learning

March 10, 2021

87% Match
Benjamin Aubin
Disordered Systems and Neura...
Machine Learning

The main difficulty that arises in the analysis of most machine learning algorithms is to handle, analytically and numerically, a large number of interacting random variables. In this Ph.D manuscript, we revisit an approach based on the tools of statistical physics of disordered systems. Developed through a rich literature, they have been precisely designed to infer the macroscopic behavior of a large number of particles from their microscopic interactions. At the heart of th...

Find SimilarView on arXiv

The Mutual Information in Random Linear Estimation

July 8, 2016

87% Match
Jean Barbier, Mohamad Dia, ... , Krzakala Florent
Information Theory
Information Theory
Mathematical Physics

We consider the estimation of a signal from the knowledge of its noisy linear random Gaussian projections, a problem relevant in compressed sensing, sparse superposition codes or code division multiple access just to cite few. There has been a number of works considering the mutual information for this problem using the heuristic replica method from statistical physics. Here we put these considerations on a firm rigorous basis. First, we show, using a Guerra-type interpolatio...

Find SimilarView on arXiv