High-dimensional inference: a statistical mechanics perspective

October 28, 2020


Jean Barbier

Condensed Matter

Computer Science

Mathematics

Disordered Systems and Neura...

Statistical Mechanics

Information Theory

Machine Learning


Statistical inference is the science of drawing conclusions about a system from data. In modern signal processing and machine learning, inference is done in very high dimension: a large number of unknown characteristics of the system have to be deduced from large amounts of high-dimensional noisy data. This "high-dimensional regime" is reminiscent of statistical mechanics, which aims at describing the macroscopic behavior of a complex system based on the knowledge of its microscopic interactions. It is by now clear that there are many connections between inference and statistical physics. This article aims at emphasizing some of the deep links connecting these apparently separate disciplines through the description of paradigmatic models of high-dimensional inference in the language of statistical mechanics. This article has been published in the issue on artificial intelligence of Ithaca, an Italian popularization-of-science journal. The selected topics and references are highly biased and not intended to be exhaustive in any way. Its purpose is to serve as an introduction to the statistical mechanics of inference through a very specific angle that corresponds to my own tastes and limited knowledge.

Statistical Mechanics of High-Dimensional Inference

January 18, 2016

92% Match

Madhu Advani, Surya Ganguli

stat.ML

cond-mat.dis-nn

cond-mat.stat-mech

math.ST

q-bio.QM

stat.TH

To model modern large-scale datasets, we need efficient algorithms to infer a set of $P$ unknown model parameters from $N$ noisy measurements. What are fundamental limits on the accuracy of parameter inference, given finite signal-to-noise ratios, limited measurements, prior information, and computational tractability requirements? How can we combine prior information with measurements to achieve these limits? Classical statistics gives incisive answers to these questions as ...


Exact results on high-dimensional linear regression via statistical physics

September 28, 2020

91% Match

Alexander Mozeika, Mansoor Sheikh, Fabian Aguirre-Lopez, ... , Anthony CC Coolen

Statistics Theory

Disordered Systems and Neura...


It is clear that conventional statistical inference protocols need to be revised to deal correctly with the high-dimensional data that are now common. Most recent studies aimed at achieving this revision rely on powerful approximation techniques, which call for rigorous results against which they can be tested. In this context, the simplest case of high-dimensional linear regression has acquired significant new relevance and attention. In this paper we use the statistical phys...


Statistical Mechanics of Learning: A Variational Approach for Real Data

September 6, 2002

90% Match

D. Malzahn, M. Opper

Disordered Systems and Neura...

Data Analysis, Statistics an...

Using a variational technique, we generalize the statistical physics approach of learning from random examples to make it applicable to real data. We demonstrate the validity and relevance of our method by computing approximate estimators for generalization errors that are based on training data alone.


Statistical Physics and Information Theory Perspectives on Linear Inverse Problems

May 15, 2017

90% Match

Junan Zhu

Information Theory

Many real-world problems in machine learning, signal processing, and communications assume that an unknown vector $x$ is measured by a matrix $A$, resulting in a vector $y=Ax+z$, where $z$ denotes the noise; we call this a single measurement vector (SMV) problem. Sometimes, multiple dependent vectors $x^{(j)}, j\in \{1,...,J\}$, are measured at the same time, forming the so-called multi-measurement vector (MMV) problem. Both SMV and MMV are linear models (LMs), and the process...


On the correspondence between thermodynamics and inference

June 5, 2017

90% Match

Colin H. LaMont, Paul A. Wiggins

Statistics Theory

Data Analysis, Statistics an...


We expand upon a natural analogy between Bayesian statistics and statistical physics in which sample size corresponds to inverse temperature. This analogy motivates the definition of two novel statistical quantities: a learning capacity and a Gibbs entropy. The analysis of the learning capacity, corresponding to the heat capacity in thermal physics, leads to new insight into the mechanism of learning and explains why some models have anomalously high learning performance. We ...


Understanding Machine Learning Paradigms through the Lens of Statistical Thermodynamics: A tutorial

November 24, 2024

90% Match

Xinxin Star, Liu

Machine Learning

Materials Science

Statistics Theory

Chemical Physics


This tutorial investigates the convergence of statistical mechanics and learning theory, elucidating the potential enhancements in machine learning methodologies through the integration of foundational principles from physics. The tutorial delves into advanced techniques like entropy, free energy, and variational inference which are utilized in machine learning, illustrating their significant contributions to model efficiency and robustness. By bridging these scientific disci...


An Introduction to Modern Statistical Learning

July 20, 2022

89% Match

Joseph G. Makin

Machine Learning

This work in progress aims to provide a unified introduction to statistical learning, building up slowly from classical models like the GMM and HMM to modern neural networks like the VAE and diffusion models. There are today many internet resources that explain this or that new machine-learning algorithm in isolation, but they do not (and cannot, in so brief a space) connect these algorithms with each other or with the classical literature on statistical models, out of which ...


Statistical mechanics of complex neural systems and high dimensional data

January 30, 2013

89% Match

Madhu Advani, Subhaneil Lahiri, Surya Ganguli

Neurons and Cognition

Disordered Systems and Neura...

Machine Learning

Recent experimental advances in neuroscience have opened new vistas into the immense complexity of neuronal networks. This proliferation of data challenges us on two parallel fronts. First, how can we form adequate theoretical frameworks for understanding how dynamical network processes cooperate across widely disparate spatiotemporal scales to solve important computational problems? And second, how can we extract meaningful models of neuronal systems from high dimensional da...


High-Dimensional Non-Convex Landscapes and Gradient Descent Dynamics

August 7, 2023

89% Match

Tony Bonnaire, Davide Ghio, Kamesh Krishnamurthy, Francesca Mignacco, ... , Giulio Biroli

Disordered Systems and Neura...

Statistical Mechanics

In these lecture notes we present different methods and concepts developed in statistical physics to analyze gradient descent dynamics in high-dimensional non-convex landscapes. Our aim is to show how approaches developed in physics, mainly statistical physics of disordered systems, can be used to tackle open questions on high-dimensional dynamics in Machine Learning.


Fundamental problems in statistical physics XIV: Lecture on Machine Learning

February 11, 2022

89% Match

Aurélien Decelle

Disordered Systems and Neura...

Statistical Mechanics

Recent progress in Machine Learning has opened the door to actual applications of learning algorithms, but also to new research directions, both within the field of Machine Learning itself and at its interfaces with other disciplines. The case that interests us is the interface with physics, and more specifically Statistical Physics. In this short lecture, I will try to present first a brief introduction to Machine Learning from the angle of neural networks. After explaining quickl...
