Similar papers 4
December 30, 2020
Deep learning algorithms are responsible for a technological revolution in a variety of tasks, including image recognition and Go playing. Yet, why they work is not understood. Ultimately, they manage to classify data lying in high dimension -- a feat generically impossible due to the geometry of high dimensional space and the associated curse of dimensionality. Understanding what kind of structure, symmetry or invariance makes data such as images learnable is a fundamental cha...
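The claim that generic high-dimensional data are hard to classify can be made concrete with a small numerical experiment. The sketch below is a generic illustration of distance concentration, not taken from the paper (point counts and dimensions are arbitrary choices): as the dimension grows, the nearest and farthest neighbours of a random query become almost equidistant, which is the geometric face of the curse of dimensionality.

```python
# Generic illustration (not from the paper): distance concentration in high dimension.
import numpy as np

rng = np.random.default_rng(0)
n_points = 1000

for d in [2, 10, 100, 1000]:
    x = rng.standard_normal((n_points, d))   # random data cloud
    q = rng.standard_normal(d)               # random query point
    dists = np.linalg.norm(x - q, axis=1)
    contrast = (dists.max() - dists.min()) / dists.min()
    print(f"d = {d:5d}   relative distance contrast = {contrast:.3f}")
```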
September 8, 2023
By training linear physical networks to learn linear transformations, we discern how their physical properties evolve due to weight update rules. Our findings highlight a striking similarity between the learning behaviors of such networks and the processes of aging and memory formation in disordered and glassy systems. We show that the learning dynamics resembles an aging process, where the system relaxes in response to repeated application of the feedback boundary forces in ...
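As a rough point of comparison, the sketch below is my own minimal setup, assuming plain gradient descent on a two-layer linear network rather than the paper's physical feedback-driven update rule: it trains the network to reproduce a target linear transformation and prints the loss as it slowly relaxes under repeated weight updates.

```python
# Minimal sketch, assuming ordinary gradient descent on a two-layer linear
# network (not the paper's physical learning rule): watch the loss relax as
# the weights are repeatedly updated toward a target linear map A.
import numpy as np

rng = np.random.default_rng(1)
d, P, lr = 10, 500, 1e-2
A = rng.standard_normal((d, d))              # target linear transformation
W1 = 0.1 * rng.standard_normal((d, d))
W2 = 0.1 * rng.standard_normal((d, d))
X = rng.standard_normal((d, P))              # training inputs
Y = A @ X                                    # target outputs

for step in range(5001):
    H = W1 @ X
    err = W2 @ H - Y                         # residual of the composed map
    gW2 = (err @ H.T) / P                    # gradients of 0.5 * mean squared error
    gW1 = (W2.T @ err @ X.T) / P
    W2 -= lr * gW2
    W1 -= lr * gW1
    if step % 1000 == 0:
        print(f"step {step:5d}   loss = {np.mean(err ** 2):.4e}")
```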
February 26, 2017
Motivated by the idea that criticality and universality of phase transitions might play a crucial role in achieving and sustaining learning and intelligent behaviour in biological and artificial networks, we analyse a theoretical and a pragmatic experimental setup for critical phenomena in deep learning. On the theoretical side, we use results from statistical physics to carry out critical point calculations in feed-forward/fully connected networks, while on the experimental...
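One standard way to see a critical point in fully connected networks is the textbook mean-field signal-propagation experiment sketched below (my own choice of illustration, not necessarily the calculation performed in the paper): two correlated inputs are propagated through deep random tanh networks, and their representations collapse together below a critical weight scale, decorrelate above it, and retain their correlation to large depth near it.

```python
# Sketch under my own assumptions (random tanh networks, standard mean-field
# weight scaling); not necessarily the paper's critical point calculation.
import numpy as np

rng = np.random.default_rng(2)
width, depth = 500, 40

def final_correlation(sigma_w):
    x1 = rng.standard_normal(width)
    x2 = 0.7 * x1 + np.sqrt(1 - 0.7 ** 2) * rng.standard_normal(width)  # correlated input
    h1, h2 = x1, x2
    for _ in range(depth):
        W = sigma_w / np.sqrt(width) * rng.standard_normal((width, width))
        h1, h2 = np.tanh(W @ h1), np.tanh(W @ h2)
    return h1 @ h2 / (np.linalg.norm(h1) * np.linalg.norm(h2))

for sigma_w in [0.5, 1.0, 1.5, 2.5]:
    print(f"sigma_w = {sigma_w}: correlation after {depth} layers = {final_correlation(sigma_w):+.3f}")
```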
September 20, 2024
Recent years have been marked by the fast-paced diversification and increasing ubiquity of machine learning applications. Yet, a firm theoretical understanding of the surprising efficiency with which neural networks learn from high-dimensional data still proves largely elusive. In this endeavour, analyses inspired by statistical physics have proven instrumental, enabling the tight asymptotic characterization of the learning of neural networks in high dimensions, for a broad class...
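A concrete instance of what such a tight asymptotic characterization looks like in practice is the proportional regime, where the sample size P and the input dimension N diverge at fixed ratio alpha = P/N and the test error concentrates on a deterministic function of alpha alone. The quick check below is my own generic teacher-student ridge-regression experiment, not one of the models reviewed.

```python
# Generic proportional-regime sketch (teacher-student ridge regression; my own
# choice of model, not taken from the review): at fixed alpha = P/N the test
# error barely depends on the absolute sizes.
import numpy as np

rng = np.random.default_rng(3)

def test_error(N, alpha, lam=0.1, noise=0.1):
    P = int(alpha * N)
    teacher = rng.standard_normal(N) / np.sqrt(N)
    X = rng.standard_normal((P, N))
    y = X @ teacher + noise * rng.standard_normal(P)
    w = np.linalg.solve(X.T @ X + lam * np.eye(N), X.T @ y)   # ridge estimator
    X_test = rng.standard_normal((2000, N))
    return np.mean((X_test @ (w - teacher)) ** 2)

alphas = (0.5, 1.0, 2.0, 4.0)
for N in (200, 600):
    errs = "  ".join(f"alpha={a}: {test_error(N, a):.4f}" for a in alphas)
    print(f"N = {N:4d}   {errs}")
```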
January 19, 1999
We investigate layered neural networks with differentiable activation function and student vectors without normalization constraint by means of equilibrium statistical physics. We consider the learning of perfectly realizable rules and find that the length of student vectors becomes infinite, unless a proper weight decay term is added to the energy. Then, the system undergoes a first order phase transition between states with very long student vectors and states where the len...
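A simpler, well-known cousin of this effect is sketched below (my own illustration with gradient descent on a sigmoidal classifier, not the paper's equilibrium statistical-physics calculation): on a perfectly realizable rule the length of the student vector keeps growing during training, while a small weight-decay term pins it at a finite value.

```python
# Related illustration under my own assumptions (logistic student, gradient
# descent), not the paper's equilibrium setting: without weight decay the
# student vector keeps lengthening on realizable data; with weight decay it
# settles at a finite length.
import numpy as np

rng = np.random.default_rng(4)
N, P, lr, steps = 50, 200, 0.5, 20000
teacher = rng.standard_normal(N)
X = rng.standard_normal((P, N))
y = np.sign(X @ teacher)                     # perfectly realizable rule

def final_norm(weight_decay):
    w = np.zeros(N)
    t = (y + 1) / 2                          # targets in {0, 1}
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-(X @ w)))   # sigmoidal (differentiable) output
        w -= lr * (X.T @ (p - t) / P + weight_decay * w)
    return np.linalg.norm(w)

print(f"no weight decay:    |w| = {final_norm(0.0):.2f}")
print(f"weight decay 1e-2:  |w| = {final_norm(1e-2):.2f}")
```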
November 26, 2021
The binary perceptron is a fundamental model of supervised learning with non-convex optimization, which lies at the root of popular deep learning. The binary perceptron is able to classify random high-dimensional data by computing the marginal probabilities of binary synapses. The relationship between the algorithmic instability and the equilibrium analysis of the model remains elusive. Here, we establish the relationship by showing that the instability condition a...
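On tiny sizes the model's phenomenology can be seen directly by brute force (an exhaustive check of my own, not the message-passing computation of synaptic marginals described in the paper): as alpha = P/N grows, the set of binary weight vectors that classify the random patterns shrinks and eventually empties.

```python
# Brute-force sketch on tiny sizes (my own check, not the paper's algorithm):
# count binary weight vectors that classify P random patterns as alpha = P/N grows.
import itertools
import numpy as np

rng = np.random.default_rng(5)
N = 15                                             # small and odd, so X @ w is never zero
all_w = np.array(list(itertools.product([-1, 1], repeat=N)))   # all 2^N binary students

for P in [5, 10, 15, 20]:
    X = rng.choice([-1, 1], size=(P, N))           # random +/-1 patterns
    y = rng.choice([-1, 1], size=P)                # random labels
    correct = np.sign(all_w @ X.T) == y            # (2^N, P) correctness table
    n_solutions = int(np.all(correct, axis=1).sum())
    print(f"alpha = {P / N:.2f}: {n_solutions} of {len(all_w)} binary weight vectors are solutions")
```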
September 11, 2022
Despite the practical success of deep neural networks, a comprehensive theoretical framework that can predict practically relevant scores, such as the test accuracy, from knowledge of the training data is currently lacking. Huge simplifications arise in the infinite-width limit, where the number of units $N_\ell$ in each hidden layer ($\ell=1,\dots, L$, with $L$ the depth of the network) far exceeds the number $P$ of training examples. This idealisation, however, blatantly d...
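The tractability of that limit comes from the fact that, at large width, the output of a randomly initialised network becomes Gaussian. The quick check below is my own illustration of the idealisation, not the paper's finite-width analysis: it measures the excess kurtosis of a one-hidden-layer network's output at a fixed input and watches it vanish as the width grows.

```python
# Illustration of the infinite-width idealisation (my own sketch, not the
# paper's finite-width theory): the output of a random one-hidden-layer tanh
# network at a fixed input becomes Gaussian (excess kurtosis -> 0) as the
# hidden width grows.
import numpy as np

rng = np.random.default_rng(6)
n_nets, d = 20000, 10
x = rng.standard_normal(d)
scale = np.linalg.norm(x) / np.sqrt(d)       # std of each hidden pre-activation for N(0, 1/d) weights

for width in [2, 10, 200]:
    z = scale * rng.standard_normal((n_nets, width))            # pre-activations W @ x, sampled directly
    a = rng.standard_normal((n_nets, width)) / np.sqrt(width)   # readout weights
    out = np.sum(a * np.tanh(z), axis=1)                        # one scalar output per random network
    kurt = np.mean((out - out.mean()) ** 4) / np.var(out) ** 2 - 3.0
    print(f"width = {width:4d}   excess kurtosis of the output = {kurt:+.3f}")
```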
September 28, 1999
We study the dynamics of supervised learning in layered neural networks, in the regime where the size $p$ of the training set is proportional to the number $N$ of inputs. Here the local fields are no longer described by Gaussian probability distributions and the learning dynamics is of a spin-glass nature, with the composition of the training set playing the role of quenched disorder. We show how dynamical replica theory can be used to predict the evolution of macroscopic obs...
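The regime itself can be simulated directly; the sketch below is a plain numerical experiment of my own, not the dynamical replica calculation: gradient descent on a fixed training set of size p = alpha N, tracking the macroscopic observables (student length Q and teacher overlap) whose evolution the theory aims to predict.

```python
# Direct simulation of the p = alpha * N regime (my own sketch, not dynamical
# replica theory): gradient descent on a fixed (quenched) training set,
# tracking macroscopic observables over training.
import numpy as np

rng = np.random.default_rng(7)
N, alpha, lr = 500, 2.0, 0.05
p = int(alpha * N)
teacher = rng.standard_normal(N)
X = rng.standard_normal((p, N)) / np.sqrt(N)
y = np.sign(X @ teacher)                      # labels from a teacher (realizable rule)

w = rng.standard_normal(N)
for step in range(2001):
    margins = y * (X @ w)
    # logistic loss summed over the fixed training set
    grad = -(X * (y / (1.0 + np.exp(margins)))[:, None]).sum(axis=0)
    w -= lr * grad
    if step % 400 == 0:
        Q = w @ w / N                         # student length
        overlap = (w @ teacher) / (np.linalg.norm(w) * np.linalg.norm(teacher))
        print(f"step {step:4d}   Q = {Q:7.3f}   teacher overlap = {overlap:+.3f}")
```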
March 17, 2018
Some interesting recent advances in the theoretical understanding of neural networks have been informed by results from the physics of disordered many-body systems. Motivated by these findings, this work uses the replica technique to study the mathematically tractable bipartite Sherrington-Kirkpatrick (SK) spin glass model, which is formally similar to a Restricted Boltzmann Machine (RBM) neural network. The bipartite SK model has been previously studied assuming replica symm...
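For reference, the bipartite SK energy is exactly that of a binary-binary RBM, and its two layers can be sampled by alternating heat-bath updates. The sketch below is a plain Monte Carlo illustration of the model itself, not the replica analysis carried out in the work.

```python
# Monte Carlo sketch of the bipartite SK / binary RBM energy (my own
# illustration, not the paper's replica calculation): two layers of +/-1 spins
# coupled only across layers, sampled by alternating heat-bath updates.
import numpy as np

rng = np.random.default_rng(8)
n_v, n_h, beta, sweeps = 100, 80, 1.5, 2000
J = rng.standard_normal((n_v, n_h)) / np.sqrt(n_v + n_h)   # random cross-layer couplings

v = rng.choice([-1, 1], size=n_v)
h = rng.choice([-1, 1], size=n_h)
for _ in range(sweeps):
    # heat-bath update of the hidden layer given the visible one, then vice versa
    h = np.where(rng.random(n_h) < 1.0 / (1.0 + np.exp(-2.0 * beta * (v @ J))), 1, -1)
    v = np.where(rng.random(n_v) < 1.0 / (1.0 + np.exp(-2.0 * beta * (J @ h))), 1, -1)

energy = -v @ J @ h                                        # E(v, h) = -sum_ij v_i J_ij h_j
print(f"energy per spin after {sweeps} sweeps: {energy / (n_v + n_h):.3f}")
```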
May 2, 2017
Deep learning has become a powerful and popular tool for a variety of machine learning tasks. However, it is challenging to understand the mechanism of deep learning from a theoretical perspective. In this work, we propose a random active path model to study collective properties of deep neural networks with binary synapses, under perturbations that remove connections between layers. In the model, the path from input to output is randomly activated, and the corresponding in...
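As a very rough toy of the "randomly activated path" picture (my own guess at the spirit of the model, not its actual definition), one can ask how often a single input-to-output path survives when each of its inter-layer connections is removed independently with probability q:

```python
# Toy sketch under my own assumptions (not the paper's model): survival of a
# randomly activated input-to-output path when each of its `depth` inter-layer
# connections is removed independently with probability q.
import numpy as np

rng = np.random.default_rng(9)
depth, n_paths = 10, 100000

for q in [0.0, 0.2, 0.5]:
    removed = rng.random((n_paths, depth)) < q            # which connections were cut
    survival = np.mean(~removed.any(axis=1))              # a path survives iff no cut along it
    print(f"removal probability q = {q}: measured survival = {survival:.4f}   "
          f"(1 - q)^depth = {(1 - q) ** depth:.4f}")
```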