Similar papers
January 31, 2025
Deep Neural Networks (DNNs) excel at many tasks, often rivaling or surpassing human performance. Yet their internal processes remain elusive, frequently described as "black boxes." While performance can be refined experimentally, achieving a fundamental grasp of their inner workings is still a challenge. Statistical Mechanics has long tackled computational problems, and this thesis applies physics-based insights to understand DNNs via three complementary approaches. First...
January 21, 2020
In this paper we study the properties of the quenched pressure of a multi-layer spin-glass model (a deep Boltzmann Machine in artificial-intelligence jargon) whose pairwise interactions are allowed only between spins lying in adjacent layers, not within the same layer nor between layers at distance larger than one. We prove a theorem that bounds the quenched pressure of such a K-layer machine in terms of K Sherrington-Kirkpatrick spin glasses and use it to investigate its anneale...
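For concreteness, here is a minimal sketch of the setting in my own notation (the precise normalization is an assumption, not taken from the paper): a K-layer machine with $N_k$ Ising spins $\sigma^{(k)}_i = \pm 1$ in layer $k$ and Gaussian couplings only across adjacent layers,

    H_K(\sigma) = -\frac{1}{\sqrt{N}} \sum_{k=1}^{K-1} \sum_{i,j} J^{(k)}_{ij}\, \sigma^{(k)}_i \sigma^{(k+1)}_j, \qquad J^{(k)}_{ij} \sim \mathcal{N}(0,1),

whose quenched pressure is

    p_N(\beta) = \frac{1}{N}\, \mathbb{E}_J \ln \sum_{\sigma} e^{-\beta H_K(\sigma)}, \qquad N = \sum_k N_k.

The bound mentioned above controls $p_N$ for this machine using the pressures of K single-layer Sherrington-Kirkpatrick models.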
May 13, 2018
As deep neural networks grow in size, from thousands to millions to billions of weights, the performance of those networks becomes limited by our ability to accurately train them. A common naive question arises: if we have a system with billions of degrees of freedom, don't we also need billions of samples to train it? Of course, the success of deep learning indicates that reliable models can be learned with reasonable amounts of data. Similar questions arise in protein foldi...
September 13, 2023
This paper first describes, from a high-level viewpoint, the main challenges that had to be solved in order to develop a theory of spin glasses over the last fifty years. It then explains how important inference problems, notably those occurring in machine learning, can be formulated as problems in the statistical physics of disordered systems. However, the main questions that we face in the analysis of deep networks require the development of a new chapter of spin-glass theory, which will...
June 12, 2019
In the past decade, deep neural networks (DNNs) came to the fore as the leading machine learning algorithms for a variety of tasks. Their rise was founded on market needs and engineering craftsmanship, the latter based more on trial and error than on theory. While still far behind the application forefront, the theoretical study of DNNs has recently made important advances in analyzing the highly over-parameterized regime, where some exact results have been obtained. Leve...
November 25, 2022
We consider dense associative neural networks trained with no supervision and investigate their computational capabilities analytically, via a statistical-mechanics approach, and numerically, via Monte Carlo simulations. In particular, we obtain a phase diagram summarizing their performance as a function of control parameters such as the quality and quantity of the training dataset and the network storage, valid in the limit of large network size and structureless dat...
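To illustrate the kind of numerics involved, here is a minimal Monte Carlo sketch in Python (a toy Hopfield-style setup of my own, not the paper's dense model): Hebbian couplings are built, without supervision, from noisy examples of a hidden archetype, and retrieval is measured by the overlap with that archetype.

    import numpy as np

    rng = np.random.default_rng(0)
    N = 200   # network size
    M = 50    # number of training examples (dataset quantity)
    r = 0.8   # dataset quality: P(example bit = archetype bit) = (1 + r) / 2

    # One hidden archetype and M noisy examples of it (the unsupervised dataset).
    xi = rng.choice([-1, 1], size=N)
    examples = xi * np.where(rng.random((M, N)) < (1 + r) / 2, 1, -1)

    # Hebbian couplings built from the examples alone.
    J = examples.T @ examples / N
    np.fill_diagonal(J, 0.0)

    # Metropolis dynamics at inverse temperature beta, from a random start.
    beta = 2.0
    sigma = rng.choice([-1, 1], size=N)
    for sweep in range(200):
        for i in rng.permutation(N):
            dE = 2.0 * sigma[i] * (J[i] @ sigma)  # energy cost of flipping spin i
            if dE <= 0 or rng.random() < np.exp(-beta * dE):
                sigma[i] = -sigma[i]

    # Retrieval order parameter: overlap with the hidden archetype.
    print("overlap m =", sigma @ xi / N)

Sweeping beta, M, and r while recording the overlap is how a phase diagram of the kind described above is assembled numerically.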
May 25, 2022
This work reports first-order and second-order phase transitions unique to deep learning, whose phenomenology closely follows that of statistical physics. In particular, we prove that the competition between prediction error and model complexity in the training loss leads to a second-order phase transition for nets with one hidden layer and a first-order phase transition for nets with more than one hidden layer. The proposed theory is directly relevant to the optimization of...
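To fix the terminology (a schematic reading using the standard statistical-physics definitions, not the paper's exact construction), write the training loss as prediction error plus a weighted model-complexity term,

    \mathcal{L}(\theta) = E(\theta) + \lambda\, C(\theta),

and track the complexity $C^*$ of the optimal solution as $\lambda$ varies. In a second-order transition $C^*$ vanishes continuously at a critical $\lambda_c$; in a first-order transition $C^*$ jumps discontinuously as the global minimum switches between a trivial and a non-trivial branch of $\mathcal{L}$. The claim above is that one hidden layer realizes the continuous scenario and deeper nets the discontinuous one.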
October 1, 2024
Deep neural network architectures often consist of repetitive structural elements. We introduce a new approach that reveals these patterns and can be broadly applied to the study of deep learning. Similar to how a power strip helps untangle and organize complex cable connections, this approach treats neurons as additional degrees of freedom in interactions, simplifying the structure and enhancing the intuitive understanding of interactions within deep neural networks. Further...
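One standard construction matching this description (my generic guess at the mechanism, not necessarily the authors' formulation) is to promote the hidden activations to explicit degrees of freedom and enforce the layer maps by constraints,

    Z = \int \prod_{k} dh^{(k)}\, \prod_{k} \delta\!\left( h^{(k+1)} - \phi\big(W^{(k)} h^{(k)}\big) \right) e^{-\beta\, \mathcal{L}(h^{(K)})},

so that a nested composition of layers becomes a chain of local interactions between adjacent layers only; representing each delta function in Fourier form then introduces conjugate fields and leaves purely pairwise, layer-to-layer couplings.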
May 10, 2024
The purpose of this manuscript is to review my recent activity on three main research topics. The first concerns the nature of low-temperature amorphous solids and their relation to the spin glass transition in a magnetic field. This is the subject of the first chapter, where I discuss a new model, the KHGPS model, which allows one to make some progress. In the second chapter I review a second research line concerning the study of the rigidity/jamming transitions in particle ...
January 19, 2015
In this paper, we present a statistical-mechanical analysis of deep learning. We elucidate some of the essential components of deep learning: pre-training by unsupervised learning and fine-tuning by supervised learning. We formulate the extraction of features from the training data as a margin criterion in a high-dimensional feature-vector space. The self-organized classifier is then supplied with small amounts of labelled data, as in deep learning. Although we employ a simp...
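A minimal numerical caricature of that two-stage pipeline (with my own stand-ins: PCA for the unsupervised pre-training and a margin perceptron for the supervised fine-tuning; the paper's actual components differ):

    import numpy as np

    rng = np.random.default_rng(1)
    D, N, n_lab = 100, 1000, 20  # input dim, unlabelled samples, labelled samples

    # Unlabelled data: two Gaussian clusters along a hidden direction u.
    u = rng.normal(size=D)
    u /= np.linalg.norm(u)
    y = rng.choice([-1, 1], size=N)
    X = 2.0 * y[:, None] * u + rng.normal(size=(N, D))

    # "Pre-training": unsupervised feature extraction via PCA.
    Xc = X - X.mean(axis=0)
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    feats = Xc @ Vt[:2].T  # project onto the top two principal directions

    # "Fine-tuning": perceptron with a margin criterion on a few labels.
    idx = rng.choice(N, size=n_lab, replace=False)
    w = np.zeros(2)
    kappa = 0.5  # required margin in feature space
    for _ in range(100):
        for i in idx:
            if y[i] * (w @ feats[i]) <= kappa:
                w += y[i] * feats[i]

    print("accuracy on all data:", np.mean(np.sign(feats @ w) == y))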