ID: 2004.08930

Space of Functions Computed by Deep-Layered Machines

April 19, 2020

Alexander Mozeika, Bo Li, David Saad
Computer Science
Condensed Matter
Statistics
Machine Learning
Disordered Systems and Neural Networks

We study the space of functions computed by random layered machines, including deep neural networks and Boolean circuits. Investigating the distribution of Boolean functions computed by recurrent and layer-dependent architectures, we find that it is the same in both models. Depending on the initial conditions and computing elements used, we characterize the space of functions computed in the large-depth limit and show that the macroscopic entropy of Boolean functions is either monotonically increasing or decreasing with growing depth.
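
As a rough illustration of the depth dependence studied here, the sketch below samples random layered circuits of 2-input NAND gates and estimates the entropy of the induced distribution over Boolean functions at several depths. The sizes, the gate choice, and the Monte Carlo approach are illustrative assumptions; the paper's analysis is analytic.

```python
# Monte Carlo sketch (illustrative only): entropy of the distribution of
# Boolean functions computed by random layered NAND circuits, vs. depth.
import itertools, math, random
from collections import Counter

N_IN = 3        # circuit inputs (assumed small so truth tables stay enumerable)
WIDTH = 8       # gates per layer (assumption)
SAMPLES = 5000  # circuits sampled per depth (a rough estimate only)

def sample_truth_table(depth):
    """Truth table of one random layered NAND circuit, as a tuple of bits."""
    fan_in = lambda l: WIDTH if l > 0 else N_IN
    wiring = [[(random.randrange(fan_in(l)), random.randrange(fan_in(l)))
               for _ in range(WIDTH)] for l in range(depth)]
    out_gate = random.randrange(WIDTH)
    table = []
    for bits in itertools.product([0, 1], repeat=N_IN):
        layer = list(bits)
        for l in range(depth):
            layer = [1 - (layer[i] & layer[j]) for i, j in wiring[l]]
        table.append(layer[out_gate])
    return tuple(table)

for depth in (1, 2, 4, 8):
    counts = Counter(sample_truth_table(depth) for _ in range(SAMPLES))
    H = -sum((c / SAMPLES) * math.log2(c / SAMPLES) for c in counts.values())
    print(f"depth {depth}: ~{H:.2f} bits over {len(counts)} distinct functions")
```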

Similar papers

Bo Li, David Saad
Disordered Systems and Neural Networks
Machine Learning

The function space of deep-learning machines is investigated by studying the growth in the entropy of functions of a given error with respect to a reference function realized by a deep-learning machine. Using physics-inspired methods, we study both sparsely and densely connected architectures, discover a layer-wise convergence of candidate functions, marked by a corresponding reduction in entropy when approaching the reference function, and gain insight into the importance of havin...
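
A minimal sketch of the "entropy at a given error from a reference function" picture, with a single sign-perceptron on $\{0,1\}^3$ standing in for a deep machine (a deliberate simplification, not the paper's setup): sample random functions, bucket them by Hamming error to a fixed reference, and estimate the entropy within each error shell.

```python
# Sketch only: entropy of sampled functions grouped by error to a reference.
import itertools, math
from collections import Counter, defaultdict
import numpy as np

xs = np.array(list(itertools.product([0, 1], repeat=3)))  # all inputs
rng = np.random.default_rng(1)

def truth_table(w):
    return tuple(((xs @ w) > 0).astype(int))

ref = truth_table(rng.standard_normal(3))       # reference function
shells = defaultdict(Counter)
for _ in range(50_000):
    t = truth_table(rng.standard_normal(3))
    err = sum(a != b for a, b in zip(t, ref))   # Hamming error vs. reference
    shells[err][t] += 1

for err in sorted(shells):
    c, tot = shells[err], sum(shells[err].values())
    H = -sum(v / tot * math.log2(v / tot) for v in c.values())
    print(f"error {err}: {len(c)} functions, entropy ~{H:.2f} bits")
```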

Michal Rosen-Zvi, Andreas Engel, Ido Kanter
Statistical Mechanics
Disordered Systems and Neural Networks

The information processing abilities of a multilayer neural network with a number of hidden units scaling as the input dimension are studied using statistical mechanics methods. The mapping from the input layer to the hidden units is performed by general symmetric Boolean functions, whereas the hidden layer is connected to the output by either discrete or continuous couplings. Introducing an overlap in the space of Boolean functions as an order parameter, the storage capacity is f...
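
For concreteness, a symmetric Boolean function depends on its inputs only through the number of 1s; two standard examples (illustrative, not the paper's specific choice):

```python
def majority(bits):   # 1 iff more than half of the inputs are 1
    return int(sum(bits) > len(bits) / 2)

def parity(bits):     # 1 iff an odd number of inputs are 1
    return sum(bits) % 2
```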

Chris Mingard, Joar Skalse, Guillermo Valle-Pérez, David Martínez-Rubio, ... , Ard A. Louis
Machine Learning

Understanding the inductive bias of neural networks is critical to explaining their ability to generalise. Here, for one of the simplest neural networks -- a single-layer perceptron with n input neurons, one output neuron, and no threshold bias term -- we prove that upon random initialisation of weights, the a priori probability $P(t)$ that it represents a Boolean function that classifies t points in $\{0,1\}^n$ as 1 has a remarkably simple form: $P(t) = 2^{-n}$ for $0\leq t < ...
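
The flat distribution is easy to probe numerically. A sketch, assuming Gaussian weight initialisation and the convention that x is classified as 1 iff w·x > 0 (so the all-zeros input is never classified as 1); the paper's result itself is analytic.

```python
# Empirical check of the claimed flat P(t) for a bias-free perceptron.
import itertools
import numpy as np

n, trials = 3, 200_000
xs = np.array(list(itertools.product([0, 1], repeat=n)))   # all 2^n inputs
W = np.random.default_rng(0).standard_normal((trials, n))  # random weights
t = ((xs @ W.T) > 0).sum(axis=0)            # points classified as 1, per draw
print(np.bincount(t, minlength=2 ** n + 1) / trials)
# ~0.125 = 2^-n for each t in 0..2^n - 1; t = 2^n never occurs since w.0 = 0
```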

Anirbit Mukherjee, Amitabh Basu
Computational Complexity
Machine Learning
Neural and Evolutionary Computing

Motivated by the resurgence of neural networks in solving complex learning tasks, we undertake a study of high-depth networks using ReLU gates, which implement the function $x \mapsto \max\{0,x\}$. We try to understand the role of depth in such neural networks by showing size lower bounds against such network architectures in parameter regimes hitherto unexplored. In particular, we show the following two main results about neural nets computing Boolean functions of in...
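
As a concrete instance of ReLU gates computing a Boolean function (an illustrative construction, not one of the paper's lower-bound instances), a depth-2 ReLU network computes XOR exactly on $\{0,1\}^2$:

```python
def relu(x):
    return max(0.0, x)  # the gate x -> max{0, x}

def xor_net(x1, x2):
    # XOR(x1, x2) = relu(x1 + x2) - 2 * relu(x1 + x2 - 1) on {0,1}^2
    return relu(x1 + x2) - 2.0 * relu(x1 + x2 - 1.0)

for a in (0, 1):
    for b in (0, 1):
        assert xor_net(a, b) == (a ^ b)
```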

Eran Malach, Shai Shalev-Shwartz
Machine Learning

While on some natural distributions neural networks are trained efficiently using gradient-based algorithms, it is known that learning them is computationally hard in the worst case. To separate hard-to-learn from easy-to-learn distributions, we observe the property of local correlation: correlation between local patterns of the input and the target label. We focus on learning deep neural networks using a gradient-based algorithm, when the target function is a tree-structured Boolean...
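
A minimal sketch of measuring local correlation, under my own illustrative reading: take a small read-once AND/OR tree as the target and compute the correlation of each input bit with the label over the uniform input distribution (the tree below is hypothetical, not from the paper).

```python
import itertools
import numpy as np

def tree_target(x):  # hypothetical read-once AND/OR tree over 8 bits
    return ((x[0] & x[1]) | (x[2] & x[3])) & ((x[4] | x[5]) & (x[6] | x[7]))

X = np.array(list(itertools.product([0, 1], repeat=8)))  # uniform inputs
y = np.array([tree_target(x) for x in X])
for i in range(8):
    print(f"bit {i}: correlation with label = {np.corrcoef(X[:, i], y)[0, 1]:+.3f}")
```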

Alexander Mozeika, David Saad, Jack Raymond
Disordered Systems and Neural Networks
Statistical Mechanics
Adaptation and Self-Organizing Systems

Typical properties of computing circuits composed of noisy logical gates are studied using the statistical physics methodology. A growth model that gives rise to typical random Boolean functions is mapped onto a layered Ising spin system, which facilitates the study of their ability to represent arbitrary formulae with a given level of error, the tolerable level of gate noise, and its dependence on the formula depth and complexity, the gates used, and properties of the functi...
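
A small simulation sketch of the noisy-gate setting, assuming an ε-noise model in which every gate's output is flipped independently with probability ε (the tree shape, leaf distribution, and parameters are assumptions): evaluate a complete binary NAND tree on the same random leaves with and without noise and track how the output error grows with formula depth.

```python
import random

def nand_tree(leaves, eps):
    """Evaluate a complete binary NAND tree; each gate's output flips w.p. eps."""
    layer = list(leaves)
    while len(layer) > 1:
        layer = [(1 - (layer[i] & layer[i + 1])) ^ (random.random() < eps)
                 for i in range(0, len(layer), 2)]
    return layer[0]

eps, trials = 0.02, 5000
for depth in (2, 4, 6, 8):
    errs = sum(
        nand_tree(leaves, eps) != nand_tree(leaves, 0.0)
        for leaves in ([random.randint(0, 1) for _ in range(2 ** depth)]
                       for _ in range(trials)))
    print(f"depth {depth}: output error rate ~{errs / trials:.3f}")
```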

Alexander Mozeika, David Saad, Jack Raymond
Disordered Systems and Neural Networks
Computational Complexity

Computing circuits composed of noisy logical gates and their ability to represent arbitrary Boolean functions with a given level of error are investigated within a statistical mechanics setting. Bounds on their performance, derived in the information theory literature for specific gates, are straightforwardly retrieved, generalized and identified as the corresponding typical-case phase transitions. This framework paves the way for obtaining new results on error-rates, functio...

Rongrong Xie, Matteo Marsili
Disordered Systems and Neural Networks
Machine Learning

We study a generic ensemble of deep belief networks which is parametrized by the distribution of energy levels of the hidden states of each layer. We show that, within a random energy approach, statistical dependence can propagate from the visible to deep layers only if each layer is tuned close to the critical point during learning. As a consequence, efficiently trained learning machines are characterised by a broad distribution of energy levels. The analysis of Deep Belief ...

Leonardo Franco
Disordered Systems and Neural Networks
Neurons and Cognition

We define a measure for the complexity of Boolean functions related to their implementation in neural networks, and in particular closely related to the generalization ability that could be obtained through the learning process. The measure is computed by counting the number of neighboring examples that differ in their output value. Pairs of these examples have been previously shown to be part of the minimum-size training set needed to obtain perfect generalization in ...
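
As I read the abstract, the measure counts pairs of Hamming-neighbour inputs whose outputs differ, i.e. the size of the function's decision boundary on the hypercube; a minimal sketch under that assumption:

```python
import itertools

def boundary_pairs(f, n):
    """Count hypercube edges (x, x with bit i flipped) on which f changes value."""
    count = 0
    for bits in itertools.product([0, 1], repeat=n):
        for i in range(n):
            if bits[i] == 0:  # visit each edge exactly once
                flipped = bits[:i] + (1,) + bits[i + 1:]
                count += f(bits) != f(flipped)
    return count

n = 4
print(boundary_pairs(lambda b: sum(b) % 2, n))  # parity: n * 2^(n-1) = 32, maximal
print(boundary_pairs(lambda b: 0, n))           # constant function: 0, simplest
```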

Maciej Skórski
Information Theory

This paper studies the notion of computational entropy. Using techniques from convex optimization, we investigate the following problems: (a) Can we derandomize the computational entropy? More precisely, for the computational entropy, what is the real difference in security defined using the three important classes of circuits: deterministic Boolean, deterministic real-valued, or (the most powerful) randomized ones? (b) How large the difference in the computational entropy fo...