Learning Boolean Circuits with Neural Networks

October 25, 2019

Eran Malach, Shai Shalev-Shwartz

Computer Science

Statistics

Machine Learning

While on some natural distributions, neural-networks are trained efficiently using gradient-based algorithms, it is known that learning them is computationally hard in the worst-case. To separate hard from easy to learn distributions, we observe the property of local correlation: correlation between local patterns of the input and the target label. We focus on learning deep neural-networks using a gradient-based algorithm, when the target function is a tree-structured Boolean circuit. We show that in this case, the existence of correlation between the gates of the circuit and the target label determines whether the optimization succeeds or fails. Using this result, we show that neural-networks can learn the (log n)-parity problem for most product distributions. These results hint that local correlation may play an important role in separating easy/hard to learn distributions. We also obtain a novel depth separation result, in which we show that a shallow network cannot express some functions, while there exists an efficient gradient-based algorithm that can learn the very same functions using a deep network. The negative expressivity result for shallow networks is obtained by a reduction from results in communication complexity, that may be of independent interest.

When Hardness of Approximation Meets Hardness of Learning

August 18, 2020

90% Match

Eran Malach, Shai Shalev-Shwartz

Machine Learning

A supervised learning algorithm has access to a distribution of labeled examples, and needs to return a function (hypothesis) that correctly labels the examples. The hypothesis of the learner is taken from some fixed class of functions (e.g., linear classifiers, neural networks etc.). A failure of the learning algorithm can occur due to two possible reasons: wrong choice of hypothesis class (hardness of approximation), or failure to find the best function within the hypothesi...

Find SimilarView on arXiv

Learning Parities with Neural Networks

February 18, 2020

89% Match

Amit Daniely, Eran Malach

Machine Learning

In recent years we see a rapidly growing line of research which shows learnability of various models via common neural network algorithms. Yet, besides a very few outliers, these results show learnability of models that can be learned using linear methods. Namely, such results show that learning neural-networks with gradient-descent is competitive with learning a linear classifier on top of a data-independent representation of the examples. This leaves much to be desired, as ...

Find SimilarView on arXiv

Distribution-Specific Hardness of Learning Neural Networks

September 5, 2016

88% Match

Ohad Shamir

Machine Learning

Neural and Evolutionary Comp...

Machine Learning

Although neural networks are routinely and successfully trained in practice using simple gradient-based methods, most existing theoretical results are negative, showing that learning such networks is difficult, in a worst-case sense over all data distributions. In this paper, we take a more nuanced view, and consider whether specific assumptions on the "niceness" of the input distribution, or "niceness" of the target function (e.g. in terms of smoothness, non-degeneracy, inco...

Find SimilarView on arXiv

From Local Pseudorandom Generators to Hardness of Learning

January 20, 2021

88% Match

Amit Daniely, Gal Vardi

Machine Learning

We prove hardness-of-learning results under a well-studied assumption on the existence of local pseudorandom generators. As we show, this assumption allows us to surpass the current state of the art, and prove hardness of various basic problems, with no hardness results to date. Our results include: hardness of learning shallow ReLU neural networks under the Gaussian distribution and other distributions; hardness of learning intersections of $\omega(1)$ halfspaces, DNF form...

Find SimilarView on arXiv

Rethinking Arithmetic for Deep Neural Networks

May 7, 2019

87% Match

George A. Constantinides

Machine Learning

Hardware Architecture

Neural and Evolutionary Comp...

Machine Learning

We consider efficiency in the implementation of deep neural networks. Hardware accelerators are gaining interest as machine learning becomes one of the drivers of high-performance computing. In these accelerators, the directed graph describing a neural network can be implemented as a directed graph describing a Boolean circuit. We make this observation precise, leading naturally to an understanding of practical neural networks as discrete functions, and show that so-called bi...

Find SimilarView on arXiv

Adaptive n-ary Activation Functions for Probabilistic Boolean Logic

March 16, 2022

87% Match

Jed A. Duersch, Thomas A. Catanach, Niladri Das

Machine Learning

Artificial Intelligence

Balancing model complexity against the information contained in observed data is the central challenge to learning. In order for complexity-efficient models to exist and be discoverable in high dimensions, we require a computational framework that relates a credible notion of complexity to simple parameter representations. Further, this framework must allow excess complexity to be gradually removed via gradient-based optimization. Our n-ary, or n-argument, activation function...

Find SimilarView on arXiv

Are Efficient Deep Representations Learnable?

July 17, 2018

87% Match

Maxwell Nye, Andrew Saxe

Machine Learning

Neural and Evolutionary Comp...

Machine Learning

Many theories of deep learning have shown that a deep network can require dramatically fewer resources to represent a given function compared to a shallow network. But a question remains: can these efficient representations be learned using current deep learning techniques? In this work, we test whether standard deep learning methods can in fact find the efficient representations posited by several theories of deep representation. Specifically, we train deep neural networks t...

Find SimilarView on arXiv

Provable limitations of deep learning

December 16, 2018

87% Match

Emmanuel Abbe, Colin Sandon

Machine Learning

Computational Complexity

Information Theory

Machine Learning

As the success of deep learning reaches more grounds, one would like to also envision the potential limits of deep learning. This paper gives a first set of results proving that certain deep learning algorithms fail at learning certain efficiently learnable functions. The results put forward a notion of cross-predictability that characterizes when such failures take place. Parity functions provide an extreme example with a cross-predictability that decays exponentially, while...

Find SimilarView on arXiv

A new approach in machine learning

September 14, 2014

87% Match

Alain Tapp

Machine Learning

In this technical report we presented a novel approach to machine learning. Once the new framework is presented, we will provide a simple and yet very powerful learning algorithm which will be benchmark on various dataset. The framework we proposed is based on booleen circuits; more specifically the classifier produced by our algorithm have that form. Using bits and boolean gates instead of real numbers and multiplication enable the the learning algorithm and classifier to ...

Find SimilarView on arXiv

Understanding Boolean Function Learnability on Deep Neural Networks: PAC Learning Meets Neurosymbolic Models

September 13, 2020

86% Match

Marcio Nicolau, Anderson R. Tavares, Zhiwei Zhang, Pedro Avelar, João M. Flach, ... , Vardi Moshe Y.

Machine Learning

Computational learning theory states that many classes of boolean formulas are learnable in polynomial time. This paper addresses the understudied subject of how, in practice, such formulas can be learned by deep neural networks. Specifically, we analyze boolean formulas associated with model-sampling benchmarks, combinatorial optimization problems, and random 3-CNFs with varying degrees of constrainedness. Our experiments indicate that: (i) neural learning generalizes better...

Find SimilarView on arXiv