What do AI algorithms actually learn? - On false structures in deep learning

June 4, 2019

View on ArXiv

Laura Thesing, Vegard Antun, Anders C. Hansen

Statistics

Computer Science

Machine Learning

Cryptography and Security

Computer Vision and Pattern ...

Machine Learning

There are two big unsolved mathematical questions in artificial intelligence (AI): (1) Why is deep learning so successful in classification problems and (2) why are neural nets based on deep learning at the same time universally unstable, where the instabilities make the networks vulnerable to adversarial attacks. We present a solution to these questions that can be summed up in two words; false structures. Indeed, deep learning does not learn the original structures that humans use when recognising images (cats have whiskers, paws, fur, pointy ears, etc), but rather different false structures that correlate with the original structure and hence yield the success. However, the false structure, unlike the original structure, is unstable. The false structure is simpler than the original structure, hence easier to learn with less data and the numerical algorithm used in the training will more easily converge to the neural network that captures the false structure. We formally define the concept of false structures and formulate the solution as a conjecture. Given that trained neural networks always are computed with approximations, this conjecture can only be established through a combination of theoretical and computational results similar to how one establishes a postulate in theoretical physics (e.g. the speed of light is constant). Establishing the conjecture fully will require a vast research program characterising the false structures. We provide the foundations for such a program establishing the existence of the false structures in practice. Finally, we discuss the far reaching consequences the existence of the false structures has on state-of-the-art AI and Smale's 18th problem.

The mathematics of adversarial attacks in AI -- Why deep learning is unstable despite the existence of stable neural networks

September 13, 2021

92% Match

Alexander Bastounis, Anders C Hansen, Verner Vlačić

Machine Learning

Computer Vision and Pattern ...

Numerical Analysis

Machine Learning

The unprecedented success of deep learning (DL) makes it unchallenged when it comes to classification problems. However, it is well established that the current DL methodology produces universally unstable neural networks (NNs). The instability problem has caused an enormous research effort -- with a vast literature on so-called adversarial attacks -- yet there has been no solution to the problem. Our paper addresses why there has been no solution to the problem, as we prove ...

Find SimilarView on arXiv

Identifying and Exploiting Structures for Reliable Deep Learning

August 16, 2021

90% Match

Amartya Sanyal

Machine Learning

Deep learning research has recently witnessed an impressively fast-paced progress in a wide range of tasks including computer vision, natural language processing, and reinforcement learning. The extraordinary performance of these systems often gives the impression that they can be used to revolutionise our lives for the better. However, as recent works point out, these systems suffer from several issues that make them unreliable for use in the real world, including vulnerabil...

Find SimilarView on arXiv

The Unreasonable Effectiveness of Deep Learning in Artificial Intelligence

February 12, 2020

90% Match

Terrence J. Sejnowski

Neurons and Cognition

Artificial Intelligence

Machine Learning

Neural and Evolutionary Comp...

Deep learning networks have been trained to recognize speech, caption photographs and translate text between languages at high levels of performance. Although applications of deep learning networks to real world problems have become ubiquitous, our understanding of why they are so effective is lacking. These empirical results should not be possible according to sample complexity in statistics and non-convex optimization theory. However, paradoxes in the training and effective...

Find SimilarView on arXiv

Breaking the Curse of Dimensionality in Deep Neural Networks by Learning Invariant Representations

October 24, 2023

90% Match

Leonardo Petrini

Machine Learning

Artificial intelligence, particularly the subfield of machine learning, has seen a paradigm shift towards data-driven models that learn from and adapt to data. This has resulted in unprecedented advancements in various domains such as natural language processing and computer vision, largely attributed to deep learning, a special class of machine learning models. Deep learning arguably surpasses traditional approaches by learning the relevant features from raw data through a s...

Find SimilarView on arXiv

The Mathematics of Artificial Intelligence

March 16, 2022

89% Match

Gitta Kutyniok

Machine Learning

History and Overview

Machine Learning

We currently witness the spectacular success of artificial intelligence in both science and public life. However, the development of a rigorous mathematical foundation is still at an early stage. In this survey article, which is based on an invited lecture at the International Congress of Mathematicians 2022, we will in particular focus on the current "workhorse" of artificial intelligence, namely deep neural networks. We will present the main theoretical directions along wit...

Find SimilarView on arXiv

Adversarial Examples and the Deeper Riddle of Induction: The Need for a Theory of Artifacts in Deep Learning

March 20, 2020

89% Match

Cameron Buckner

Machine Learning

Artificial Intelligence

Deep learning is currently the most widespread and successful technology in artificial intelligence. It promises to push the frontier of scientific discovery beyond current limits. However, skeptics have worried that deep neural networks are black boxes, and have called into question whether these advances can really be deemed scientific progress if humans cannot understand them. Relatedly, these systems also possess bewildering new vulnerabilities: most notably a susceptibil...

Find SimilarView on arXiv

The Modern Mathematics of Deep Learning

May 9, 2021

89% Match

Julius Berner, Philipp Grohs, ... , Petersen Philipp

Machine Learning

We describe the new field of mathematical analysis of deep learning. This field emerged around a list of research questions that were not answered within the classical framework of learning theory. These questions concern: the outstanding generalization power of overparametrized neural networks, the role of depth in deep architectures, the apparent absence of the curse of dimensionality, the surprisingly successful optimization performance despite the non-convexity of the pro...

Find SimilarView on arXiv

Why & When Deep Learning Works: Looking Inside Deep Learnings

May 10, 2017

89% Match

Ronny Ronen

Machine Learning

The Intel Collaborative Research Institute for Computational Intelligence (ICRI-CI) has been heavily supporting Machine Learning and Deep Learning research from its foundation in 2012. We have asked six leading ICRI-CI Deep Learning researchers to address the challenge of "Why & When Deep Learning works", with the goal of looking inside Deep Learning, providing insights on how deep networks function, and uncovering key observations on their expressiveness, limitations, and po...

Find SimilarView on arXiv

On the Learnability of Physical Concepts: Can a Neural Network Understand What's Real?

July 25, 2022

89% Match

Alessandro Achille, Stefano Soatto

Machine Learning

Artificial Intelligence

Computer Vision and Pattern ...

We revisit the classic signal-to-symbol barrier in light of the remarkable ability of deep neural networks to generate realistic synthetic data. DeepFakes and spoofing highlight the feebleness of the link between physical reality and its abstract representation, whether learned by a digital computer or a biological agent. Starting from a widely applicable definition of abstract concept, we show that standard feed-forward architectures cannot capture but trivial concepts, rega...

Find SimilarView on arXiv

Can stable and accurate neural networks be computed? -- On the barriers of deep learning and Smale's 18th problem

January 20, 2021

89% Match

Matthew J. Colbrook, Vegard Antun, Anders C. Hansen

Machine Learning

Computer Vision and Pattern ...

Numerical Analysis

Neural and Evolutionary Comp...

Numerical Analysis

Deep learning (DL) has had unprecedented success and is now entering scientific computing with full force. However, current DL methods typically suffer from instability, even when universal approximation properties guarantee the existence of stable neural networks (NNs). We address this paradox by demonstrating basic well-conditioned problems in scientific computing where one can prove the existence of NNs with great approximation qualities, however, there does not exist any ...

Find SimilarView on arXiv