The role of prior information and comput...

A Mathematical Theory of Learning

May 7, 2014

87% Match

Ibrahim Alabdulmohsin

Machine Learning

Artificial Intelligence

Information Theory

In this paper, a mathematical theory of learning is proposed that has many parallels with information theory. We consider Vapnik's General Setting of Learning in which the learning process is defined to be the act of selecting a hypothesis in response to a given training set. Such hypothesis can, for example, be a decision boundary in classification, a set of centroids in clustering, or a set of frequent item-sets in association rule mining. Depending on the hypothesis space ...

Find SimilarView on arXiv

The no-free-lunch theorems of supervised learning

February 9, 2022

87% Match

Tom F. Sterkenburg, Peter D. Grünwald

Machine Learning

The no-free-lunch theorems promote a skeptical conclusion that all possible machine learning algorithms equally lack justification. But how could this leave room for a learning theory, that shows that some algorithms are better than others? Drawing parallels to the philosophy of induction, we point out that the no-free-lunch results presuppose a conception of learning algorithms as purely data-driven. On this conception, every algorithm must have an inherent inductive bias, t...

Find SimilarView on arXiv

What is Learning? A primary discussion about information and Representation

May 19, 2015

87% Match

Hao Wu

Artificial Intelligence

Nowadays, represented by Deep Learning techniques, the field of machine learning is experiencing unprecedented prosperity and its influence is demonstrated in academia, industry and civil society. "Intelligent" has become a label which could not be neglected for most applications; celebrities and scientists also warned that the development of full artificial intelligence may spell the end of the human race. It seems that the answer to building a computer system that could aut...

Find SimilarView on arXiv

An Information-Theoretic Perspective on Overfitting and Underfitting

October 13, 2020

87% Match

Daniel Bashir, George D. Montanez, Sonia Sehra, ... , Lauw Julius

Machine Learning

Artificial Intelligence

Information Theory

Machine Learning

We present an information-theoretic framework for understanding overfitting and underfitting in machine learning and prove the formal undecidability of determining whether an arbitrary classification algorithm will overfit a dataset. Measuring algorithm capacity via the information transferred from datasets to models, we consider mismatches between algorithm capacities and datasets to provide a signature for when a model can overfit or underfit a dataset. We present results u...

Find SimilarView on arXiv

A complexity analysis of statistical learning algorithms

December 19, 2012

87% Match

Mark A. Kon

Machine Learning

We apply information-based complexity analysis to support vector machine (SVM) algorithms, with the goal of a comprehensive continuous algorithmic analysis of such algorithms. This involves complexity measures in which some higher order operations (e.g., certain optimizations) are considered primitive for the purposes of measuring complexity. We consider classes of information operators and algorithms made up of scaled families, and investigate the utility of scaling the comp...

Find SimilarView on arXiv

Logic of Machine Learning

June 16, 2020

87% Match

Marina Sapir

Machine Learning

The main question is: why and how can we ever predict based on a finite sample? The question is not answered by statistical learning theory. Here, I suggest that prediction requires belief in "predictability" of the underlying dependence, and learning involves search for a hypothesis where these beliefs are violated the least given the observations. The measure of these violations ("errors") for given data, hypothesis and particular type of predictability beliefs is formalize...

Find SimilarView on arXiv

Parsimonious Inference

March 3, 2021

87% Match

Jed A. Duersch, Thomas A. Catanach

Machine Learning

Information Theory

Machine Learning

Information Theory

Bayesian inference provides a uniquely rigorous approach to obtain principled justification for uncertainty in predictions, yet it is difficult to articulate suitably general prior belief in the machine learning context, where computational architectures are pure abstractions subject to frequent modifications by practitioners attempting to improve results. Parsimonious inference is an information-theoretic formulation of inference over arbitrary architectures that formalizes ...

Find SimilarView on arXiv

Information, learning and falsification

October 17, 2011

87% Match

David Balduzzi

Information Theory

Machine Learning

Information Theory

Machine Learning

There are (at least) three approaches to quantifying information. The first, algorithmic information or Kolmogorov complexity, takes events as strings and, given a universal Turing machine, quantifies the information content of a string as the length of the shortest program producing it. The second, Shannon information, takes events as belonging to ensembles and quantifies the information resulting from observing the given event in terms of the number of alternate events that...

Find SimilarView on arXiv

Statistical Learning Theory: Models, Concepts, and Results

October 27, 2008

87% Match

Luxburg Ulrike von, Bernhard Schoelkopf

Machine Learning

Statistics Theory

Statistical learning theory provides the theoretical basis for many of today's machine learning algorithms. In this article we attempt to give a gentle, non-technical overview over the key ideas and insights of statistical learning theory. We target at a broad audience, not necessarily machine learning researchers. This paper can serve as a starting point for people who want to get an overview on the field before diving into technical details.

Find SimilarView on arXiv

Pac-Bayesian Supervised Classification: The Thermodynamics of Statistical Learning

December 3, 2007

87% Match

Olivier Catoni

Machine Learning

This monograph deals with adaptive supervised classification, using tools borrowed from statistical mechanics and information theory, stemming from the PACBayesian approach pioneered by David McAllester and applied to a conception of statistical learning theory forged by Vladimir Vapnik. Using convex analysis on the set of posterior probability measures, we show how to get local measures of the complexity of the classification model involving the relative entropy of posterior...

Find SimilarView on arXiv

The role of prior information and computational power in Machine Learning

A Mathematical Theory of Learning

The no-free-lunch theorems of supervised learning

What is Learning? A primary discussion about information and Representation

An Information-Theoretic Perspective on Overfitting and Underfitting

A complexity analysis of statistical learning algorithms

Logic of Machine Learning

Parsimonious Inference

Information, learning and falsification

Statistical Learning Theory: Models, Concepts, and Results

Pac-Bayesian Supervised Classification: The Thermodynamics of Statistical Learning