The role of prior information and computational power in Machine Learning

October 31, 2022

Diego Marcondes, Adilson Simonis, Junior Barrera

Computer Science

Statistics

Machine Learning

Science consists on conceiving hypotheses, confronting them with empirical evidence, and keeping only hypotheses which have not yet been falsified. Under deductive reasoning they are conceived in view of a theory and confronted with empirical evidence in an attempt to falsify it, and under inductive reasoning they are conceived based on observation, confronted with empirical evidence and a theory is established based on the not falsified hypotheses. When the hypotheses testing can be performed with quantitative data, the confrontation can be achieved with Machine Learning methods, whose quality is highly dependent on the hypotheses' complexity, hence on the proper insertion of prior information into the set of hypotheses seeking to decrease its complexity without loosing good hypotheses. However, Machine Learning tools have been applied under the pragmatic view of instrumentalism, which is concerned only with the performance of the methods and not with the understanding of their behavior, leading to methods which are not fully understood. In this context, we discuss how prior information and computational power can be employed to solve a learning problem, but while prior information and a careful design of the hypotheses space has as advantage the interpretability of the results, employing high computational power has the advantage of a higher performance. We discuss why learning methods which combine both should work better from an understanding and performance perspective, arguing in favor of basic theoretical research on Machine Learning, in special about how properties of classifiers may be identified in parameters of modern learning models.

Information-Theoretic Foundations for Machine Learning

July 17, 2024

90% Match

Hong Jun Jeon, Roy Benjamin Van

Machine Learning

Artificial Intelligence

Machine Learning

The staggering progress of machine learning in the past decade has been a sight to behold. In retrospect, it is both remarkable and unsettling that these milestones were achievable with little to no rigorous theory to guide experimentation. Despite this fact, practitioners have been able to guide their future experimentation via observations from previous large-scale empirical investigations. However, alluding to Plato's Allegory of the cave, it is likely that the observation...

Find SimilarView on arXiv

Components of Machine Learning: Binding Bits and FLOPS

October 25, 2019

89% Match

Alexander Jung

Machine Learning

Many machine learning problems and methods are combinations of three components: data, hypothesis space and loss function. Different machine learning methods are obtained as combinations of different choices for the representation of data, hypothesis space and loss function. After reviewing the mathematical structure of these three components, we discuss intrinsic trade-offs between statistical and computational properties of machine learning methods.

Find SimilarView on arXiv

Learning the hypotheses space from data through a U-curve algorithm

September 8, 2021

89% Match

Diego Marcondes, Adilson Simonis, Junior Barrera

Machine Learning

This paper proposes a data-driven systematic, consistent and non-exhaustive approach to Model Selection, that is an extension of the classical agnostic PAC learning model. In this approach, learning problems are modeled not only by a hypothesis space $\mathcal{H}$, but also by a Learning Space $\mathbb{L}(\mathcal{H})$, a poset of subspaces of $\mathcal{H}$, which covers $\mathcal{H}$ and satisfies a property regarding the VC dimension of related subspaces, that is a suitable...

Find SimilarView on arXiv

Importance of the Mathematical Foundations of Machine Learning Methods for Scientific and Engineering Applications

August 7, 2018

89% Match

Paul J. Atzberger

Machine Learning

There has been a lot of recent interest in adopting machine learning methods for scientific and engineering applications. This has in large part been inspired by recent successes and advances in the domains of Natural Language Processing (NLP) and Image Classification (IC). However, scientific and engineering problems have their own unique characteristics and requirements raising new challenges for effective design and deployment of machine learning approaches. There is a str...

Find SimilarView on arXiv

A Reflection on Learning from Data: Epistemology Issues and Limitations

July 28, 2021

89% Match

Ahmad Hammoudeh, Sara Tedmori, Nadim Obeid

Machine Learning

Artificial Intelligence

Although learning from data is effective and has achieved significant milestones, it has many challenges and limitations. Learning from data starts from observations and then proceeds to broader generalizations. This framework is controversial in science, yet it has achieved remarkable engineering successes. This paper reflects on some epistemological issues and some of the limitations of the knowledge discovered in data. The document discusses the common perception that gett...

Find SimilarView on arXiv

Classifier Technology and the Illusion of Progress

June 19, 2006

89% Match

David J. Hand

Statistics Theory

A great many tools have been developed for supervised classification, ranging from early methods such as linear discriminant analysis through to modern developments such as neural networks and support vector machines. A large number of comparative studies have been conducted in attempts to establish the relative superiority of these methods. This paper argues that these comparisons often fail to take into account important aspects of real problems, so that the apparent superi...

Find SimilarView on arXiv

Theory of Machine Learning with Limited Data

June 15, 2022

88% Match

Marina Sapir

Artificial Intelligence

Machine Learning

Logic in Computer Science

Application of machine learning may be understood as deriving new knowledge for practical use through explaining accumulated observations, training set. Peirce used the term abduction for this kind of inference. Here I formalize the concept of abduction for real valued hypotheses, and show that 14 of the most popular textbook ML learners (every learner I tested), covering classification, regression and clustering, implement this concept of abduction inference. The approach is...

Find SimilarView on arXiv

A Theory of Machine Learning

July 8, 2024

88% Match

Jinsook Kim, Jinho Kang

Machine Learning

We critically review three major theories of machine learning and provide a new theory according to which machines learn a function when the machines successfully compute it. We show that this theory challenges common assumptions in the statistical and the computational learning theories, for it implies that learning true probabilities is equivalent neither to obtaining a correct calculation of the true probabilities nor to obtaining an almost-sure convergence to them. We als...

Find SimilarView on arXiv

Information Theory and its Relation to Machine Learning

January 18, 2015

88% Match

Bao-Gang Hu

Information Theory

Machine Learning

Information Theory

In this position paper, I first describe a new perspective on machine learning (ML) by four basic problems (or levels), namely, "What to learn?", "How to learn?", "What to evaluate?", and "What to adjust?". The paper stresses more on the first level of "What to learn?", or "Learning Target Selection". Towards this primary problem within the four levels, I briefly review the existing studies about the connection between information theoretical learning (ITL [1]) and machine le...

Find SimilarView on arXiv

A Falsificationist Account of Artificial Neural Networks

May 3, 2022

88% Match

Oliver Buchholz, Eric Raidl

Machine Learning

Artificial Intelligence

Machine Learning

Machine learning operates at the intersection of statistics and computer science. This raises the question as to its underlying methodology. While much emphasis has been put on the close link between the process of learning from data and induction, the falsificationist component of machine learning has received minor attention. In this paper, we argue that the idea of falsification is central to the methodology of machine learning. It is commonly thought that machine learning...

Find SimilarView on arXiv