Similar papers
February 12, 2020
Deep learning networks have been trained to recognize speech, caption photographs and translate text between languages at high levels of performance. Although applications of deep learning networks to real-world problems have become ubiquitous, our understanding of why they are so effective is lacking. These empirical results should not be possible according to sample complexity in statistics and non-convex optimization theory. However, paradoxes in the training and effective...
March 27, 2014
We investigate the consequences of natural conjectures of Montgomery type on the non-vanishing of Dirichlet $L$-functions at the central point. We first justify these conjectures using probabilistic arguments. We then show using a result of Bombieri, Friedlander and Iwaniec and a result of the author that they imply that almost all Dirichlet $L$-functions do not vanish at the central point. We also deduce a quantitative upper bound for the proportion of Dirichlet $L$-function...
August 4, 2020
This article considers "compressive learning," an approach to large-scale machine learning where datasets are massively compressed before learning (e.g., clustering, classification, or regression) is performed. In particular, a "sketch" is first constructed by computing carefully chosen nonlinear random features (e.g., random Fourier features) and averaging them over the whole dataset. Parameters are then learned from the sketch, without access to the original dataset. This a...
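As a rough illustration of the sketching step this abstract describes (not the paper's own code), here is a minimal Python sketch assuming complex-exponential random Fourier features; the frequency matrix Omega, the sketch size m, and the toy two-cluster dataset are hypothetical choices made only for the example.

import numpy as np

def compute_sketch(X, Omega):
    # Nonlinear random Fourier features exp(i * x . omega_j), one row per sample
    features = np.exp(1j * X @ Omega)
    # The sketch is the empirical average of the features over the whole dataset
    return features.mean(axis=0)

# Toy usage: a small two-cluster dataset in the plane
rng = np.random.default_rng(0)
n, d, m = 10_000, 2, 64                        # samples, data dimension, sketch size (illustrative)
X = np.concatenate([rng.normal(-2.0, 1.0, (n // 2, d)),
                    rng.normal(+2.0, 1.0, (n // 2, d))])
Omega = rng.normal(0.0, 1.0, (d, m))           # random frequencies drawn i.i.d. Gaussian
sketch = compute_sketch(X, Omega)              # m numbers summarizing the 10,000 points
print(sketch.shape)                            # (64,)

Parameters (for example cluster centres) would then be fitted to the vector sketch alone, without revisiting X.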
January 15, 2021
We review, for a general audience, a variety of recent experiments on extracting structure from machine-learning mathematical data that have been compiled over the years. Focusing on supervised machine-learning on labeled data from different fields ranging from geometry to representation theory, from combinatorics to number theory, we present a comparative study of the accuracies on different problems. The paradigm should be useful for conjecture formulation, finding more eff...
August 22, 2007
In this paper, we obtain an unconditional density theorem concerning the low-lying zeros of Hasse-Weil L-functions for a family of elliptic curves. From this together with the Riemann hypothesis for these L-functions, we infer the upper bound 27/14 (which is strictly less than 2) for the average rank of the elliptic curves in the family under consideration. This upper bound for the average rank enables us to deduce that, under the same assumption, a positive proportion of ell...
July 15, 2022
We use machine learning to predict the dimension of a lattice polytope directly from its Ehrhart series. This is highly effective, achieving almost 100% accuracy. We also use machine learning to recover the volume of a lattice polytope from its Ehrhart series, and to recover the dimension, volume, and quasi-period of a rational polytope from its Ehrhart series. In each case we achieve very high accuracy, and we propose mathematical explanations for why this should be so.
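To make the setup concrete, here is a small hedged sketch in Python: it builds a toy dataset from polytopes whose Ehrhart polynomials are known in closed form (dilates of unit cubes and standard simplices) and fits a simple classifier to predict the dimension from the first few Ehrhart values. The feature choice and model are illustrative, not the paper's pipeline.

import numpy as np
from math import comb
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Known Ehrhart polynomials: the k-th dilate of the unit d-cube has L(t) = (k t + 1)^d,
# and the k-th dilate of the standard d-simplex has L(t) = C(k t + d, d).
def ehrhart_values(poly, T=8):
    # Feature vector: log of the first T Ehrhart values L(1), ..., L(T)
    return [np.log(float(poly(t))) for t in range(1, T + 1)]

X, y = [], []
for d in range(1, 9):            # dimensions 1..8
    for k in range(1, 6):        # dilation factors 1..5
        X.append(ehrhart_values(lambda t, d=d, k=k: (k * t + 1) ** d)); y.append(d)
        X.append(ehrhart_values(lambda t, d=d, k=k: comb(k * t + d, d))); y.append(d)
X, y = np.array(X), np.array(y)

X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.25, random_state=0, stratify=y)
clf = LogisticRegression(max_iter=5000).fit(X_tr, y_tr)
print("held-out accuracy:", clf.score(X_te, y_te))

Since $L_P(t) = \mathrm{vol}(P)\,t^d + \text{lower-order terms}$, $\log L_P(t)$ grows like $d \log t + \log \mathrm{vol}(P)$, so a linear model on log-features can essentially read the dimension off the slope; this is one plausible reason such predictions can be so accurate.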
November 3, 2022
Building on the work of Iwaniec, Luo and Sarnak, we use the $n$-level density to bound the probability of vanishing to order at least $r$ at the central point for families of cuspidal newforms of prime level $N \to \infty$, split by sign. There are three methods to improve bounds on the order of vanishing: optimizing the test functions, increasing the support, and increasing the $n$-level density studied. Previous work determined the optimal test functions for the $1$ and $2$...
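For orientation, a generic instance of the Iwaniec-Luo-Sarnak positivity argument (not the paper's refined bounds): for an even Schwartz test function $\phi \ge 0$ with $\phi(0) > 0$ and suitably restricted Fourier support, the $1$-level density and the resulting bound on vanishing take the form
\[
D_1(f;\phi) \;=\; \sum_{\gamma_f} \phi\!\left(\frac{\gamma_f \log N}{2\pi}\right),
\qquad
\operatorname{Prob}_{f \in \mathcal{F}(N)}\!\bigl(\operatorname{ord}_{s=1/2} L(s,f) \ge r\bigr)
\;\le\; \frac{1}{r\,\phi(0)}\,\Bigl\langle D_1(f;\phi) \Bigr\rangle_{\mathcal{F}(N)},
\]
since, under GRH, each zero at the central point contributes $\phi(0) > 0$ to $D_1(f;\phi)$ and all remaining terms are nonnegative. Choosing $\phi$ optimally, widening its allowed support, or passing to $n$-level densities each tighten the right-hand side.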
March 31, 2022
In this work we consider the problem of data classification in post-classical settings where the number of training examples consists of merely a few data points. We explore the phenomenon and reveal key relationships between the dimensionality of the AI model's feature space, the non-degeneracy of data distributions, and the model's generalisation capabilities. The main thrust of our present analysis is on the influence of nonlinear feature transformations mapping original data into higher- ...
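A toy sketch of the mechanism this abstract points to, with every choice (concentric-circles data, a random cosine feature map, ten labelled points) hypothetical rather than taken from the paper: mapping the data nonlinearly into a higher-dimensional feature space can make a handful of labelled examples linearly separable.

import numpy as np
from sklearn.datasets import make_circles
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(1)
X, y = make_circles(n_samples=400, noise=0.05, factor=0.4, random_state=1)

# "Few-shot" training set: five labelled points per class, the rest held out
idx0 = rng.choice(np.flatnonzero(y == 0), size=5, replace=False)
idx1 = rng.choice(np.flatnonzero(y == 1), size=5, replace=False)
train_idx = np.concatenate([idx0, idx1])
test_idx = np.setdiff1d(np.arange(len(X)), train_idx)

# Random nonlinear feature map into a 200-dimensional space (cosine random features)
d_feat = 200
W = rng.normal(0.0, 3.0, (2, d_feat))
b = rng.uniform(0.0, 2.0 * np.pi, d_feat)
nonlinear = lambda Z: np.cos(Z @ W + b)

for name, transform in [("raw 2-D features", lambda Z: Z),
                        ("random nonlinear features", nonlinear)]:
    clf = LogisticRegression(max_iter=2000).fit(transform(X[train_idx]), y[train_idx])
    print(name, "- held-out accuracy:", round(clf.score(transform(X[test_idx]), y[test_idx]), 2))

The comparison between the two runs illustrates how the same linear learner, given the same ten labels, behaves very differently depending on the feature space it sees.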
June 24, 2012
The success of machine learning algorithms generally depends on data representation, and we hypothesize that this is because different representations can entangle and hide more or less the different explanatory factors of variation behind the data. Although specific domain knowledge can be used to help design representations, learning with generic priors can also be used, and the quest for AI is motivating the design of more powerful representation-learning algorithms implem...
November 26, 2019
Learning representations of data is an important problem in statistics and machine learning. While the origin of learning representations can be traced back to factor analysis and multidimensional scaling in statistics, it has become a central theme in deep learning with important applications in computer vision and computational neuroscience. In this article, we review recent advances in learning representations from a statistical perspective. In particular, we review the fo...