Machine learning and invariant theory

Provably Strict Generalisation Benefit for Equivariant Models

February 20, 2021

90% Match

Bryn Elesedy, Sheheryar Zaidi

Machine Learning

It is widely believed that engineering a model to be invariant/equivariant improves generalisation. Despite the growing popularity of this approach, a precise characterisation of the generalisation benefit is lacking. By considering the simplest case of linear models, this paper provides the first provably non-zero improvement in generalisation for invariant/equivariant models when the target distribution is invariant/equivariant with respect to a compact group. Moreover, our...

Find SimilarView on arXiv

Decomposition of Equivariant Maps via Invariant Maps: Application to Universal Approximation under Symmetry

September 25, 2024

90% Match

Akiyoshi Sannai, Yuuki Takai, Matthieu Cordonnier

Machine Learning

In this paper, we develop a theory about the relationship between invariant and equivariant maps with regard to a group $G$. We then leverage this theory in the context of deep neural networks with group symmetries in order to obtain novel insight into their mechanisms. More precisely, we establish a one-to-one relationship between equivariant maps and certain invariant maps. This allows us to reduce arguments for equivariant maps to those for invariant maps and vice versa. A...

Find SimilarView on arXiv

Universal Equivariant Multilayer Perceptrons

February 7, 2020

90% Match

Siamak Ravanbakhsh

Machine Learning

Neural and Evolutionary Comp...

Group Theory

Machine Learning

Group invariant and equivariant Multilayer Perceptrons (MLP), also known as Equivariant Networks, have achieved remarkable success in learning on a variety of data structures, such as sequences, images, sets, and graphs. Using tools from group theory, this paper proves the universality of a broad class of equivariant MLPs with a single hidden layer. In particular, it is shown that having a hidden layer on which the group acts regularly is sufficient for universal equivariance...

Find SimilarView on arXiv

Symmetry and Generalisation in Machine Learning

January 7, 2025

89% Match

Hayder Elesedy

Machine Learning

This work is about understanding the impact of invariance and equivariance on generalisation in supervised learning. We use the perspective afforded by an averaging operator to show that for any predictor that is not equivariant, there is an equivariant predictor with strictly lower test risk on all regression problems where the equivariance is correctly specified. This constitutes a rigorous proof that symmetry, in the form of invariance or equivariance, is a useful inductiv...

Find SimilarView on arXiv

A Characterization Theorem for Equivariant Networks with Point-wise Activations

January 17, 2024

89% Match

Marco Pacini, Xiaowen Dong, ... , Santin Gabriele

Machine Learning

Artificial Intelligence

Equivariant neural networks have shown improved performance, expressiveness and sample complexity on symmetrical domains. But for some specific symmetries, representations, and choice of coordinates, the most common point-wise activations, such as ReLU, are not equivariant, hence they cannot be employed in the design of equivariant neural networks. The theorem we present in this paper describes all possible combinations of finite-dimensional representations, choice of coordin...

Find SimilarView on arXiv

Universal approximations of permutation invariant/equivariant functions by deep neural networks

March 5, 2019

89% Match

Akiyoshi Sannai, Yuuki Takai, Matthieu Cordonnier

Machine Learning

In this paper, we develop a theory about the relationship between $G$-invariant/equivariant functions and deep neural networks for finite group $G$. Especially, for a given $G$-invariant/equivariant function, we construct its universal approximator by deep neural network whose layers equip $G$-actions and each affine transformations are $G$-equivariant/invariant. Due to representation theory, we can show that this approximator has exponentially fewer free parameters than usua...

Find SimilarView on arXiv

A New Neural Network Architecture Invariant to the Action of Symmetry Subgroups

December 11, 2020

89% Match

Piotr Kicki, Mete Ozay, Piotr Skrzypczyński

Machine Learning

Artificial Intelligence

We propose a computationally efficient $G$-invariant neural network that approximates functions invariant to the action of a given permutation subgroup $G \leq S_n$ of the symmetric group on input data. The key element of the proposed network architecture is a new $G$-invariant transformation module, which produces a $G$-invariant latent representation of the input data. Theoretical considerations are supported by numerical experiments, which demonstrate the effectiveness and...

Find SimilarView on arXiv

Learning with Group Invariant Features: A Kernel Perspective

June 8, 2015

89% Match

Youssef Mroueh, Stephen Voinea, Tomaso Poggio

Machine Learning

Computer Vision and Pattern ...

Machine Learning

We analyze in this paper a random feature map based on a theory of invariance I-theory introduced recently. More specifically, a group invariant signal signature is obtained through cumulative distributions of group transformed random projections. Our analysis bridges invariant feature learning with kernel methods, as we show that this feature map defines an expected Haar integration kernel that is invariant to the specified group action. We show how this non-linear random fe...

Find SimilarView on arXiv

Equivariant neural networks and piecewise linear representation theory

August 2, 2024

89% Match

Joel Gibson, Daniel Tubbenhauer, Geordie Williamson

Machine Learning

Group Theory

Representation Theory

Machine Learning

Equivariant neural networks are neural networks with symmetry. Motivated by the theory of group representations, we decompose the layers of an equivariant neural network into simple representations. The nonlinear activation functions lead to interesting nonlinear equivariant maps between simple representations. For example, the rectified linear unit (ReLU) gives rise to piecewise linear maps. We show that these considerations lead to a filtration of equivariant neural network...

Find SimilarView on arXiv

Scalars are universal: Equivariant machine learning, structured like classical physics

June 11, 2021

89% Match

Soledad JHU Villar, David W. Flatiron, NYU Hogg, Kate NYU Storey-Fisher, ... , Blum-Smith Ben NYU

Machine Learning

Mathematical Physics

Machine Learning

There has been enormous progress in the last few years in designing neural networks that respect the fundamental symmetries and coordinate freedoms of physical law. Some of these frameworks make use of irreducible representations, some make use of high-order tensor objects, and some apply symmetry-enforcing constraints. Different physical laws obey different combinations of fundamental symmetries, but a large fraction (possibly all) of classical physics is equivariant to tran...

Find SimilarView on arXiv