Grokking Group Multiplication with Coset...

A PAC-Bayesian Generalization Bound for Equivariant Networks

October 24, 2022

85% Match

Arash Behboodi, Gabriele Cesa, Taco Cohen

Machine Learning

Equivariant networks capture the inductive bias about the symmetry of the learning task by building those symmetries into the model. In this paper, we study how equivariance relates to generalization error utilizing PAC Bayesian analysis for equivariant networks, where the transformation laws of feature spaces are determined by group representations. By using perturbation analysis of equivariant networks in Fourier domain for each layer, we derive norm-based PAC-Bayesian gene...

Find SimilarView on arXiv

Accelerated Discovery of Machine-Learned Symmetries: Deriving the Exceptional Lie Groups G2, F4 and E6

July 10, 2023

85% Match

Roy T. Forestano, Konstantin T. Matchev, Katia Matcheva, Alexander Roman, ... , Verner Sarunas

Machine Learning

Group Theory

Mathematical Physics

Recent work has applied supervised deep learning to derive continuous symmetry transformations that preserve the data labels and to obtain the corresponding algebras of symmetry generators. This letter introduces two improved algorithms that significantly speed up the discovery of these symmetry transformations. The new methods are demonstrated by deriving the complete set of generators for the unitary groups U(n) and the exceptional Lie groups $G_2$, $F_4$, and $E_6$. A thir...

Find SimilarView on arXiv

Symmetry constrained machine learning

November 16, 2018

85% Match

Doron L. Bergman

Machine Learning

Data Analysis, Statistics an...

Symmetry, a central concept in understanding the laws of nature, has been used for centuries in physics, mathematics, and chemistry, to help make mathematical models tractable. Yet, despite its power, symmetry has not been used extensively in machine learning, until rather recently. In this article we show a general way to incorporate symmetries into machine learning models. We demonstrate this with a detailed analysis on a rather simple real world machine learning system - a...

Find SimilarView on arXiv

Identifying the Group-Theoretic Structure of Machine-Learned Symmetries

September 14, 2023

85% Match

Roy T. Forestano, Konstantin T. Matchev, Katia Matcheva, Alexander Roman, ... , Verner Sarunas

Machine Learning

Group Theory

Mathematical Physics

Deep learning was recently successfully used in deriving symmetry transformations that preserve important physics quantities. Being completely agnostic, these techniques postpone the identification of the discovered symmetries to a later stage. In this letter we propose methods for examining and identifying the group-theoretic structure of such machine-learned symmetries. We design loss functions which probe the subalgebra structure either during the deep learning stage of sy...

Find SimilarView on arXiv

A Practical Method for Constructing Equivariant Multilayer Perceptrons for Arbitrary Matrix Groups

April 19, 2021

85% Match

Marc Finzi, Max Welling, Andrew Gordon Wilson

Machine Learning

Dynamical Systems

Machine Learning

Symmetries and equivariance are fundamental to the generalization of neural networks on domains such as images, graphs, and point clouds. Existing work has primarily focused on a small number of groups, such as the translation, rotation, and permutation groups. In this work we provide a completely general algorithm for solving for the equivariant layers of matrix groups. In addition to recovering solutions from other works as special cases, we construct multilayer perceptrons...

Find SimilarView on arXiv

A Classification of $G$-invariant Shallow Neural Networks

May 18, 2022

85% Match

Devanshu Agrawal, James Ostrowski

Machine Learning

When trying to fit a deep neural network (DNN) to a $G$-invariant target function with $G$ a group, it only makes sense to constrain the DNN to be $G$-invariant as well. However, there can be many different ways to do this, thus raising the problem of ``$G$-invariant neural architecture design'': What is the optimal $G$-invariant architecture for a given problem? Before we can consider the optimization problem itself, we must understand the search space, the architectures in ...

Find SimilarView on arXiv

On the hardness of learning under symmetries

January 3, 2024

85% Match

Bobak T. Kiani, Thien Le, Hannah Lawrence, ... , Weber Melanie

Machine Learning

Data Structures and Algorith...

Statistics Theory

Machine Learning

Statistics Theory

We study the problem of learning equivariant neural networks via gradient descent. The incorporation of known symmetries ("equivariance") into neural nets has empirically improved the performance of learning pipelines, in domains ranging from biology to computer vision. However, a rich yet separate line of learning theoretic research has demonstrated that actually learning shallow, fully-connected (i.e. non-symmetric) networks has exponential complexity in the correlational s...

Find SimilarView on arXiv

Progress measures for grokking via mechanistic interpretability

January 12, 2023

85% Match

Neel Nanda, Lawrence Chan, Tom Lieberum, ... , Steinhardt Jacob

Machine Learning

Artificial Intelligence

Neural networks often exhibit emergent behavior, where qualitatively new capabilities arise from scaling up the amount of parameters, training data, or training steps. One approach to understanding emergence is to find continuous \textit{progress measures} that underlie the seemingly discontinuous qualitative changes. We argue that progress measures can be found via mechanistic interpretability: reverse-engineering learned behaviors into their individual components. As a case...

Find SimilarView on arXiv

Universal Equivariant Multilayer Perceptrons

February 7, 2020

85% Match

Siamak Ravanbakhsh

Machine Learning

Neural and Evolutionary Comp...

Group Theory

Machine Learning

Group invariant and equivariant Multilayer Perceptrons (MLP), also known as Equivariant Networks, have achieved remarkable success in learning on a variety of data structures, such as sequences, images, sets, and graphs. Using tools from group theory, this paper proves the universality of a broad class of equivariant MLPs with a single hidden layer. In particular, it is shown that having a hidden layer on which the group acts regularly is sufficient for universal equivariance...

Find SimilarView on arXiv

Deep Learning Symmetries and Their Lie Groups, Algebras, and Subalgebras from First Principles

January 13, 2023

85% Match

Roy T. Forestano, Konstantin T. Matchev, Katia Matcheva, Alexander Roman, ... , Verner Sarunas

Machine Learning

Data Analysis, Statistics an...

We design a deep-learning algorithm for the discovery and identification of the continuous group of symmetries present in a labeled dataset. We use fully connected neural networks to model the symmetry transformations and the corresponding generators. We construct loss functions that ensure that the applied transformations are symmetries and that the corresponding set of generators forms a closed (sub)algebra. Our procedure is validated with several examples illustrating diff...

Find SimilarView on arXiv

Grokking Group Multiplication with Cosets

A PAC-Bayesian Generalization Bound for Equivariant Networks

Accelerated Discovery of Machine-Learned Symmetries: Deriving the Exceptional Lie Groups G2, F4 and E6

Symmetry constrained machine learning

Identifying the Group-Theoretic Structure of Machine-Learned Symmetries

A Practical Method for Constructing Equivariant Multilayer Perceptrons for Arbitrary Matrix Groups

A Classification of $G$-invariant Shallow Neural Networks

On the hardness of learning under symmetries

Progress measures for grokking via mechanistic interpretability

Universal Equivariant Multilayer Perceptrons

Deep Learning Symmetries and Their Lie Groups, Algebras, and Subalgebras from First Principles