September 7, 2019
We propose a new scalable method to optimize the architecture of an artificial neural network. The proposed algorithm, called Greedy Search for Neural Network Architecture, aims to determine a neural network with a minimal number of layers that is at least as performant, in terms of accuracy and computational cost, as neural networks of the same structure identified by other hyperparameter search algorithms. Numerical experiments on benchmark datasets show that, for these d...
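As a rough illustration only (the fixed layer width, search budget, and stopping rule below are our assumptions, not the paper's algorithm), a greedy search over the number of layers might look like this sketch: add hidden layers one at a time and stop at the shallowest network whose validation accuracy no longer improves.

    # Greedy depth search sketch: grow the network one hidden layer at a
    # time; stop as soon as an extra layer stops helping on validation data.
    import numpy as np
    from sklearn.datasets import load_digits
    from sklearn.model_selection import train_test_split
    from sklearn.neural_network import MLPClassifier

    X, y = load_digits(return_X_y=True)
    X_tr, X_val, y_tr, y_val = train_test_split(X, y, random_state=0)

    width = 64                                  # assumed fixed layer width
    best_acc, best_depth = 0.0, 0
    for depth in range(1, 6):                   # try 1..5 hidden layers
        model = MLPClassifier(hidden_layer_sizes=(width,) * depth,
                              max_iter=300, random_state=0)
        model.fit(X_tr, y_tr)
        acc = model.score(X_val, y_val)
        if acc > best_acc:
            best_acc, best_depth = acc, depth   # deeper net helped: keep it
        else:
            break                               # greedy stop: no improvement
    print(f"selected depth: {best_depth} (val acc {best_acc:.3f})")

The greedy stop is what keeps the layer count minimal: the search never pays for depth that does not buy accuracy.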
August 22, 2019
Machine learning has made tremendous progress in recent years and received large amounts of public attention. Though we are still far from designing a fully artificially intelligent agent, machine learning has brought us many applications in which computers solve human learning tasks remarkably well. Much of this progress comes from a recent trend within machine learning, called deep learning. Deep learning models are responsible for many state-of-the-art applications of machi...
April 11, 2023
These lecture notes provide an overview of Neural Network architectures from a mathematical point of view. In particular, Machine Learning with Neural Networks is treated as an optimization problem. The notes cover an introduction to Neural Networks and the following architectures: Feedforward Neural Network, Convolutional Neural Network, ResNet, and Recurrent Neural Network.
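For orientation, the optimization viewpoint mentioned here is standardly the empirical-risk-minimization problem (a generic formulation, not a claim about the notes' exact notation): given training samples $(x_i, y_i)_{i=1}^{N}$, a network $f_\theta$ with parameters $\theta$, and a loss $\ell$, training solves
\[
\min_{\theta}\; \frac{1}{N}\sum_{i=1}^{N} \ell\big(f_\theta(x_i),\, y_i\big),
\]
and the listed architectures differ only in how $f_\theta$ is built from layers.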
October 12, 2024
Within one decade, Deep Learning overtook the dominant solution methods for countless problems in artificial intelligence. ``Deep'' refers to deep architectures performing operations in manifolds for which there are no immediate observations. For these deep architectures some kind of structure is pre-defined -- but what is this structure? With a formal definition of neural network structures, neural architecture search problems and solution methods can be formulated under ...
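One common way to state such a neural architecture search problem (a standard bilevel formulation, not necessarily the one developed in this work) is over a structure space $\mathcal{A}$:
\[
\min_{a \in \mathcal{A}}\; \mathcal{L}_{\mathrm{val}}\big(w^*(a),\, a\big)
\quad \text{s.t.} \quad
w^*(a) \in \operatorname*{arg\,min}_{w}\, \mathcal{L}_{\mathrm{train}}(w, a),
\]
where $a$ is a candidate structure, $w$ its trainable weights, and the outer problem selects the structure whose trained weights $w^*(a)$ generalize best.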
June 6, 2018
A new design methodology for neural networks, guided by traditional algorithm design, is presented. To prove our point, we present two heuristics and demonstrate an algorithmic technique for incorporating additional weights in their signal-flow graphs. We show that, with training, the performance of these networks can not only exceed that of the initial network but also match that of more traditional neural network architectures. A key feature of our...
April 21, 2020
Deep Neural Nets have hit quite a crest, But physical networks are where they must rest, And here we put them all to the test, To see which network optimization is best.
May 3, 2024
The paper aims to investigate relevant computational issues of deep neural network architectures, with an eye to the interaction between the optimization algorithm and the classification performance. In particular, we analyze the behaviour of state-of-the-art optimization algorithms in relation to their hyperparameter settings, in order to assess how robust they are to the choice of starting point, i.e., whether different initializations end up at different local solutions. We conduct exten...
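The kind of experiment described can be sketched as follows (our own illustration, with assumed dataset, model, and seed count, not the paper's protocol): rerun each optimizer from several random initializations and inspect the spread of the solutions it reaches.

    # Robustness sketch: same data, same model, different starting points.
    # A large std across seeds signals sensitivity to the initialization.
    import numpy as np
    from sklearn.datasets import load_digits
    from sklearn.model_selection import train_test_split
    from sklearn.neural_network import MLPClassifier

    X, y = load_digits(return_X_y=True)
    X_tr, X_val, y_tr, y_val = train_test_split(X, y, random_state=0)

    for solver in ("sgd", "adam"):             # optimizers under comparison
        scores = []
        for seed in range(5):                  # different starting points
            clf = MLPClassifier(hidden_layer_sizes=(64,), solver=solver,
                                max_iter=300, random_state=seed)
            clf.fit(X_tr, y_tr)
            scores.append(clf.score(X_val, y_val))
        print(f"{solver}: mean={np.mean(scores):.3f} std={np.std(scores):.3f}")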
August 16, 2018
Deep Learning has enabled remarkable progress over the last years on a variety of tasks, such as image recognition, speech recognition, and machine translation. One crucial aspect for this progress are novel neural architectures. Currently employed architectures have mostly been developed manually by human experts, which is a time-consuming and error-prone process. Because of this, there is growing interest in automated neural architecture search methods. We provide an overvi...
July 8, 2021
While neural architecture search methods have been successful in recent years and have led to new state-of-the-art performance on various problems, they have also been criticized for being unstable, for being highly sensitive to their hyperparameters, and for often not performing better than random search. To shed some light on this issue, we discuss some practical considerations that help improve their stability, efficiency, and overall performance.
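The random-search baseline referred to above is simple enough to sketch directly (our illustration, with an assumed search space and budget): sample architectures uniformly at random and keep the best one found.

    # Random-search baseline sketch: no learned search strategy at all,
    # just uniform sampling of depth and width under a fixed trial budget.
    import random
    from sklearn.datasets import load_digits
    from sklearn.model_selection import train_test_split
    from sklearn.neural_network import MLPClassifier

    X, y = load_digits(return_X_y=True)
    X_tr, X_val, y_tr, y_val = train_test_split(X, y, random_state=0)

    rng = random.Random(0)
    best_acc, best_arch = 0.0, None
    for _ in range(8):                              # search budget
        depth = rng.randint(1, 4)                   # sampled depth
        width = rng.choice([16, 32, 64, 128])       # sampled width
        clf = MLPClassifier(hidden_layer_sizes=(width,) * depth,
                            max_iter=200, random_state=0)
        clf.fit(X_tr, y_tr)
        acc = clf.score(X_val, y_val)
        if acc > best_acc:
            best_acc, best_arch = acc, (width,) * depth
    print("best architecture:", best_arch, "val acc:", round(best_acc, 3))

Any NAS method has to beat this baseline at comparable cost to justify its added complexity, which is what makes it such a persistent point of criticism.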
April 6, 2023
Automated machine learning (AutoML) methods improve upon existing models by optimizing various aspects of their design. While present methods focus on hyperparameters and neural network topologies, other aspects of neural network design can be optimized as well. To further the state of the art in AutoML, this dissertation introduces techniques for discovering more powerful activation functions and establishing more robust weight initialization for neural networks. These contr...
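To make the two design axes concrete, here is a toy sketch using well-known existing techniques (Swish-style activation, He-style initialization) as stand-ins; these are illustrations of the axes, not the functions or initializations the dissertation discovers.

    # Two searchable design axes: the activation's shape parameter and the
    # scaling rule used to initialize each layer's weights.
    import numpy as np

    def swish(x, beta=1.0):
        # parametric activation: beta is the searchable shape parameter
        return x / (1.0 + np.exp(-beta * x))

    def scaled_init(fan_in, fan_out, gain=2.0, rng=np.random.default_rng(0)):
        # He-style initialization: variance scaled by the layer's fan-in
        std = np.sqrt(gain / fan_in)
        return rng.normal(0.0, std, size=(fan_in, fan_out))

    W = scaled_init(784, 256)
    print(swish(np.array([-2.0, 0.0, 2.0]), beta=1.5))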