April 11, 2023

81% Match

Evelyn Herberg

These lecture notes provide an overview of neural network architectures from a mathematical point of view. In particular, machine learning with neural networks is treated as an optimization problem. The notes cover an introduction to neural networks and the following architectures: Feedforward Neural Network, Convolutional Neural Network, ResNet, and Recurrent Neural Network.
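The optimization view described in this abstract can be illustrated with a minimal sketch (our own, not from the lecture notes): training a one-hidden-layer feedforward network means minimizing a loss over the network's weights by gradient descent.

```python
import numpy as np

# Minimal sketch of "learning as optimization": fit a tiny
# feedforward network x -> tanh(x W1) W2 to toy data by
# gradient descent on the mean squared error.
rng = np.random.default_rng(0)

X = rng.uniform(-1, 1, size=(64, 1))  # toy inputs
y = 2.0 * X                           # toy targets: y = 2x

W1 = rng.normal(0, 0.5, size=(1, 8))  # input -> hidden weights
W2 = rng.normal(0, 0.5, size=(8, 1))  # hidden -> output weights

def loss(W1, W2):
    """Mean squared error of the network on the toy data."""
    return np.mean((np.tanh(X @ W1) @ W2 - y) ** 2)

lr = 0.1
initial = loss(W1, W2)
for _ in range(500):
    # Forward pass
    h = np.tanh(X @ W1)
    err = h @ W2 - y
    # Backward pass: gradients of the mean squared error
    grad_pred = 2.0 * err / err.size
    gW2 = h.T @ grad_pred
    gW1 = X.T @ ((grad_pred @ W2.T) * (1 - h ** 2))
    # Gradient-descent step on the weights
    W1 -= lr * gW1
    W2 -= lr * gW2

final = loss(W1, W2)  # substantially below the initial loss
```

The architectures listed in the abstract differ in how the forward pass is structured; the optimization problem over the weights has the same shape in each case.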

May 3, 2024

80% Match

Corrado Coppola, Lorenzo Papa, Marco Boresta, ... , Laura Palagi

The paper investigates relevant computational issues of deep neural network architectures, with an eye to the interaction between the optimization algorithm and classification performance. In particular, we analyze the behaviour of state-of-the-art optimization algorithms in relation to their hyperparameter settings, in order to assess robustness with respect to the choice of starting point, which can lead to different local solutions. We conduct exten...

April 6, 2023

80% Match

Garrett Bingham

Automated machine learning (AutoML) methods improve upon existing models by optimizing various aspects of their design. While present methods focus on hyperparameters and neural network topologies, other aspects of neural network design can be optimized as well. To further the state of the art in AutoML, this dissertation introduces techniques for discovering more powerful activation functions and establishing more robust weight initialization for neural networks. These contr...

December 16, 2023

80% Match

Luis Balderas, Miguel Lastra, José M. Benítez

Deep learning models have been widely used during the last decade due to their outstanding learning and abstraction capacities. However, one of the main challenges any scientist faces when using deep learning models is establishing the network's architecture. Due to this difficulty, data scientists usually build overly complex models and, as a result, most of them are computationally intensive and impose a large memory footprint, generating huge costs and contributing to clima...

June 18, 2024

79% Match

Mark Potanin, Kirill Vayser, Vadim Strijov

Neural network structures have a critical impact on the accuracy and stability of forecasting. Neural architecture search procedures help design an optimal neural network according to some loss function, which represents a set of quality criteria. This paper investigates the problem of neural network structure optimization. It proposes a way to construct a loss function that contains a set of additive elements, each called a regularizer. Each regularizer corresponds to some ...
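The additive construction this abstract describes can be sketched as follows. This is a hypothetical illustration (the function and penalty names are ours, not the paper's): a total loss assembled from a data-fit term plus a set of weighted regularizers.

```python
# Hypothetical sketch of a loss with additive regularizer terms:
# total = data_fit(params) + sum_i weight_i * regularizer_i(params)
def total_loss(params, data_fit, regularizers, weights):
    """data_fit: callable(params) -> float.
    regularizers: list of callables(params) -> float.
    weights: one coefficient per regularizer."""
    return data_fit(params) + sum(
        w * r(params) for w, r in zip(weights, regularizers)
    )

# Example: squared-error fit of a scalar parameter to target 3.0,
# with L2 and L1 penalties as the two additive regularizers.
params = 2.5
fit = lambda w: (w - 3.0) ** 2
l2 = lambda w: w ** 2
l1 = lambda w: abs(w)

value = total_loss(params, fit, [l2, l1], [0.01, 0.001])
# value = 0.25 + 0.01 * 6.25 + 0.001 * 2.5 = 0.315
```

Each additive element penalizes one structural property independently, so quality criteria can be added or reweighted without changing the rest of the objective.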

July 8, 2022

79% Match

Olga Lukyanova, Oleg Nikitin, Alex Kunin

In this article, we propose an approach to the structural optimization of neural networks based on braid theory. The paper describes the basics of braid theory as applied to the description of the graph structures of neural networks. It is shown how networks of various topologies can be built using braid structures between the layers of a neural network. The operation of a neural network based on braid theory is compared with a homogeneous deep neural network and a network with ...

June 2, 2022

79% Match

Isak Potgieter, Christopher W. Cleghorn, Anna S. Bosman

This study investigates the use of local optima network (LON) analysis, a derivative of the fitness landscape of candidate solutions, to characterise and visualise the neural architecture space. The search space of feedforward neural network architectures with up to three layers, each with up to 10 neurons, is fully enumerated by evaluating trained model performance on a selection of data sets. Extracted LONs, while heterogeneous across data sets, all exhibit simple global st...

September 25, 2023

79% Match

Krzysztof Laddach, Rafał Łangowski, ... , Bartosz Puchalski

This paper addresses a problem related to the development of algorithms designed to find the structure of an artificial neural network used for behavioural (black-box) modelling of selected dynamic processes. The research includes four original proposals of algorithms dedicated to neural network architecture search. The algorithms are based on well-known optimisation techniques such as evolutionary algorithms and gradient descent methods. In the presented resea...

August 20, 2022

78% Match

Jun Yuan, Mengchen Liu, ... , Shixia Liu

Recent advances in artificial intelligence largely benefit from better neural network architectures. These architectures are a product of a costly process of trial-and-error. To ease this process, we develop ArchExplorer, a visual analysis method for understanding a neural architecture space and summarizing design principles. The key idea behind our method is to make the architecture space explainable by exploiting structural distances between architectures. We formulate the ...

March 4, 2024

78% Match

Mengfei Ji, Yuchun Chang, ... , Zaid Al-Ars

As machine learning (ML) algorithms get deployed in an ever-increasing number of applications, these algorithms need to achieve better trade-offs between high accuracy, high throughput and low latency. This paper introduces NASH, a novel approach that applies neural architecture search to machine learning hardware. Using NASH, hardware designs can achieve not only high throughput and low latency but also superior accuracy performance. We present four versions of the NASH stra...
