aXi: Techniques in optimizing neural net...

Network Graph Based Neural Architecture Search

December 15, 2021

80% Match

Zhenhan Huang, Chunheng Jiang, ... , Gao Jianxi

Machine Learning

Neural architecture search enables automation of architecture design. Despite its success, it is computationally costly and does not provide an insight on how to design a desirable architecture. Here we propose a new way of searching neural network where we search neural architecture by rewiring the corresponding graph and predict the architecture performance by graph properties. Because we do not perform machine learning over the entire graph space and use predicted architec...

Find SimilarView on arXiv

A Survey on Neural Architecture Search

May 4, 2019

80% Match

Martin Wistuba, Ambrish Rawat, Tejaswini Pedapati

Machine Learning

Computer Vision and Pattern ...

Neural and Evolutionary Comp...

Machine Learning

The growing interest in both the automation of machine learning and deep learning has inevitably led to the development of a wide variety of automated methods for neural architecture search. The choice of the network architecture has proven to be critical, and many advances in deep learning spring from its immediate improvements. However, deep learning techniques are computationally intensive and their application requires a high level of domain knowledge. Therefore, even par...

Find SimilarView on arXiv

Practical recommendations for gradient-based training of deep architectures

June 24, 2012

80% Match

Yoshua Bengio

Machine Learning

Learning algorithms related to artificial neural networks and in particular for Deep Learning may seem to involve many bells and whistles, called hyper-parameters. This chapter is meant as a practical guide with recommendations for some of the most commonly used hyper-parameters, in particular in the context of learning algorithms based on back-propagated gradient and gradient-based optimization. It also discusses how to deal with the fact that more interesting results can be...

Find SimilarView on arXiv

DLOPT: Deep Learning Optimization Library

July 10, 2018

80% Match

Andrés Camero, Jamal Toutouh, Enrique Alba

Machine Learning

Neural and Evolutionary Comp...

Machine Learning

Deep learning hyper-parameter optimization is a tough task. Finding an appropriate network configuration is a key to success, however most of the times this labor is roughly done. In this work we introduce a novel library to tackle this problem, the Deep Learning Optimization Library: DLOPT. We briefly describe its architecture and present a set of use examples. This is an open source project developed under the GNU GPL v3 license and it is freely available at https://github....

Find SimilarView on arXiv

Optimizing Dense Feed-Forward Neural Networks

December 16, 2023

79% Match

Luis Balderas, Miguel Lastra, José M. Benítez

Machine Learning

Artificial Intelligence

Deep learning models have been widely used during the last decade due to their outstanding learning and abstraction capacities. However, one of the main challenges any scientist has to face using deep learning models is to establish the network's architecture. Due to this difficulty, data scientists usually build over complex models and, as a result, most of them result computationally intensive and impose a large memory footprint, generating huge costs, contributing to clima...

Find SimilarView on arXiv

Additive regularization schedule for neural architecture search

June 18, 2024

79% Match

Mark Potanin, Kirill Vayser, Vadim Strijov

Machine Learning

Neural network structures have a critical impact on the accuracy and stability of forecasting. Neural architecture search procedures help design an optimal neural network according to some loss function, which represents a set of quality criteria. This paper investigates the problem of neural network structure optimization. It proposes a way to construct a loss function, which contains a set of additive elements. Each element is called the regularizer. It corresponds to some ...

Find SimilarView on arXiv

Mining the Weights Knowledge for Optimizing Neural Network Structures

October 11, 2021

79% Match

Mengqiao Han, Xiabi Liu, ... , Duan Xin

Neural and Evolutionary Comp...

Machine Learning

Knowledge embedded in the weights of the artificial neural network can be used to improve the network structure, such as in network compression. However, the knowledge is set up by hand, which may not be very accurate, and relevant information may be overlooked. Inspired by how learning works in the mammalian brain, we mine the knowledge contained in the weights of the neural network toward automatic architecture learning in this paper. We introduce a switcher neural network ...

Find SimilarView on arXiv

Braid-based architecture search

July 8, 2022

79% Match

Olga Lukyanova, Oleg Nikitin, Alex Kunin

Machine Learning

Artificial Intelligence

Neural and Evolutionary Comp...

Geometric Topology

In this article, we propose the approach to structural optimization of neural networks, based on the braid theory. The paper describes the basics of braid theory as applied to the description of graph structures of neural networks. It is shown how networks of various topologies can be built using braid structures between layers of neural networks. The operation of a neural network based on the braid theory is compared with a homogeneous deep neural network and a network with ...

Find SimilarView on arXiv

Neural Networks Designing Neural Networks: Multi-Objective Hyper-Parameter Optimization

November 7, 2016

79% Match

Sean C. Smithson, Guang Yang, ... , Meyer Brett H.

Neural and Evolutionary Comp...

Machine Learning

Artificial neural networks have gone through a recent rise in popularity, achieving state-of-the-art results in various fields, including image classification, speech recognition, and automated control. Both the performance and computational complexity of such models are heavily dependant on the design of characteristic hyper-parameters (e.g., number of hidden layers, nodes per layer, or choice of activation functions), which have traditionally been optimized manually. With m...

Find SimilarView on arXiv

Combining Neural Architecture Search and Automatic Code Optimization: A Survey

August 7, 2024

79% Match

Inas Bachiri, Hadjer Benmeziane, Smail Niar, Riyadh Baghdadi, ... , Aries Abdelkrime

Machine Learning

Programming Languages

Deep Learning models have experienced exponential growth in complexity and resource demands in recent years. Accelerating these models for efficient execution on resource-constrained devices has become more crucial than ever. Two notable techniques employed to achieve this goal are Hardware-aware Neural Architecture Search (HW-NAS) and Automatic Code Optimization (ACO). HW-NAS automatically designs accurate yet hardware-friendly neural networks, while ACO involves searching f...

Find SimilarView on arXiv