September 26, 2024
The explosive development of deep learning has successfully automated feature extraction in machine learning. However, the structures and hyperparameters of deep neural network architectures also make a huge difference to performance across tasks. Exploring optimal structures and hyperparameters often involves a great deal of tedious human intervention. As a result, it is legitimate to ask for the automation of searching for optim...
June 2, 2022
This study investigates the use of local optima network (LON) analysis, a derivative of the fitness landscape of candidate solutions, to characterise and visualise the neural architecture space. The search space of feedforward neural network architectures with up to three layers, each with up to 10 neurons, is fully enumerated by evaluating trained model performance on a selection of data sets. Extracted LONs, while heterogeneous across data sets, all exhibit simple global st...
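The enumeration described above is small enough to reproduce directly: with one to three hidden layers of one to ten neurons each, there are 10 + 10² + 10³ = 1110 architectures. Below is a minimal sketch of such an exhaustive sweep using scikit-learn's MLPClassifier; the data set, training setup, and fitness measure are illustrative assumptions, not the study's exact protocol.

```python
from itertools import product
from sklearn.datasets import load_iris
from sklearn.model_selection import cross_val_score
from sklearn.neural_network import MLPClassifier

X, y = load_iris(return_X_y=True)

# Enumerate every feedforward architecture with 1-3 hidden layers,
# each holding 1-10 neurons: 10 + 10**2 + 10**3 = 1110 candidates.
architectures = [
    sizes
    for depth in (1, 2, 3)
    for sizes in product(range(1, 11), repeat=depth)
]

fitness = {}
for arch in architectures:  # full sweep; this takes a while
    clf = MLPClassifier(hidden_layer_sizes=arch, max_iter=500, random_state=0)
    # Mean cross-validated accuracy serves as the fitness of this point
    # in the architecture space.
    fitness[arch] = cross_val_score(clf, X, y, cv=3).mean()

best = max(fitness, key=fitness.get)
print(best, fitness[best])
```

From the per-architecture fitness values, local optima and the transition edges between them can then be extracted to form the LON.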
September 25, 2023
This paper addresses a problem related to the development of algorithms designed to find the structure of an artificial neural network used for behavioural (black-box) modelling of selected dynamic processes. The research includes four original algorithms dedicated to neural network architecture search, based on well-known optimisation techniques such as evolutionary algorithms and gradient descent methods. In the presented resea...
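As an illustration of the evolutionary branch of such algorithms, the sketch below evolves layer-size vectors with mutation and truncation selection. The encoding, operators, and fitness function are assumptions for illustration, not the paper's four algorithms.

```python
import random

def fitness(layers):
    # Placeholder: in practice, train a network with these layer sizes on
    # the dynamic-process data and return its negated validation error.
    return -sum((n - 6) ** 2 for n in layers)

def mutate(layers, max_neurons=10):
    # Nudge one randomly chosen layer size up or down by one neuron.
    child = list(layers)
    i = random.randrange(len(child))
    child[i] = max(1, min(max_neurons, child[i] + random.choice((-1, 1))))
    return tuple(child)

def evolve(pop_size=20, depth=3, generations=50):
    population = [tuple(random.randint(1, 10) for _ in range(depth))
                  for _ in range(pop_size)]
    for _ in range(generations):
        # Truncation selection: keep the better half, refill by mutation.
        population.sort(key=fitness, reverse=True)
        parents = population[:pop_size // 2]
        population = parents + [mutate(random.choice(parents))
                                for _ in range(pop_size - len(parents))]
    return max(population, key=fitness)

print(evolve())
```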
July 29, 2024
This study presents an innovative application of the Taguchi design-of-experiments method to optimize the structure of an Artificial Neural Network (ANN) model for predicting the elastic properties of short-fiber-reinforced composites. The main goal is to minimize the computational effort required for hyperparameter optimization while enhancing prediction accuracy. Within a robust design-of-experiments framework, the structure of the ANN model is optimized. This essent...
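The computational saving comes from testing only an orthogonal-array subset of hyperparameter combinations instead of a full grid. A minimal sketch with the standard L9 array over three 3-level factors follows; the factor names, levels, and scoring function are illustrative assumptions, not the paper's actual design.

```python
# Standard Taguchi L9 orthogonal array: 9 runs cover three 3-level factors,
# versus 3**3 = 27 runs for a full factorial grid.
L9 = [
    (0, 0, 0), (0, 1, 1), (0, 2, 2),
    (1, 0, 1), (1, 1, 2), (1, 2, 0),
    (2, 0, 2), (2, 1, 0), (2, 2, 1),
]

# Illustrative factor levels: neurons per hidden layer, learning rate,
# and batch size (assumed for this sketch).
levels = {
    "neurons": [16, 32, 64],
    "lr": [1e-3, 1e-2, 1e-1],
    "batch": [8, 16, 32],
}

def evaluate(neurons, lr, batch):
    # Placeholder standing in for ANN training on the composite data;
    # should return a prediction-error score (lower is better).
    return abs(neurons - 32) / 32 + abs(lr - 1e-2) + abs(batch - 16) / 16

scores = [evaluate(levels["neurons"][a], levels["lr"][b], levels["batch"][c])
          for a, b, c in L9]

# Main-effects analysis: mean response at each level of each factor;
# the level with the lowest mean is chosen for that factor.
for f, name in enumerate(levels):
    means = [sum(s for run, s in zip(L9, scores) if run[f] == lvl) / 3
             for lvl in range(3)]
    print(name, levels[name][min(range(3), key=means.__getitem__)])
```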
August 17, 2018
The performance of a feedforward neural network (FNN) fully depends upon the selection of architecture and training algorithm. FNN architecture can be tweaked using several parameters, such as the number of hidden layers, the number of hidden neurons in each hidden layer, and the number of connections between layers. There may be exponentially many combinations of these architectural attributes, which can be unmanageable manually, so an algorithm is required that can automatically design an ...
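To make the scale of the problem concrete, the sketch below counts the configurations in a modest search space and falls back on random sampling, one of the simplest automatic design strategies. The depth and width ranges are illustrative assumptions.

```python
import random

# Illustrative architecture space: 1-5 hidden layers, 1-100 neurons per
# layer. Layer-size assignments alone number
# sum(100**d for d in 1..5) ~ 1.01e10, far beyond manual exploration.
total = sum(100 ** d for d in range(1, 6))
print(f"{total:,} candidate architectures")

def sample_architecture():
    depth = random.randint(1, 5)
    return tuple(random.randint(1, 100) for _ in range(depth))

def random_search(evaluate, budget=50):
    # Evaluate a tiny sampled subset instead of the full space and keep
    # the best-scoring architecture seen so far.
    best, best_score = None, float("-inf")
    for _ in range(budget):
        arch = sample_architecture()
        score = evaluate(arch)
        if score > best_score:
            best, best_score = arch, score
    return best, best_score

# Toy usage with a stand-in objective; a real evaluate() would train and
# validate the FNN described by `arch`.
print(random_search(lambda arch: -abs(sum(arch) - 150)))
```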
June 24, 2021
Implementing embedded neural network processing at the edge requires efficient hardware acceleration that couples high computational performance with low power consumption. Driven by the rapid evolution of network architectures and their algorithmic features, accelerator designs are constantly updated and improved. To evaluate and compare hardware design choices, designers can refer to a myriad of accelerator implementations in the literature. Surveys provide an overview of t...
July 28, 2021
A steady increase in architectural complexity and data set size has led to the development of strategies for reducing the time and space requirements of deep neural network (DNN) training and inference. This is of particular importance in scenarios where training takes place in resource-constrained computation environments or inference is part of a time-critical application. In this survey, we aim to provide a general overview and catego...
September 25, 2019
In this paper, we describe the hyper-parameter search problem in the field of machine learning and present a heuristic approach in an attempt to tackle it. In most learning algorithms, a set of hyper-parameters must be determined before training commences. The choice of hyper-parameters can significantly affect the final model's performance, yet determining a good choice is in most cases complex and consumes a large amount of computing resources. In this...
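One widely used heuristic of this family is successive halving: spend little compute on many configurations, then progressively concentrate the budget on the survivors. The paper's specific heuristic is not visible in the excerpt, so the sketch below is an illustrative stand-in with an assumed two-dimensional search space.

```python
import random

def successive_halving(sample_config, evaluate, n=27, min_budget=1, eta=3):
    """Evaluate n random configs cheaply, keep the best 1/eta, and repeat
    with eta times the budget until one configuration remains."""
    configs = [sample_config() for _ in range(n)]
    budget = min_budget
    while len(configs) > 1:
        scored = sorted(configs, key=lambda c: evaluate(c, budget),
                        reverse=True)
        configs = scored[: max(1, len(configs) // eta)]
        budget *= eta
    return configs[0]

# Toy usage: evaluate() would normally train for `budget` epochs and
# return validation accuracy; here a stand-in objective is used.
best = successive_halving(
    sample_config=lambda: {"lr": 10 ** random.uniform(-4, -1),
                           "dropout": random.uniform(0.0, 0.5)},
    evaluate=lambda cfg, budget: (-abs(cfg["lr"] - 1e-2)
                                  - abs(cfg["dropout"] - 0.2)),
)
print(best)
```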
June 17, 2018
The holy grail of deep learning is to come up with an automatic method to design optimal architectures for different applications. In other words, how can we effectively dimension and organize neurons along the network layers based on the computational resources, input size, and amount of training data? We outline promising research directions based on polyhedral theory and mixed-integer representability that may offer an analytical approach to this question, in contrast to t...
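A concrete link between the two ideas is that a ReLU unit is mixed-integer representable: for a pre-activation x bounded in [-M, M], the output y = max(0, x) admits a standard big-M encoding with one binary variable z (a textbook formulation, not necessarily the paper's):

```latex
y \ge 0, \qquad
y \ge x, \qquad
y \le x + M(1 - z), \qquad
y \le M z, \qquad
z \in \{0, 1\}.
```

When z = 1 the constraints force y = x (and x ≥ 0); when z = 0 they force y = 0 (and x ≤ 0). Stacking such encodings layer by layer lets network-level dimensioning questions be posed over the polyhedral pieces of the ReLU function.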
August 20, 2022
Recent advances in artificial intelligence largely benefit from better neural network architectures. These architectures are a product of a costly process of trial-and-error. To ease this process, we develop ArchExplorer, a visual analysis method for understanding a neural architecture space and summarizing design principles. The key idea behind our method is to make the architecture space explainable by exploiting structural distances between architectures. We formulate the ...
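As a minimal illustration of a structural distance between architectures, one can represent each architecture as a directed computation graph and compare edge sets. The Jaccard-style measure below is an assumption for illustration; the excerpt does not specify ArchExplorer's actual distance.

```python
# Each architecture as a set of labelled edges (source_op, target_op) in
# its computation graph; two tiny cell-style examples (hypothetical).
arch_a = {("input", "conv3x3"), ("conv3x3", "relu"), ("relu", "output")}
arch_b = {("input", "conv5x5"), ("conv5x5", "relu"), ("relu", "output")}

def structural_distance(g1, g2):
    """Jaccard distance between edge sets: 0 for identical graphs,
    1 for graphs sharing no edges."""
    union = g1 | g2
    return 1 - len(g1 & g2) / len(union) if union else 0.0

print(structural_distance(arch_a, arch_b))  # 0.8: one operator swapped
```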