aXi: Techniques in optimizing neural net...

Quantized Neural Networks: Characterization and Holistic Optimization

May 31, 2020

78% Match

Yoonho Boo, Sungho Shin, Wonyong Sung

Machine Learning

Quantized deep neural networks (QDNNs) are necessary for low-power, high throughput, and embedded applications. Previous studies mostly focused on developing optimization methods for the quantization of given models. However, quantization sensitivity depends on the model architecture. Therefore, the model selection needs to be a part of the QDNN design process. Also, the characteristics of weight and activation quantization are quite different. This study proposes a holistic ...

Find SimilarView on arXiv

Neuroevolution of Neural Network Architectures Using CoDeepNEAT and Keras

February 11, 2020

78% Match

Jonas da Silveira Bohrer, Bruno Iochins Grisci, Marcio Dorn

Neural and Evolutionary Comp...

Computer Vision and Pattern ...

Machine Learning

Machine learning is a huge field of study in computer science and statistics dedicated to the execution of computational tasks through algorithms that do not require explicit instructions but instead rely on learning patterns from data samples to automate inferences. A large portion of the work involved in a machine learning project is to define the best type of algorithm to solve a given problem. Neural networks - especially deep neural networks - are the predominant type of...

Find SimilarView on arXiv

Controlling Model Complexity in Probabilistic Model-Based Dynamic Optimization of Neural Network Structures

July 15, 2019

78% Match

Shota Saito, Shinichi Shirakawa

Neural and Evolutionary Comp...

Machine Learning

A method of simultaneously optimizing both the structure of neural networks and the connection weights in a single training loop can reduce the enormous computational cost of neural architecture search. We focus on the probabilistic model-based dynamic neural network structure optimization that considers the probability distribution of structure parameters and simultaneously optimizes both the distribution parameters and connection weights based on gradient methods. Since the...

Find SimilarView on arXiv

Learning by Turning: Neural Architecture Aware Optimisation

February 14, 2021

78% Match

Yang Liu, Jeremy Bernstein, ... , Yue Yisong

Neural and Evolutionary Comp...

Machine Learning

Descent methods for deep networks are notoriously capricious: they require careful tuning of step size, momentum and weight decay, and which method will work best on a new benchmark is a priori unclear. To address this problem, this paper conducts a combined study of neural architecture and optimisation, leading to a new optimiser called Nero: the neuronal rotator. Nero trains reliably without momentum or weight decay, works in situations where Adam and SGD fail, and requires...

Find SimilarView on arXiv

An optimal scheduling architecture for accelerating batch algorithms on Neural Network processor architectures

February 14, 2020

78% Match

Phani Kumar Nyshadham, Mohit Sinha, ... , Vijay H S

Performance

Machine Learning

In neural network topologies, algorithms are running on batches of data tensors. The batches of data are typically scheduled onto the computing cores which execute in parallel. For the algorithms running on batches of data, an optimal batch scheduling architecture is very much needed by suitably utilizing hardware resources - thereby resulting in significant reduction training and inference time. In this paper, we propose to accelerate the batch algorithms for neural networks...

Find SimilarView on arXiv

Neural Networks with A La Carte Selection of Activation Functions

June 24, 2022

78% Match

Moshe Sipper

Neural and Evolutionary Comp...

Machine Learning

Activation functions (AFs), which are pivotal to the success (or failure) of a neural network, have received increased attention in recent years, with researchers seeking to design novel AFs that improve some aspect of network performance. In this paper we take another direction, wherein we combine a slew of known AFs into successful architectures, proposing three methods to do so beneficially: 1) generate AF architectures at random, 2) use Optuna, an automatic hyper-paramete...

Find SimilarView on arXiv

On the use of Artificial Neural Networks in Topology Optimisation

August 4, 2022

78% Match

Rebekka V. Woldseth, Niels Aage, ... , Sigmund Ole

Computational Engineering, F...

The question of how methods from the field of artificial intelligence can help improve the conventional frameworks for topology optimisation has received increasing attention over the last few years. Motivated by the capabilities of neural networks in image analysis, different model-variations aimed at obtaining iteration-free topology optimisation have been proposed with varying success. Other works focused on speed-up through replacing expensive optimisers and state solvers...

Find SimilarView on arXiv

Designing Network Design Strategies Through Gradient Path Analysis

November 9, 2022

78% Match

Chien-Yao Wang, Hong-Yuan Mark Liao, I-Hau Yeh

Computer Vision and Pattern ...

Designing a high-efficiency and high-quality expressive network architecture has always been the most important research topic in the field of deep learning. Most of today's network design strategies focus on how to integrate features extracted from different layers, and how to design computing units to effectively extract these features, thereby enhancing the expressiveness of the network. This paper proposes a new network design strategy, i.e., to design the network archite...

Find SimilarView on arXiv

Permutative redundancy and uncertainty of the objective in deep learning

November 11, 2024

78% Match

Vacslav Glukhov

Artificial Intelligence

Implications of uncertain objective functions and permutative symmetry of traditional deep learning architectures are discussed. It is shown that traditional architectures are polluted by an astronomical number of equivalent global and local optima. Uncertainty of the objective makes local optima unattainable, and, as the size of the network grows, the global optimization landscape likely becomes a tangled web of valleys and ridges. Some remedies which reduce or eliminate gho...

Find SimilarView on arXiv

Design Automation for Efficient Deep Learning Computing

April 24, 2019

78% Match

Song Han, Han Cai, Ligeng Zhu, Ji Lin, Kuan Wang, ... , Lin Yujun

Machine Learning

Efficient deep learning computing requires algorithm and hardware co-design to enable specialization: we usually need to change the algorithm to reduce memory footprint and improve energy efficiency. However, the extra degree of freedom from the algorithm makes the design space much larger: it's not only about designing the hardware but also about how to tweak the algorithm to best fit the hardware. Human engineers can hardly exhaust the design space by heuristics. It's labor...

Find SimilarView on arXiv