July 18, 2023
Accurate yet efficient Deep Neural Networks (DNNs) are in high demand, especially for applications that require their execution on constrained edge devices. Finding such DNNs in a reasonable time for new applications requires automated optimization pipelines since the huge space of hyper-parameter combinations is impossible to explore exhaustively by hand. In this work, we propose PLiNIO, an open-source library implementing a comprehensive set of state-of-the-art DNN design au...
May 16, 2022
The topology of artificial neural networks has a significant effect on their performance. Characterizing efficient topologies is a promising field of research in Artificial Intelligence. However, it is not a trivial task, and it has mainly been studied through convolutional neural networks. We propose a hybrid model which combines the tensor calculus of feed-forward neural networks with Pseudo-Darwinian mechanisms. This allows for finding topologies that are well adapted for e...
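As a rough, hedged sketch of what a pseudo-Darwinian topology search can look like (the encoding, mutation operators, and fitness function below are illustrative assumptions, not the paper's actual mechanisms, which the truncated abstract does not specify), a minimal loop mutates layer widths and depths and keeps the fittest candidates:

```python
# Minimal sketch of a pseudo-Darwinian topology search (illustrative only).
# The fitness function, mutation rules, and population size are assumptions.
import random

def random_topology(max_depth=5):
    """A topology is just a list of hidden-layer widths."""
    depth = random.randint(1, max_depth)
    return [random.choice([16, 32, 64, 128]) for _ in range(depth)]

def mutate(topology):
    """Randomly grow, shrink, or resize one layer."""
    t = list(topology)
    op = random.choice(["add", "remove", "resize"])
    if op == "add":
        t.insert(random.randrange(len(t) + 1), random.choice([16, 32, 64, 128]))
    elif op == "remove" and len(t) > 1:
        t.pop(random.randrange(len(t)))
    else:
        t[random.randrange(len(t))] = random.choice([16, 32, 64, 128])
    return t

def fitness(topology):
    """Placeholder: reward small networks; a real run would train and
    evaluate each candidate network on the target task instead."""
    return -sum(topology)

def evolve(generations=20, population_size=16, survivors=4):
    population = [random_topology() for _ in range(population_size)]
    for _ in range(generations):
        population.sort(key=fitness, reverse=True)
        parents = population[:survivors]
        population = parents + [mutate(random.choice(parents))
                                for _ in range(population_size - survivors)]
    return max(population, key=fitness)

print(evolve())
```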
July 29, 2022
Deep neural networks (DNNs) have been widely used and play a major role in the field of computer vision and autonomous navigation. However, these DNNs are computationally complex, and their deployment on resource-constrained platforms is difficult without additional optimization and customization. In this manuscript, we present an overview of DNN architectures and propose methods to reduce their computational complexity in order to accelerate training and inference speeds to fit...
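The manuscript's specific optimizations are cut off in this excerpt; as a hedged stand-in, the sketch below shows one widely used complexity-reduction technique, unstructured magnitude pruning, with an assumed sparsity level and layer shape:

```python
# Illustrative sketch of one common complexity-reduction technique
# (magnitude pruning); the sparsity level and layer shape are assumptions.
import numpy as np

def magnitude_prune(weights, sparsity=0.7):
    """Zero out the smallest-magnitude weights so that roughly `sparsity`
    fraction of the entries are removed."""
    flat = np.abs(weights).ravel()
    k = int(sparsity * flat.size)
    threshold = np.partition(flat, k)[k] if k < flat.size else np.inf
    mask = np.abs(weights) >= threshold
    return weights * mask, mask

rng = np.random.default_rng(0)
w = rng.normal(size=(256, 128))          # a dense layer's weight matrix
w_pruned, mask = magnitude_prune(w, 0.7)
print(f"kept {mask.mean():.0%} of weights")
```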
February 27, 2024
Training a high-quality deep neural network requires choosing suitable hyperparameters, which is a non-trivial and expensive process. Current works try to automatically optimize hyperparameters or design principles for choosing them, so that they generalize to diverse unseen scenarios. However, most designs or optimization methods are agnostic to the choice of network structure and thus largely ignore the impact of neural architectures on hyperparameters. In this work, we precisely ...
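As a hedged illustration of the point that hyperparameters should not be agnostic to architecture (the scaling rule and constants below are assumptions chosen for illustration, not necessarily the rule this work derives), one simple example is tying each layer's learning rate to its width:

```python
# Illustrative only: one simple way a hyperparameter can depend on the
# architecture is to scale the per-layer learning rate with layer width.
# The base_lr, base_width, and inverse-width rule are assumptions.
def width_scaled_learning_rates(layer_widths, base_lr=1e-3, base_width=128):
    """Wider layers get proportionally smaller learning rates."""
    return {f"layer_{i}": base_lr * base_width / width
            for i, width in enumerate(layer_widths)}

print(width_scaled_learning_rates([128, 512, 2048]))
```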
January 9, 2025
We develop a pipeline to streamline neural architecture codesign for physics applications to reduce the need for ML expertise when designing models for novel tasks. Our method employs neural architecture search and network compression in a two-stage approach to discover hardware-efficient models. This approach consists of a global search stage that explores a wide range of architectures while considering hardware constraints, followed by a local search stage that fine-tunes a...
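A minimal sketch of the two-stage idea, global exploration under a hardware constraint followed by local refinement, is shown below; the search space, cost proxy, and accuracy proxy are illustrative assumptions, not the pipeline's actual components:

```python
# Minimal sketch of a two-stage search (global then local); the candidate
# encoding, cost model, and scoring function are illustrative assumptions.
import random

SEARCH_SPACE = {"depth": [2, 4, 6, 8], "width": [32, 64, 128, 256]}

def hardware_cost(cand):            # proxy for latency / resource usage
    return cand["depth"] * cand["width"]

def accuracy_proxy(cand):           # a real pipeline would train and evaluate
    return cand["depth"] * 0.1 + cand["width"] * 0.001

def global_search(n_samples=50, budget=512):
    """Stage 1: broad random sampling under a hardware budget."""
    candidates = [{k: random.choice(v) for k, v in SEARCH_SPACE.items()}
                  for _ in range(n_samples)]
    feasible = [c for c in candidates if hardware_cost(c) <= budget]
    return max(feasible, key=accuracy_proxy)

def local_search(cand, budget=512, steps=20):
    """Stage 2: small perturbations around the stage-1 winner."""
    best = dict(cand)
    for _ in range(steps):
        neighbor = dict(best)
        key = random.choice(list(SEARCH_SPACE))
        neighbor[key] = random.choice(SEARCH_SPACE[key])
        if hardware_cost(neighbor) <= budget and \
           accuracy_proxy(neighbor) > accuracy_proxy(best):
            best = neighbor
    return best

print(local_search(global_search()))
```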
January 7, 2022
We compare different neural network architectures for Machine Learning (ML) algorithms designed to identify the neutrino interaction vertex position in the MINERvA detector. The architectures developed and optimized by hand are compared with the architectures developed in an automated way using the package "Multi-node Evolutionary Neural Networks for Deep Learning" (MENNDL), developed at Oak Ridge National Laboratory (ORNL). The two architectures resulted in a similar perform...
April 11, 2023
The architecture of a deep neural network is defined explicitly in terms of the number of layers, the width of each layer and the general network topology. Existing optimisation frameworks neglect this information in favour of implicit architectural information (e.g. second-order methods) or architecture-agnostic distance functions (e.g. mirror descent). Meanwhile, the most popular optimiser in practice, Adam, is based on heuristics. This paper builds a new framework for deri...
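For reference, the heuristic Adam update that this abstract contrasts against can be written in a few lines; the sketch below uses the commonly cited default hyperparameters and is not the framework this paper derives:

```python
# Minimal NumPy sketch of the heuristic Adam update the abstract refers to;
# the hyperparameter defaults are the commonly used ones.
import numpy as np

def adam_step(w, grad, m, v, t, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update: bias-corrected moving averages of the gradient
    and its elementwise square drive a per-coordinate step size."""
    m = beta1 * m + (1 - beta1) * grad
    v = beta2 * v + (1 - beta2) * grad ** 2
    m_hat = m / (1 - beta1 ** t)
    v_hat = v / (1 - beta2 ** t)
    w = w - lr * m_hat / (np.sqrt(v_hat) + eps)
    return w, m, v

# Toy usage: minimize f(w) = ||w||^2, whose gradient is 2w.
w, m, v = np.ones(3), np.zeros(3), np.zeros(3)
for t in range(1, 101):
    w, m, v = adam_step(w, 2 * w, m, v, t)
print(w)
```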
May 30, 2024
Machine learning tasks are generally formulated as optimization problems, where one searches for an optimal function within a certain functional space. In practice, parameterized functional spaces are considered so that gradient descent can be performed. Typically, a neural network architecture is chosen and fixed, and its parameters (connection weights) are optimized, yielding an architecture-dependent result. This way of proceeding, however, forces the evolution of the...
February 25, 2022
Recent advances in Neural Architecture Search (NAS) which extract specialized hardware-aware configurations (a.k.a. "sub-networks") from a hardware-agnostic "super-network" have become increasingly popular. While considerable effort has been devoted to improving the first stage, namely the training of the super-network, the search for high-performing derived sub-networks is still largely under-explored. For example, some recent network morphism techniques allow a su...
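As a hedged sketch of the weight-sharing idea behind extracting sub-networks from a super-network (the layer sizes, width choices, and cost proxy are assumptions made for illustration, not any specific NAS method's), each sub-network below is obtained by slicing the shared weight matrices to a chosen width:

```python
# Illustrative sketch of extracting a sub-network from a shared-weight
# super-network by slicing each layer to a chosen width; the layer sizes
# and random selection are assumptions, not a real NAS search strategy.
import numpy as np

rng = np.random.default_rng(0)
# Super-network: each layer holds weights at the maximum width.
super_net = [rng.normal(size=(64, 64)) for _ in range(4)]
WIDTH_CHOICES = [16, 32, 48, 64]

def extract_subnet(super_net, widths):
    """Slice each shared weight matrix down to the requested width."""
    subnet, in_dim = [], super_net[0].shape[0]
    for layer, w_out in zip(super_net, widths):
        subnet.append(layer[:in_dim, :w_out])
        in_dim = w_out
    return subnet

def subnet_cost(widths):
    """Proxy hardware cost: total number of retained parameters."""
    dims = [64] + list(widths)
    return sum(a * b for a, b in zip(dims, dims[1:]))

config = [rng.choice(WIDTH_CHOICES) for _ in super_net]
subnet = extract_subnet(super_net, config)
print(config, subnet_cost(config))
```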
May 9, 2022
Calculating the most efficient schedule of work in a neural network compiler is a difficult task. There are many parameters to be accounted for that can positively or adversely affect that schedule depending on their configuration: how work is shared between distributed targets, how tensors are subdivided to fit in memory, whether individual optimizations are enabled, and so on. Traditionally, neural network compilers determine how to set these values by building a graph of choices an...
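Since the excerpt stops before the compiler's actual approach, the sketch below only illustrates the flavor of the problem: a handful of assumed scheduling knobs (work sharing, tile size, an optimization toggle) exhaustively enumerated against a toy cost model:

```python
# Illustrative sketch of searching over scheduling knobs with a cost model;
# the knob names and the cost function are assumptions, not a real compiler's.
from itertools import product

KNOBS = {
    "num_targets": [1, 2, 4],          # how work is shared across devices
    "tile_size":   [64, 128, 256],     # tensor subdivision to fit in memory
    "fuse_ops":    [False, True],      # toggling an optimization
}

def estimated_cost(cfg, workload=4096):
    """Toy cost model: smaller tiles add overhead, fusion saves memory
    traffic, and more targets divide the work but add synchronization."""
    compute = workload / cfg["num_targets"]
    overhead = workload / cfg["tile_size"] * 4 + cfg["num_targets"] * 32
    return (compute + overhead) * (0.8 if cfg["fuse_ops"] else 1.0)

configs = [dict(zip(KNOBS, values)) for values in product(*KNOBS.values())]
best = min(configs, key=estimated_cost)
print(best, estimated_cost(best))
```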