aXi: Techniques in optimizing neural net...

Symmetry Breaking in Neural Network Optimization: Insights from Input Dimension Expansion

September 10, 2024

77% Match

Jun-Jie Zhang, Nan Cheng, Fu-Peng Li, Xiu-Cheng Wang, Jian-Nan Chen, ... , Meng Deyu

Machine Learning

Artificial Intelligence

Mathematical Physics

Understanding the mechanisms behind neural network optimization is crucial for improving network design and performance. While various optimization techniques have been developed, a comprehensive understanding of the underlying principles that govern these techniques remains elusive. Specifically, the role of symmetry breaking, a fundamental concept in physics, has not been fully explored in neural network optimization. This gap in knowledge limits our ability to design netwo...

Find SimilarView on arXiv

Training binary neural networks without floating point precision

October 19, 2023

77% Match

Federico Fontana

Machine Learning

Artificial Intelligence

Neural and Evolutionary Comp...

The main goal of this work is to improve the efficiency of training binary neural networks, which are low latency and low energy networks. The main contribution of this work is the proposal of two solutions comprised of topology changes and strategy training that allow the network to achieve near the state-of-the-art performance and efficient training. The time required for training and the memory required in the process are two factors that contribute to efficient training.

Find SimilarView on arXiv

DQNAS: Neural Architecture Search using Reinforcement Learning

January 17, 2023

77% Match

Anshumaan Chauhan, Siddhartha Bhattacharyya, S. Vadivel

Machine Learning

Artificial Intelligence

Convolutional Neural Networks have been used in a variety of image related applications after their rise in popularity due to ImageNet competition. Convolutional Neural Networks have shown remarkable results in applications including face recognition, moving target detection and tracking, classification of food based on the calorie content and many more. Designing of Convolutional Neural Networks requires experts having a cross domain knowledge and it is laborious, which requ...

Find SimilarView on arXiv

Neural Architecture Search: Insights from 1000 Papers

January 20, 2023

77% Match

Colin White, Mahmoud Safari, Rhea Sukthanker, Binxin Ru, Thomas Elsken, Arber Zela, ... , Hutter Frank

Machine Learning

Artificial Intelligence

Machine Learning

In the past decade, advances in deep learning have resulted in breakthroughs in a variety of areas, including computer vision, natural language understanding, speech recognition, and reinforcement learning. Specialized, high-performing neural architectures are crucial to the success of deep learning in these areas. Neural architecture search (NAS), the process of automating the design of neural architectures for a given task, is an inevitable next step in automating machine l...

Find SimilarView on arXiv

An Approach for Efficient Neural Architecture Search Space Definition

October 25, 2023

77% Match

Léo ESTACA'Lab Pouy, Fouad ESTACA'Lab Khenfri, Patrick ESTACA'Lab Leserf, ... , Larouci Cherif ESTACA'Lab

Machine Learning

Neural and Evolutionary Comp...

As we advance in the fast-growing era of Machine Learning, various new and more complex neural architectures are arising to tackle problem more efficiently. On the one hand their efficient usage requires advanced knowledge and expertise, which is most of the time difficult to find on the labor market. On the other hand, searching for an optimized neural architecture is a time-consuming task when it is performed manually using a trial and error approach. Hence, a method and a ...

Find SimilarView on arXiv

Topological derivative approach for deep neural network architecture adaptation

February 8, 2025

77% Match

C G Krishnanunni, Tan Bui-Thanh, Clint Dawson

Machine Learning

Artificial Intelligence

This work presents a novel algorithm for progressively adapting neural network architecture along the depth. In particular, we attempt to address the following questions in a mathematically principled way: i) Where to add a new capacity (layer) during the training process? ii) How to initialize the new capacity? At the heart of our approach are two key ingredients: i) the introduction of a ``shape functional" to be minimized, which depends on neural network topology, and ii) ...

Find SimilarView on arXiv

POPNASv2: An Efficient Multi-Objective Neural Architecture Search Technique

October 6, 2022

77% Match

Andrea Falanti, Eugenio Lomurno, Stefano Samele, ... , Matteucci Matteo

Machine Learning

Artificial Intelligence

Automating the research for the best neural network model is a task that has gained more and more relevance in the last few years. In this context, Neural Architecture Search (NAS) represents the most effective technique whose results rival the state of the art hand-crafted architectures. However, this approach requires a lot of computational capabilities as well as research time, which makes prohibitive its usage in many real-world scenarios. With its sequential model-based ...

Find SimilarView on arXiv

Exploring the Unexplored: Understanding the Impact of Layer Adjustments on Image Classification

January 25, 2024

77% Match

Haixia Liu, Tim Brailsford, James Goulding, ... , Bull Larry

Computer Vision and Pattern ...

This paper investigates how adjustments to deep learning architectures impact model performance in image classification. Small-scale experiments generate initial insights although the trends observed are not consistent with the entire dataset. Filtering operations in the image processing pipeline are crucial, with image filtering before pre-processing yielding better results. The choice and order of layers as well as filter placement significantly impact model performance. Th...

Find SimilarView on arXiv

Enhancing Deep Learning with Optimized Gradient Descent: Bridging Numerical Methods and Neural Network Training

September 7, 2024

77% Match

Yuhan Ma, Dan Sun, Erdi Gao, Ningjing Sang, ... , Huang Guanming

Machine Learning

Artificial Intelligence

Optimization theory serves as a pivotal scientific instrument for achieving optimal system performance, with its origins in economic applications to identify the best investment strategies for maximizing benefits. Over the centuries, from the geometric inquiries of ancient Greece to the calculus contributions by Newton and Leibniz, optimization theory has significantly advanced. The persistent work of scientists like Lagrange, Cauchy, and von Neumann has fortified its progres...

Find SimilarView on arXiv

Magnificent Minified Models

June 16, 2023

76% Match

Rich Harang, Hillary Sanders

Machine Learning

This paper concerns itself with the task of taking a large trained neural network and 'compressing' it to be smaller by deleting parameters or entire neurons, with minimal decreases in the resulting model accuracy. We compare various methods of parameter and neuron selection: dropout-based neuron damage estimation, neuron merging, absolute-value based selection, random selection, OBD (Optimal Brain Damage). We also compare a variation on the classic OBD method that slightly o...

Find SimilarView on arXiv