Similar papers
August 9, 2023
Machine-learning techniques are emerging as a valuable tool in experimental physics, and among them, reinforcement learning offers the potential to control high-dimensional, multistage processes in the presence of fluctuating environments. In this experimental work, we apply reinforcement learning to the preparation of an ultracold quantum gas to realize a consistent and large number of atoms at microkelvin temperatures. This reinforcement learning agent determines an optimal...
February 21, 2020
In this paper, we introduce a novel form of value function, $Q(s, s')$, that expresses the utility of transitioning from a state $s$ to a neighboring state $s'$ and then acting optimally thereafter. In order to derive an optimal policy, we develop a forward dynamics model that learns to make next-state predictions that maximize this value. This formulation decouples actions from values while still learning off-policy. We highlight the benefits of this approach in terms of val...
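As a rough illustration of this formulation (the notation below is a sketch, not taken verbatim from the paper), the state-transition value obeys a Bellman-style recursion, and the forward model proposes the next state of highest value:

$$ Q(s, s') = r(s, s') + \gamma \max_{s''} Q(s', s''), \qquad \hat{s}' = f_\theta(s) \approx \arg\max_{s'} Q(s, s'), $$

with an action then recovered that realizes the proposed transition $s \to \hat{s}'$ (for instance through an inverse dynamics model), which is one way the formulation keeps actions decoupled from values.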
July 10, 2019
In this paper, we present a Bayesian view on model-based reinforcement learning. We use expert knowledge to impose structure on the transition model and present an efficient learning scheme based on variational inference. This scheme is applied to a heteroskedastic and bimodal benchmark problem on which we compare our results to NFQ and show how our approach yields human-interpretable insight about the underlying dynamics while also increasing data-efficiency.
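A generic variational-inference objective for such a structured transition model is sketched below, with a latent variable $z$ standing in for, e.g., the bimodal mode; the symbols are illustrative rather than the paper's:

$$ \log p_\theta(s' \mid s, a) \;\ge\; \mathbb{E}_{q_\phi(z \mid s, a, s')}\!\left[ \log p_\theta(s' \mid s, a, z) \right] \;-\; \mathrm{KL}\!\left( q_\phi(z \mid s, a, s') \,\|\, p(z) \right), $$

i.e., an evidence lower bound (ELBO) maximized jointly over model parameters $\theta$ and variational parameters $\phi$.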
September 22, 2022
Free energy-based reinforcement learning (FERL) with clamped quantum Boltzmann machines (QBM) was shown to significantly improve learning efficiency compared to classical Q-learning, albeit restricted to discrete state-action space environments. In this paper, the FERL approach is extended to multi-dimensional continuous state-action space environments to open the door to a broader range of real-world applications. First, free energy-based Q-learning is stud...
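In free energy-based Q-learning, the clamped Boltzmann machine's free energy typically plays the role of the negative Q-value; a standard sketch of that correspondence, in illustrative notation:

$$ Q(s, a) \;\approx\; -F(s, a) \;=\; -\Bigl( \langle H \rangle_{s,a} - \tfrac{1}{\beta}\, S_{s,a} \Bigr), $$

where $\langle H \rangle_{s,a}$ is the average energy and $S_{s,a}$ the entropy of the hidden units with the visible units clamped to $(s, a)$, and $\beta$ is the inverse temperature; the temporal-difference update is then applied to $-F$ in place of a Q-table or Q-network.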
June 16, 2024
This paper is concerned with the design of algorithms based on systems of interacting particles to represent, approximate, and learn the optimal control law for reinforcement learning (RL). The primary contribution of the present paper is to show that convergence rates can be accelerated dramatically through careful design of interactions between particles. Theory focuses on the linear quadratic stochastic optimal control problem for which a complete and novel theory is prese...
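For reference, a standard statement of the linear quadratic stochastic optimal control problem the theory centers on (generic form, not the paper's exact notation):

$$ \mathrm{d}X_t = (A X_t + B U_t)\,\mathrm{d}t + \sigma\,\mathrm{d}W_t, \qquad \min_{U}\; \mathbb{E}\!\left[ \int_0^T \bigl( X_t^\top Q X_t + U_t^\top R U_t \bigr)\,\mathrm{d}t + X_T^\top Q_T X_T \right], $$

with state and control cost matrices $Q$ and $R$; the optimal control is linear state feedback, $U_t = -K_t X_t$, with $K_t$ given by a Riccati equation, and the interacting-particle systems are designed to approximate this law from data.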
September 15, 2019
In these proceedings, we present a library allowing for straightforward calls in C++ to jet grooming algorithms trained with deep reinforcement learning. The RL agent is trained with a reward function constructed to optimize the groomed jet properties, using both signal and background samples in a simultaneous multi-level training. We show that the grooming algorithm derived from the deep RL agent can match state-of-the-art techniques used at the Large Hadron Collider, result...
January 9, 2017
Deep Reinforcement Learning has enabled the learning of policies for complex tasks in partially observable environments, without explicitly learning the underlying model of the tasks. While such model-free methods achieve considerable performance, they often ignore the structure of the task. We present a natural representation of Reinforcement Learning (RL) problems using Recurrent Convolutional Neural Networks (RCNNs), to better exploit this inherent structure. We define 3 su...
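One common way such structure is exploited (a sketch of the general idea, not necessarily the paper's exact construction): when transitions are spatially local, the Bellman backup of value iteration can be implemented with convolution and max-pooling, which is what makes a recurrent convolutional formulation natural:

$$ V_{k+1}(s) \;=\; \max_{a} \Bigl[ R(s, a) + \gamma \sum_{s'} P(s' \mid s, a)\, V_k(s') \Bigr], $$

where the sum over $s'$ reduces to a convolution of $V_k$ with action-indexed transition kernels, and the iteration over $k$ supplies the recurrence.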
July 19, 2013
The goal of reinforcement learning (RL) is to let an agent learn an optimal control policy in an unknown environment so that future expected rewards are maximized. The model-free RL approach directly learns the policy based on data samples. Although using many samples tends to improve the accuracy of policy learning, collecting a large number of samples is often expensive in practice. On the other hand, the model-based RL approach first estimates the transition model of the e...
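The stated objective can be written compactly in standard notation, with discount factor $\gamma$:

$$ \pi^{*} \;=\; \arg\max_{\pi} \; \mathbb{E}_{\pi}\!\left[ \sum_{t=0}^{\infty} \gamma^{t}\, r_t \right], $$

where model-free methods estimate this return (or its gradient) directly from samples, while model-based methods first fit the transition dynamics and then plan or learn a policy against the fitted model.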
March 27, 2019
We propose deep reinforcement learning as a model-free method for exploring the landscape of string vacua. As a concrete application, we utilize an artificial intelligence agent known as an asynchronous advantage actor-critic to explore type IIA compactifications with intersecting D6-branes. As different string background configurations are explored by changing D6-brane configurations, the agent receives rewards and punishments related to string consistency conditions and pro...
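For context, the advantage actor-critic update that an A3C agent applies is, in its standard form (generic, not specific to the string-vacua setup):

$$ \nabla_{\theta} J(\theta) \;\approx\; \mathbb{E}\bigl[ \nabla_{\theta} \log \pi_{\theta}(a_t \mid s_t)\, A(s_t, a_t) \bigr], \qquad A(s_t, a_t) = R_t - V_{\phi}(s_t), $$

where $R_t$ is the sampled return and $V_{\phi}$ a learned value baseline, with asynchronous workers exploring copies of the environment in parallel and applying their gradients to shared parameters.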
May 10, 2018
Classical methods to control heating systems are often marred by suboptimal performance, an inability to adapt to dynamic conditions, and unreasonable assumptions, e.g., the existence of building models. This paper presents a novel deep reinforcement learning algorithm which can control space heating in buildings in a computationally efficient manner, and benchmarks it against other known techniques. The proposed algorithm outperforms rule-based control by 5-10% in a simulation...
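As a purely illustrative sketch (not the paper's algorithm), the trade-off such a controller optimizes, comfort deviation versus energy use, can be seen in a minimal tabular Q-learning loop on a toy one-zone thermal model; every constant and the dynamics below are hypothetical:

# Illustrative only: tabular Q-learning on a toy one-zone heating model.
# The paper uses a deep RL method; this sketch merely shows the reward
# structure (comfort deviation vs. energy cost) such a controller optimizes.
import random

ACTIONS = [0.0, 1.0]              # heater off / heater on (hypothetical)
SETPOINT = 21.0                   # desired indoor temperature, degrees C
ENERGY_COST = 0.1                 # penalty per unit of heating power
ALPHA, GAMMA, EPS = 0.1, 0.95, 0.1

def step(temp, action, outside=5.0):
    """Toy dynamics: heat gain from the heater, loss towards the outside."""
    new_temp = temp + 1.0 * action - 0.05 * (temp - outside)
    reward = -abs(new_temp - SETPOINT) - ENERGY_COST * action
    return new_temp, reward

def bucket(temp):
    """Coarse discretization of the continuous temperature state."""
    return int(round(temp))

Q = {}                            # (state_bucket, action_index) -> value
temp = 15.0
for _ in range(50_000):
    s = bucket(temp)
    if random.random() < EPS:     # epsilon-greedy exploration
        a = random.randrange(len(ACTIONS))
    else:
        a = max(range(len(ACTIONS)), key=lambda i: Q.get((s, i), 0.0))
    temp, r = step(temp, ACTIONS[a])
    s2 = bucket(temp)
    best_next = max(Q.get((s2, i), 0.0) for i in range(len(ACTIONS)))
    old = Q.get((s, a), 0.0)
    Q[(s, a)] = old + ALPHA * (r + GAMMA * best_next - old)

print(sorted((k, round(v, 2)) for k, v in Q.items())[:10])

A deep variant would replace the table with a neural network and the toy dynamics with a building simulator or measured telemetry, which is the setting the benchmarked algorithms operate in.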