Learning a Machine for the Decision in a...

A Markovian Model for Learning-to-Optimize

August 21, 2024

85% Match

Michael Sucker, Peter Ochs

Machine Learning

Probability

We present a probabilistic model for stochastic iterative algorithms with the use case of optimization algorithms in mind. Based on this model, we present PAC-Bayesian generalization bounds for functions that are defined on the trajectory of the learned algorithm, for example, the expected (non-asymptotic) convergence rate and the expected time to reach the stopping criterion. Thus, not only does this model allow for learning stochastic algorithms based on their empirical per...

Find SimilarView on arXiv

Learning Hidden Markov Models for Regression using Path Aggregation

June 13, 2012

85% Match

Keith Noto, Mark Craven

Machine Learning

Computational Engineering, F...

Quantitative Methods

We consider the task of learning mappings from sequential data to real-valued responses. We present and evaluate an approach to learning a type of hidden Markov model (HMM) for regression. The learning process involves inferring the structure and parameters of a conventional HMM, while simultaneously learning a regression model that maps features that characterize paths through the model to continuous responses. Our results, in both synthetic and biological domains, demonstra...

Find SimilarView on arXiv

An Algorithm for Pattern Discovery in Time Series

October 29, 2002

85% Match

Cosma Rohilla Shalizi, Kristina Lisa Shalizi, James P. Crutchfield

Machine Learning

Computation and Language

We present a new algorithm for discovering patterns in time series and other sequential data. We exhibit a reliable procedure for building the minimal set of hidden, Markovian states that is statistically capable of producing the behavior exhibited in the data -- the underlying process's causal states. Unlike conventional methods for fitting hidden Markov models (HMMs) to data, our algorithm makes no assumptions about the process's causal architecture (the number of hidden st...

Find SimilarView on arXiv

Online Baum-Welch algorithm for Hierarchical Imitation Learning

March 22, 2021

85% Match

Vittorio Giammarino, Ioannis Ch. Paschalidis

Machine Learning

Systems and Control

Machine Learning

The options framework for hierarchical reinforcement learning has increased its popularity in recent years and has made improvements in tackling the scalability problem in reinforcement learning. Yet, most of these recent successes are linked with a proper options initialization or discovery. When an expert is available, the options discovery problem can be addressed by learning an options-type hierarchical policy directly from expert demonstrations. This problem is referred ...

Find SimilarView on arXiv

POMDPs in Continuous Time and Discrete Spaces

October 2, 2020

85% Match

Bastian Alt, Matthias Schultheis, Heinz Koeppl

Machine Learning

Systems and Control

Optimization and Control

Machine Learning

Many processes, such as discrete event systems in engineering or population dynamics in biology, evolve in discrete space and continuous time. We consider the problem of optimal decision making in such discrete state and action space systems under partial observability. This places our work at the intersection of optimal filtering and optimal control. At the current state of research, a mathematical description for simultaneous decision making and filtering in continuous time...

Find SimilarView on arXiv

Learning in Observable POMDPs, without Computationally Intractable Oracles

June 7, 2022

85% Match

Noah Golowich, Ankur Moitra, Dhruv Rohatgi

Machine Learning

Artificial Intelligence

Data Structures and Algorith...

Optimization and Control

Machine Learning

Much of reinforcement learning theory is built on top of oracles that are computationally hard to implement. Specifically for learning near-optimal policies in Partially Observable Markov Decision Processes (POMDPs), existing algorithms either need to make strong assumptions about the model dynamics (e.g. deterministic transitions) or assume access to an oracle for solving a hard optimistic planning or estimation problem as a subroutine. In this work we develop the first orac...

Find SimilarView on arXiv

Inverse Reinforcement Learning Under Noisy Observations

October 27, 2017

85% Match

Shervin Shahryari, Prashant Doshi

Robotics

Artificial Intelligence

Machine Learning

We consider the problem of performing inverse reinforcement learning when the trajectory of the expert is not perfectly observed by the learner. Instead, a noisy continuous-time observation of the trajectory is provided to the learner. This problem exhibits wide-ranging applications and the specific application we consider here is the scenario in which the learner seeks to penetrate a perimeter patrolled by a robot. The learner's field of view is limited due to which it canno...

Find SimilarView on arXiv

Diffusion of Context and Credit Information in Markovian Models

October 1, 1995

85% Match

Y. Bengio, P. Frasconi

Artificial Intelligence

This paper studies the problem of ergodicity of transition probability matrices in Markovian models, such as hidden Markov models (HMMs), and how it makes very difficult the task of learning to represent long-term context for sequential data. This phenomenon hurts the forward propagation of long-term context information, as well as learning a hidden state representation to represent long-term context, which depends on propagating credit information backwards in time. Using re...

Find SimilarView on arXiv

Variational Bayesian Inference for Hidden Markov Models With Multivariate Gaussian Output Distributions

May 27, 2016

85% Match

Christian Gruhl, Bernhard Sick

Machine Learning

Hidden Markov Models (HMM) have been used for several years in many time series analysis or pattern recognitions tasks. HMM are often trained by means of the Baum-Welch algorithm which can be seen as a special variant of an expectation maximization (EM) algorithm. Second-order training techniques such as Variational Bayesian Inference (VI) for probabilistic models regard the parameters of the probabilistic models as random variables and define distributions over these distrib...

Find SimilarView on arXiv

Optimistic MLE -- A Generic Model-based Algorithm for Partially Observable Sequential Decision Making

September 29, 2022

85% Match

Qinghua Liu, Praneeth Netrapalli, ... , Jin Chi

Machine Learning

Artificial Intelligence

Machine Learning

This paper introduces a simple efficient learning algorithms for general sequential decision making. The algorithm combines Optimism for exploration with Maximum Likelihood Estimation for model estimation, which is thus named OMLE. We prove that OMLE learns the near-optimal policies of an enormously rich class of sequential decision making problems in a polynomial number of samples. This rich class includes not only a majority of known tractable model-based Reinforcement Lear...

Find SimilarView on arXiv

Learning a Machine for the Decision in a Partially Observable Markov Universe

A Markovian Model for Learning-to-Optimize

Learning Hidden Markov Models for Regression using Path Aggregation

An Algorithm for Pattern Discovery in Time Series

Online Baum-Welch algorithm for Hierarchical Imitation Learning

POMDPs in Continuous Time and Discrete Spaces

Learning in Observable POMDPs, without Computationally Intractable Oracles

Inverse Reinforcement Learning Under Noisy Observations

Diffusion of Context and Credit Information in Markovian Models

Variational Bayesian Inference for Hidden Markov Models With Multivariate Gaussian Output Distributions

Optimistic MLE -- A Generic Model-based Algorithm for Partially Observable Sequential Decision Making