Browsing Open Access Repositories by Subject "policy search"

The Essential Dynamics Algorithm: Essential Results

Unknown author (2003-05-01)

This paper presents a novel algorithm for learning in a class of stochastic Markov decision processes (MDPs) with continuous state and action spaces that trades speed for accuracy. A transform of the stochastic MDP ...

Reinforcement Learning by Policy Search

Unknown author (2003-02-14)

One objective of artificial intelligence is to model the behavior of an intelligent agent interacting with its environment. The environment's transformations can be modeled as a Markov chain, whose state is partially ...