Browsing Open Access Repositories by Subject "policy search"
-
The Essential Dynamics Algorithm: Essential Results
(2003-05-01) This paper presents a novel algorithm for learning in a class of stochastic Markov decision processes (MDPs) with continuous state and action spaces that trades speed for accuracy. A transform of the stochastic MDP ...
-
Reinforcement Learning by Policy Search
(2003-02-14) One objective of artificial intelligence is to model the behavior of an intelligent agent interacting with its environment. The environment's transformations can be modeled as a Markov chain, whose state is partially ...