Browsing Open Access Repositories by Subject "policy search"

Now showing items 1-2 of 2

  • The Essential Dynamics Algorithm: Essential Results 

    Unknown author (2003-05-01)
    This paper presents a novel algorithm for learning in a class of stochastic Markov decision processes (MDPs) with continuous state and action spaces that trades speed for accuracy. A transform of the stochastic MDP ...

  • Reinforcement Learning by Policy Search 

    Unknown author (2003-02-14)
    One objective of artificial intelligence is to model the behavior of an intelligent agent interacting with its environment. The environment's transformations can be modeled as a Markov chain, whose state is partially ...