Show simple item record

Efficient POMDP Forward Search by Predicting the Posterior Belief Distribution

dc.date.accessioned2009-09-28T21:00:15Z
dc.date.accessioned2018-11-26T22:26:06Z
dc.date.available2009-09-28T21:00:15Z
dc.date.available2018-11-26T22:26:06Z
dc.date.issued2009-09-23
dc.identifier.urihttp://hdl.handle.net/1721.1/46820
dc.identifier.urihttp://repository.aust.edu.ng/xmlui/handle/1721.1/46820
dc.description.abstractOnline, forward-search techniques have demonstrated promising results for solving problems in partially observable environments. These techniques depend on the ability to efficiently search and evaluate the set of beliefs reachable from the current belief. However, enumerating or sampling action-observation sequences to compute the reachable beliefs is computationally demanding; coupled with the need to satisfy real-time constraints, existing online solvers can only search to a limited depth. In this paper, we propose that policies can be generated directly from the distribution of the agent's posterior belief. When the underlying state distribution is Gaussian, and the observation function is an exponential family distribution, we can calculate this distribution of beliefs without enumerating the possible observations. This property not only enables us to plan in problems with large observation spaces, but also allows us to search deeper by considering policies composed of multi-step action sequences. We present the Posterior Belief Distribution (PBD) algorithm, an efficient forward-search POMDP planner for continuous domains, demonstrating that better policies are generated when we can perform deeper forward search.en_US
dc.format.extent12 p.en_US
dc.rightsCreative Commons Attribution-Noncommercial-No Derivative Works 3.0 Unporteden_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/
dc.titleEfficient POMDP Forward Search by Predicting the Posterior Belief Distributionen_US


Files in this item

FilesSizeFormatView
MIT-CSAIL-TR-2009-044.pdf329.5Kbapplication/pdfView/Open
MIT-CSAIL-TR-2009-044.ps1.805Mbapplication/postscriptView/Open

This item appears in the following Collection(s)

Show simple item record

Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 Unported
Except where otherwise noted, this item's license is described as Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 Unported