Abstract
Current point-based planning algorithms for solving partially observable Markov decision processes (POMDPs) have demonstrated that a good approximation of the value function can be derived by interpolation from the values of a specially selected set of points. The performance of these algorithms can be improved by eliminating unnecessary backups or by concentrating on more important points in the belief simplex. We study three methods designed to improve point-based value iteration algorithms. The first two methods are based on reachability analysis of the POMDP belief space: they prioritize beliefs according to how they are reached from the given initial belief state. The third method is motivated by the observation that the beliefs whose values are most overestimated or underestimated have a greater influence on the precision of the value function than other beliefs. We present an empirical evaluation illustrating how the performance of point-based value iteration (Pineau et al., 2003) varies with these approaches.
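The core operation these belief-selection methods accelerate is the point-based value backup, which improves a set of alpha vectors only at selected belief points. The following is a minimal sketch of that backup on a hypothetical two-state POMDP; the model (`T`, `Z`, `R`) and the belief set `B` are illustrative assumptions, not taken from the paper.

```python
# A minimal sketch of point-based value backups in the spirit of PBVI
# (Pineau et al., 2003). The two-state POMDP below is a made-up example.
GAMMA = 0.95

S = [0, 1]   # hidden states
A = [0, 1]   # actions
O = [0, 1]   # observations

# T[a][s][s']: transition probabilities P(s' | s, a)
T = [[[0.9, 0.1], [0.1, 0.9]],
     [[0.5, 0.5], [0.5, 0.5]]]
# Z[a][s'][o]: observation probabilities P(o | s', a)
Z = [[[0.8, 0.2], [0.2, 0.8]],
     [[0.5, 0.5], [0.5, 0.5]]]
# R[a][s]: immediate reward for taking a in s
R = [[1.0, 0.0], [0.0, 1.0]]

def dot(u, v):
    return sum(x * y for x, y in zip(u, v))

def backup(b, alphas):
    """Return the best backed-up alpha vector for belief point b."""
    best = None
    for a in A:
        g_a = list(R[a])
        for o in O:
            # g_{a,o}(s) = sum_{s'} T(s,a,s') Z(a,s',o) alpha(s'),
            # choosing the alpha vector that is maximal at b
            candidates = [
                [sum(T[a][s][sp] * Z[a][sp][o] * alpha[sp] for sp in S)
                 for s in S]
                for alpha in alphas
            ]
            g_best = max(candidates, key=lambda g: dot(g, b))
            g_a = [g_a[s] + GAMMA * g_best[s] for s in S]
        if best is None or dot(g_a, b) > dot(best, b):
            best = g_a
    return best

# Repeated sweeps of backups over a fixed belief set, as in PBVI.
B = [[1.0, 0.0], [0.5, 0.5], [0.0, 1.0]]
alphas = [[0.0, 0.0]]   # initial lower-bound value function
for _ in range(30):
    alphas = [backup(b, alphas) for b in B]

def value_at(b):
    return max(dot(alpha, b) for alpha in alphas)
```

The belief-selection question the paper studies is which points to put in `B` (and in what order to back them up): here `B` is a fixed uniform grid, whereas the paper's methods pick beliefs by reachability from the initial belief or by how badly their value is over- or underestimated.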
References
Cassandra, A.R., Littman, M.L., Kaelbling, L.P.: Incremental pruning: A simple, fast, exact method for partially observable Markov decision processes. In: Proceedings of UAI, pp. 54–61 (1997)
Izadi, M.T., Rajwade, A., Precup, D.: Using core beliefs for point-based value iteration. In: Proceedings of IJCAI, pp. 1751–1753 (2005)
Hauskrecht, M.: Value-function approximations for Partially Observable Markov Decision Processes. Journal of Artificial Intelligence Research 13, 33–94 (2000)
Pineau, J., Gordon, G., Thrun, S.: Point-based value iteration: An anytime algorithm for POMDPs. In: Proceedings of IJCAI, pp. 1025–1032 (2003)
Smith, T., Simmons, R.: Heuristic search value iteration for POMDPs. In: Proceedings of UAI, pp. 520–527 (2004)
Smith, T., Simmons, R.: Point-based POMDP algorithms: Improved analysis and implementation. In: Proceedings of UAI (2005)
Sondik, E.J.: The optimal control of partially observable Markov processes. Ph.D. thesis, Stanford University (1971)
Spaan, M.T.J., Vlassis, N.: Perseus: Randomized point-based value iteration for POMDPs. Journal of Artificial Intelligence Research 24, 195–220 (2005)
Zhang, N.L., Zhang, W.: Speeding up the convergence of value iteration in partially observable Markov decision processes. Journal of Artificial Intelligence Research 14, 2 (2001)
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
Cite this paper
Izadi, M.T., Precup, D., Azar, D. (2006). Belief Selection in Point-Based Planning Algorithms for POMDPs. In: Lamontagne, L., Marchand, M. (eds) Advances in Artificial Intelligence. Canadian AI 2006. Lecture Notes in Computer Science, vol 4013. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11766247_33
DOI: https://doi.org/10.1007/11766247_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-34628-9
Online ISBN: 978-3-540-34630-2
eBook Packages: Computer Science (R0)