Abstract
An agent's complete reliance on trial-and-error to learn an optimal policy is a major reason reinforcement learning is slow and time consuming. Besides trial-and-error, humans can also exploit previously learned experience to plan and to accelerate subsequent learning. We propose an approach that models an agent's learning experience with a Bayesian network, which can then be used to shape the agent and bias its exploration towards the most promising regions of the state space, thereby reducing exploration and accelerating learning. Experimental results on the Grid-World problem show that our approach significantly improves the agent's performance and shortens learning time. More importantly, it enables the agent to exploit its own learning experience to plan and accelerate learning.
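The abstract only outlines the idea of biasing exploration with learned experience. The snippet below is a minimal, illustrative sketch (not the authors' algorithm): a tabular Q-learning agent on a small grid world whose exploratory actions are sampled in proportion to how "promising" the successor state looks, where the promise scores are simple pseudo-counts accumulated from successful trajectories, standing in for the paper's Bayesian-network experience model. All names and parameters here are assumptions for illustration.

```python
# Minimal sketch (assumption: simple success counts stand in for the
# Bayesian-network experience model described in the paper).
import random
from collections import defaultdict

SIZE, GOAL = 5, (4, 4)
ACTIONS = [(0, 1), (0, -1), (1, 0), (-1, 0)]

def step(state, action):
    # Deterministic grid-world transition, clipped to the grid boundary.
    x = min(max(state[0] + action[0], 0), SIZE - 1)
    y = min(max(state[1] + action[1], 0), SIZE - 1)
    nxt = (x, y)
    return nxt, (1.0 if nxt == GOAL else -0.01), nxt == GOAL

Q = defaultdict(float)                 # Q[(state, action)]
promise = defaultdict(lambda: 1.0)     # pseudo-count of how often a state led to the goal

def biased_explore(state):
    # Exploratory actions are drawn in proportion to the promise of their
    # successor states, rather than uniformly at random.
    weights = [promise[step(state, a)[0]] for a in ACTIONS]
    return random.choices(ACTIONS, weights=weights)[0]

alpha, gamma, eps = 0.5, 0.95, 0.2
for episode in range(200):
    state, trajectory = (0, 0), []
    for _ in range(100):
        if random.random() < eps:
            action = biased_explore(state)
        else:
            action = max(ACTIONS, key=lambda a: Q[(state, a)])
        nxt, reward, done = step(state, action)
        trajectory.append(state)
        best_next = max(Q[(nxt, a)] for a in ACTIONS)
        Q[(state, action)] += alpha * (reward + gamma * best_next - Q[(state, action)])
        state = nxt
        if done:
            # Reinforce the experience model along the successful trajectory,
            # so future exploration is drawn toward these states.
            for s in trajectory:
                promise[s] += 1.0
            break
```

In this toy version, the biased exploration plays the role the abstract attributes to the Bayesian network: steering the agent toward regions that past experience suggests are promising, so fewer episodes are spent on uniform random exploration.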
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
Cite this paper
Jin, Z., Jin, J., Song, J. (2011). Learning Form Experience: A Bayesian Network Based Reinforcement Learning Approach. In: Liu, B., Chai, C. (eds) Information Computing and Applications. ICICA 2011. Lecture Notes in Computer Science, vol 7030. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25255-6_52
DOI: https://doi.org/10.1007/978-3-642-25255-6_52
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25254-9
Online ISBN: 978-3-642-25255-6