Communication-Less Cooperative Q-Learning Agents in Maze Problem

Uwano, Fumito; Takadama, Keiki

doi:10.1007/978-3-319-49049-6_33

Part of the book series: Proceedings in Adaptation, Learning and Optimization (PALO, volume 8)

Abstract

This paper introduces a reinforcement learning technique that uses an internal reward for a multi-agent cooperation task. The proposed method extends Q-learning by replacing the ordinary (external) reward with an internal reward, so that agents can cooperate under the condition of no communication. To increase the certainty of the proposed method, we theoretically investigate what values should be set so that agents select the goal needed for cooperation. To show the effectiveness of the proposed method, we conduct intensive simulations on a maze problem as the agent-cooperation task and confirm the following implications: (1) the proposed method successfully enables agents to acquire cooperative behaviors, whereas a conventional method does not always acquire such behaviors; (2) cooperation among agents according to their internal rewards is achieved without communication; and (3) the condition for cooperation among any number of agents is indicated.
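The idea of feeding an internal reward into an otherwise ordinary Q-learning update can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not give the paper's internal-reward rule, so `internal_reward`, its `assigned_goal` and `internal_value` parameters, and the goal labels are hypothetical placeholders, while `q_update` is the standard one-step tabular Q-learning update.

```python
from collections import defaultdict

ACTIONS = ("up", "down", "left", "right")


def q_update(q, state, action, reward, next_state, actions, alpha=0.1, gamma=0.9):
    """Standard one-step tabular Q-learning update; returns the new Q(state, action)."""
    best_next = max(q[(next_state, a)] for a in actions)
    q[(state, action)] += alpha * (reward + gamma * best_next - q[(state, action)])
    return q[(state, action)]


def internal_reward(external_reward, reached_goal, assigned_goal, internal_value=10.0):
    """Hypothetical placeholder, NOT the paper's rule.

    Away from any goal the external reward passes through unchanged. At a goal,
    the external reward is replaced by an internal value that is positive only
    for the goal this agent is (by assumption) meant to take.
    """
    if reached_goal is None:
        return external_reward
    return internal_value if reached_goal == assigned_goal else 0.0


# Each agent keeps its own Q-table and never exchanges messages with others.
q = defaultdict(float)
r = internal_reward(external_reward=1.0, reached_goal="G1", assigned_goal="G1")
new_value = q_update(q, state=(0, 0), action="right", reward=r,
                     next_state=(0, 1), actions=ACTIONS)
```

The only change relative to plain Q-learning is the value passed as `reward`; the update rule itself is untouched, which is what lets each agent learn independently.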

Author information

Authors and Affiliations

The University of Electro-Communications, 1-5-1 Chofugaoka Chofu, Tokyo, W6-309, Japan
Fumito Uwano & Keiki Takadama

Corresponding author

Correspondence to Fumito Uwano.

Editor information

Editors and Affiliations

School of Engineering and Information Technology, Australian Defence Force Academy, The University of New South Wales, Canberra, Australian Capital Territory, Australia
George Leu
School of Engineering and Information Technology, Australian Defence Force Academy, The University of New South Wales, Canberra, Australian Capital Territory, Australia
Hemant Kumar Singh
School of Engineering and Information Technology, Australian Defence Force Academy, The University of New South Wales, Canberra, Australian Capital Territory, Australia
Saber Elsayed

About this paper

Cite this paper

Uwano, F., Takadama, K. (2017). Communication-Less Cooperative Q-Learning Agents in Maze Problem. In: Leu, G., Singh, H., Elsayed, S. (eds) Intelligent and Evolutionary Systems. Proceedings in Adaptation, Learning and Optimization, vol 8. Springer, Cham. https://doi.org/10.1007/978-3-319-49049-6_33

DOI: https://doi.org/10.1007/978-3-319-49049-6_33
Published: 09 November 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-49048-9
Online ISBN: 978-3-319-49049-6
eBook Packages: Engineering, Engineering (R0)
