Skip to main content

Communication-Less Cooperative Q-Learning Agents in Maze Problem

  • Conference paper
  • First Online:
Intelligent and Evolutionary Systems

Part of the book series: Proceedings in Adaptation, Learning and Optimization ((PALO,volume 8))

Abstract

This paper introduces a reinforcement learning technique with an internal reward for a multi-agent cooperation task. The proposed method is an extension of Q-learning which changes the ordinary (external) reward to the internal reward for agent-cooperation under the condition of no communication. To increase the certainty of the proposed methods, we theoretically investigate what values should be set to select the goal for the cooperation among agents. In order to show the effectiveness of the proposed method, we conduct the intensive simulation on the maze problem for the agent-cooperation task, and confirm the following implications: (1) the proposed method successfully enable agents to acquire cooperative behaviors while a conventional method fails to always acquire such behaviors; (2) the cooperation among agents according to their internal rewards is achieved no communication; and (3) the condition for the cooperation among any number of agent is indicated.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Yong-Jae Kim Kui-Hong Park and Jong-Hwan Kim. Modular q-learning based multi-agent cooperation for robot soccer. Robotics and Autonomous System, pages 3026–3033, 2015.

    Google Scholar 

  2. Michael Camara, Oliver Bonham-Carter, and Janyl Jumadinova. A multi-agent system with reinforcement learning agents for biomedical text mining. In Proceedings of the 6th ACM Conference on Bioinformatics, Computational Biology and Health Informatics, BCB ’15, pages 634–643, New York, NY, USA, 2015. ACM.

    Google Scholar 

  3. H. Iima and Y. Kuroe. Swarm reinforcement learning methods improving certainty of learning for a multi-robot formation problem. CEC, pages 3026–3033, May 2015.

    Google Scholar 

  4. Y. Ichikawa and K. Takadama. Designing internal reward of reinforcement learning agents in multi-step dilemma problem. Journal of Computational Intelligence and Intelligent Informatics, JACIII, 17(6):926–931, 2013.

    Article  Google Scholar 

  5. M. Gini M. Elidrisi, N. Johnson and J. Crandall. Fast adaptive learning in repeated stochastic games by game abstraction. AAMAS, pages 1141–1148, May 2014.

    Google Scholar 

  6. Prabuchandran K. J., Hemanth Kumar A. N, and S. Bhatnagar. Multi-agent reinforcement learning for traffic signal control. In Intelligent Transportation Systems (ITSC), 2014 IEEE 17th International Conference on, pages 2529–2534, Oct 2014.

    Google Scholar 

  7. Katja Verbeeck Karl Tuyls and Tom Lenaerts. A selection-mutation model for q-learning in multi-agent systems. Robotics and Autonomous System, pages 3026–3033, May 2015.

    Google Scholar 

  8. Alessandro Lazaric Enrique Munoz de Cote and Marcello Restelli. Learning to cooperate in multi-agent social dilemmas. AAMAS, pages 783–785, May 2006.

    Google Scholar 

  9. Ming Tan. Multi-agent reinforcement learning: Independent vs. cooperative agents. In In Proceedings of the Tenth International Conference on Machine Learning, pages 330–337. Morgan Kaufmann, 1993.

    Google Scholar 

  10. N. Ono and K. Fukumoto, editors. Multi-agent reinforcement learning: A modular approach, 1996.

    Google Scholar 

  11. R.S. Sutton and A.G. Barto. Reinforcement Learning. Bradford Books/MIT Press, Cambridge, MA, 1998.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Fumito Uwano .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Uwano, F., Takadama, K. (2017). Communication-Less Cooperative Q-Learning Agents in Maze Problem. In: Leu, G., Singh, H., Elsayed, S. (eds) Intelligent and Evolutionary Systems. Proceedings in Adaptation, Learning and Optimization, vol 8. Springer, Cham. https://doi.org/10.1007/978-3-319-49049-6_33

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-49049-6_33

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-49048-9

  • Online ISBN: 978-3-319-49049-6

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics