Deep Imitation Learning with Memory for Robocup Soccer Simulation

  • Conference paper
  • First Online:
Engineering Applications of Neural Networks (EANN 2018)

Part of the book series: Communications in Computer and Information Science (CCIS, volume 893)

Abstract

Imitation learning is rapidly gaining attention due to its relevance to many autonomous agent applications. Providing demonstrations of effective behaviour to teach an agent is useful in real-world challenges such as sparse rewards and dynamic environments. However, most imitation learning approaches do not retain a memory of previous actions and treat the demonstrations as independent and identically distributed samples. This neglects the temporal dependency between low-level actions that are performed in sequence to achieve the desired behaviour. This paper proposes an imitation learning method that learns sequences of actions by utilizing memory in deep neural networks. Long short-term memory (LSTM) networks are used to capture the temporal dependencies in a teacher's demonstrations, so that past states and actions provide context for performing subsequent actions. The network is trained on raw low-level features and directly maps the input to low-level parametrized actions in real time. This minimizes the task-specific knowledge that must be manually engineered into the learning process compared to related approaches. The proposed methods are evaluated on a benchmark soccer simulator and compared to supervised learning and data-aggregation approaches. The results show that utilizing memory while learning significantly improves the performance and generalization of the agent and can provide a stationary policy that produces robust predictions at any point in the sequence.
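The setup the abstract describes, an LSTM policy that consumes a sequence of raw state features and emits a parametrized action (a discrete action type plus continuous parameters) at each step, can be sketched as below. This is a minimal illustrative forward pass under assumed conventions, not the paper's implementation: the layer sizes, the three action types, and the output heads `W_type` and `W_param` are hypothetical placeholders.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class LSTMCell:
    """Minimal LSTM cell (forward pass only), for illustration."""
    def __init__(self, n_in, n_hidden):
        self.n_hidden = n_hidden
        # One stacked weight matrix for the input, forget, output and
        # candidate gates, as in the standard LSTM formulation.
        self.W = rng.normal(0.0, 0.1, (4 * n_hidden, n_in + n_hidden))
        self.b = np.zeros(4 * n_hidden)

    def step(self, x, h, c):
        z = self.W @ np.concatenate([x, h]) + self.b
        i, f, o, g = np.split(z, 4)
        i, f, o = sigmoid(i), sigmoid(f), sigmoid(o)
        c = f * c + i * np.tanh(g)   # cell state carries past context
        h = o * np.tanh(c)
        return h, c

def policy(states, cell, W_type, W_param):
    """Map a sequence of raw state features to parametrized actions.

    The hidden state (h, c) is threaded through the sequence, so each
    prediction is conditioned on the preceding states and actions.
    """
    h = np.zeros(cell.n_hidden)
    c = np.zeros(cell.n_hidden)
    actions = []
    for x in states:
        h, c = cell.step(x, h, c)
        action_type = int(np.argmax(W_type @ h))  # e.g. dash / turn / kick
        params = np.tanh(W_param @ h)             # continuous parameters
        actions.append((action_type, params))
    return actions

# Hypothetical sizes: 8 raw features, 16 hidden units,
# 3 action types, 2 continuous parameters per action.
n_in, n_hidden, n_types, n_params = 8, 16, 3, 2
cell = LSTMCell(n_in, n_hidden)
W_type = rng.normal(0.0, 0.1, (n_types, n_hidden))
W_param = rng.normal(0.0, 0.1, (n_params, n_hidden))

seq = rng.normal(size=(5, n_in))  # 5 timesteps of raw features
actions = policy(seq, cell, W_type, W_param)
```

In training, the weights would be fitted by backpropagation through time against the teacher's demonstrated action sequences; the sketch only shows how the recurrent state lets each prediction depend on the history rather than on the current observation alone.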



Author information

Correspondence to Ahmed Hussein.


Copyright information

© 2018 Springer Nature Switzerland AG

About this paper


Cite this paper

Hussein, A., Elyan, E., Jayne, C. (2018). Deep Imitation Learning with Memory for Robocup Soccer Simulation. In: Pimenidis, E., Jayne, C. (eds) Engineering Applications of Neural Networks. EANN 2018. Communications in Computer and Information Science, vol 893. Springer, Cham. https://doi.org/10.1007/978-3-319-98204-5_3

  • DOI: https://doi.org/10.1007/978-3-319-98204-5_3

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-98203-8

  • Online ISBN: 978-3-319-98204-5

  • eBook Packages: Computer Science, Computer Science (R0)
