An Estimator Update Scheme for Large Teams of Learning Automata

Cuéllar, Manuel P.; Ros, María; Delgado, Miguel; Vila, Amparo

doi:10.1007/978-3-642-19934-9_26

Manuel P. Cuéllar⁵,
María Ros⁵,
Miguel Delgado⁵ &
…
Amparo Vila⁵

Part of the book series: Advances in Intelligent and Soft Computing ((AINSC,volume 91))

882 Accesses

Abstract

Learning Automata are stochastic decision-making machines that have been widely used in classification, control, and network routing, between others. Despite their versatility, one of the main drawbacks of these models is the low convergence rate of the learning rules used for the training. Estimator algorithms such as Pursuit schemes help to overcome this limitation, although they require a high computer memory cost for their operation. This fact becomes a serious inconvenient when a large set of learning automata collaborate in a team to solve a concrete task, since the memory requirements of these algorithms increases exponentially. In these cases, Pursuit algorithms are ineffective due to memory overflow.

In this work, we address this problem and we propose an estimator algorithm that can be used to train large teams of Learning Automata. The approach uses a similar strategy to Tabu Search algorithms to manage long and short term memory, in order to reduce the memory requirements. The method is applied in classic permutation problems as a test-bed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Agache, M., Oommen, B.J.: Generalized pursuit learning schemes: new families of continuous and discretized learning automata. IEEE Transactions on Systems, Man, and Cybernetics, Part B 32(6), 738–749 (2002)
Article Google Scholar
Baba, N., Mogami, Y.: A new learning algorithm for the hierarchical structure learning automata operating in the nonstationary s-model random environment. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics 32(6), 750–758 (2002)
Article Google Scholar
Baba, N., Mogami, Y.: A relative reward-strength algorithm for the hierarchical structure learning automata operating in the general nonstationary multiteacher environment. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics 36(4), 781–794 (2006)
Article Google Scholar
Beigy, H., Meybodi, M.R.: Utilizing distributed learning automata to solve stochastic shortest path problems. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems 14(5), 591–615 (2006)
Article MATH MathSciNet Google Scholar
Glover, F., Laguna, M.: Tabu search, pp. 70–150 (1993)
Google Scholar
Lukac, K., Lukac, Z., Tkalic, M.: Behaviour of f learning automata as multicriteria routing agents in connection oriented networks. In: The 12th IEEE International Conference on Fuzzy Systems, FUZZ 2003, vol. 1, pp. 296–301 (2003)
Google Scholar
Narendra, K.S., Thathachar, M.A.L.: Learning automata: an introduction. Prentice-Hall, Inc., Upper Saddle River (1989)
Google Scholar
Nowé, A., Verbeeck, K., Peeters, M.: Learning automata as a basis for multi agent reinforcement learning. In: Tuyls, K., ’t Hoen, P.J., Verbeeck, K., Sen, S. (eds.) LAMAS 2005. LNCS (LNAI), vol. 3898, pp. 71–85. Springer, Heidelberg (2006)
Chapter Google Scholar
Oommen, B., Agache, M.: A comparison of continuous and discretized pursuit learning schemes. In: Proceedings of 1999 IEEE International Conference on Systems, Man, and Cybernetics, SMC 1999, vol. 4, pp. 1061–1067 (1999)
Google Scholar
Oommen, B., Lanctot, J.: Discretized pursuit learning automata. IEEE Transactions on Systems, Man and Cybernetics 20(4), 931–938 (1990)
Article MATH MathSciNet Google Scholar
Oommen, B.J., Hashem, M.K.: Modeling a student-classroom interaction in a tutorial-like system using learning automata. Trans. Sys. Man Cyber. Part B 40(1), 29–42 (2010), http://dx.doi.org/10.1109/TSMCB.2009.2032414
Article Google Scholar
Peeters, M., Verbeeck, K., Nowé, A.: Solving multi-stage games with hierarchical learning automata that bootstrap. In: Tuyls, K., Nowe, A., Guessoum, Z., Kudenko, D. (eds.) ALAMAS 2005, ALAMAS 2006, and ALAMAS 2007. LNCS (LNAI), vol. 4865, pp. 169–187. Springer, Heidelberg (2008)
Chapter Google Scholar
Poznyak, A.S., Najim, K.: Learning Automata and Stochastic Optimization. Springer-Verlag New York, Inc., Secaucus (1997)
MATH Google Scholar
Sastry, P., Thathachar, M.: Learning automata algorithms for pattern classification. Sadhana 24(4-5), 261–292 (1999)
Article MATH MathSciNet Google Scholar
Thathachar, M.A.L., Arvind, M.T.: Parallel algorithms for modules of learning automata. IEEE Transactions on Systems, Man, and Cybernetics, Part B 28(1), 24–33 (1998)
Article Google Scholar
Thathachar, M.A.L., Ramakrishnan, K.R.: A cooperative game of a pair of learning automata. Automatica 20(6), 797–801 (1984)
Article MATH MathSciNet Google Scholar
Thathachar, M.A.L., Sastry, P.S.: Varieties of learning automata: an overview. IEEE Transactions on Systems, Man, and Cybernetics, Part B 32(6), 711–722 (2002)
Article Google Scholar
Torkestani, J., Meybodi, M.: Graph coloring problem based on learning automata. In: International Conference on Information Management and Engineering, ICIME 2009, pp. 718–722 (2009)
Google Scholar
Torkestani, J., Meybodi, M.: Solving the minimum spanning tree problem in stochastic graphs using learning automata. In: International Conference on Information Management and Engineering, ICIME 2009, pp. 643–647 (2009)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Artificial Intelligence E.T.S.I. Informática y de Telecomunicación, University of Granada, C/. Pdta. Daniel Saucedo Aranda s.n., Spain
Manuel P. Cuéllar, María Ros, Miguel Delgado & Amparo Vila

Authors

Manuel P. Cuéllar
View author publications
You can also search for this author in PubMed Google Scholar
María Ros
View author publications
You can also search for this author in PubMed Google Scholar
Miguel Delgado
View author publications
You can also search for this author in PubMed Google Scholar
Amparo Vila
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Machine Intelligence Research Labs (MIR Labs), Scientific Network for Innovation and Research Excellence (SNIRE), P.O. Box 2259, 98071-2259, Auburn, WA, USA
Ajith Abraham
Department of Computing Science and Control, Faculty of Science, University of Salamanca, Plaza de la Merced S/N, 37008, Salamanca, Spain
Juan M. Corchado
Department of Computing Science Faculty of Science, University of Salamanca, Plaza de la Merced S/N, 37008, Salamanca, Spain
Sara Rodríguez González & Juan F. De Paz Santana &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Cuéllar, M.P., Ros, M., Delgado, M., Vila, A. (2011). An Estimator Update Scheme for Large Teams of Learning Automata. In: Abraham, A., Corchado, J.M., González, S.R., De Paz Santana, J.F. (eds) International Symposium on Distributed Computing and Artificial Intelligence. Advances in Intelligent and Soft Computing, vol 91. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19934-9_26

Download citation

DOI: https://doi.org/10.1007/978-3-642-19934-9_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19933-2
Online ISBN: 978-3-642-19934-9
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics