A Novel Information Measure for Predictive Learning in a Social System Setting

Di Prodi, Paolo; Porr, Bernd; Wörgötter, Florentin

doi:10.1007/978-3-642-15193-4_48

Paolo Di Prodi²¹,
Bernd Porr²¹ &
Florentin Wörgötter²²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6226))

Included in the following conference series:

International Conference on Simulation of Adaptive Behavior

1400 Accesses

Abstract

We introduce a new theoretical framework, based on Shannon’s communication theory and on Ashby’s law of requisite variety, suitable for artificial agents using predictive learning. The framework quantifies the performance constraints of a predictive adaptive controller as a function of its learning stage. In addition, we formulate a practical measure, based on information flow, that can be applied to adaptive controllers which use hebbian learning, input correlation learning (ICO/ISO) and temporal difference learning. The framework is also useful in quantifying the social division of tasks in a social group of honest, cooperative food foraging, communicating agents.

Simulations are in accordance with Luhmann, who suggested that adaptive agents self-organise by reducing the amount of sensory information or, equivalently, reducing the complexity of the perceived environment from the agents perspective.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Ashby, W.: An Introduction to Cybernetics. Chapman & Hall, Boca Raton (1956)
MATH Google Scholar
Ay, N., Bertschinger, N., Der, R., Güttler, F., Olbrich, E.: Predictive information and explorative behavior of autonomous robots. The European Physical Journal B 63(3), 329–339 (2008)
Article MATH MathSciNet Google Scholar
Polani, D., Ay, N.: Information ows in causal networks. Adv. Compl. Syst. (2007)
Google Scholar
Wörgötter, F., Porr, B.: Isotropic sequence order learning in a closed loop behavioural system. In: Roy. Soc. Phil. Trans. Mathematical, pp. 2225–2244
Google Scholar
Wörgötter, F., Porr, B.: Inside embodiment what means embodiment to radical constructivists? Kybernetes, 105–117 (2005)
Google Scholar
Booth, T.L.: Sequential Machines and Automata Theory, 1st edn. (1967)
Google Scholar
Braitenberg, V.: Vehicles: Experiments in synthetic psychology. MIT Press, Cambridge (1984)
Google Scholar
Zhang, J., Bi, D., Wang, G.L.: Novel learning feed-forward controller for accurate robot trajectory tracking. In: Wang, L., Chen, K., S. Ong, Y. (eds.) ICNC 2005. LNCS, vol. 3611, pp. 266–269. Springer, Heidelberg (2005)
Google Scholar
Nehaniv, C., Klyubin, A.S., Polani, D.: Empowerment: A universal agent-centric measure of control. Proceedings of the IEEE Congress on Evolutionary Computation 1, 128–135 (2005)
Article Google Scholar
Nehaniv, C.L., Klyubin, A.S., Polani, D.: Keep your options open: An information-based driving principle for the sensorimotor systems. In: PLoSOne, vol. 3 (2008)
Google Scholar
Polani, D., Klyubin, A.S., Nehaniv, C.L.: Organization of the information flow in the perception-action loop of evolved agents, pp. 177–180 (June 2004)
Google Scholar
Kolodziejski, C., Kulvicius, T.: On the analysis of differential hebbian learning in closed-loop behavioral systems. In: Frontiers in Computational Neuroscience. Conference Abstract: Bernstein Conference on Computational Neuroscience (2009)
Google Scholar
Luhmann, N.: Social Systems (1996)
Google Scholar
Bullwinkle, D., Lungarella, M., Pegors, T.: Methods for quantifying the information structure of sensory and motor data. Neuroinformatics 3, 243–262 (2005)
Google Scholar
Di Prodi, P., Porr, B., Wörgötter, F.: Adaptive communication promotes sub-system formation in a multi agent system with limited resources. In: LAB-RS 2008: Proceedings of the 2008 ECSIS Symposium on Learning and Adaptive Behaviors for Robotic Systems, pp. 89–96 (2008)
Google Scholar
Sporns, O., Pfeifer, R., Lungarella, M., Kuniyoshi, Y.: On the information theoretic implications of embodiment - principles and methods. In: Proc. of the 50th Anniversary Summit of Artificial Intelligence, pp. 76–86 (2008)
Google Scholar
Porr, B., Wörgötter, F.: Strongly improved stability and faster convergence of temporal sequence learning by utilising input correlations only. Neural Computation 18(6), 1380–1412 (2006)
Article MATH Google Scholar
Blahut, R.E.: Computation of channel capacity and rate distorsion functions. IEEE Trans. on Inform. Theory 18(4), 460–473 (1972)
Article MATH MathSciNet Google Scholar
Rouslton, M.S.: Estimating the errors on measured entropy and mutual information. Physica D, 285–294 (1999)
Google Scholar
Arimoto, S.: An algorithm for computing the capacity of arbitrary memoryless channels. IEEE Transactions on Information Theory 18(1), 14–20 (1972)
Article MATH MathSciNet Google Scholar
Stuart, B.: Nicholas minorsky and the automatic steering of ships. IEEE Control Systems Magazine 4(4) (1984)
Google Scholar
Sutton, A.G., Barto, R.S.: Reinforcement learning: An introduction. MIT Press, Cambridge (1998)
Google Scholar
Schreiber, T.: Measuring information transfer. Phys. Rev. Lett. 85, 461–464 (2000)
Article Google Scholar
Touchette, H., Lloyd, S.: Information-theoretic limits of control. Phys. Rev. Lett. 84(6), 1156–1159 (2000)
Article Google Scholar

Download references

Author information

Authors and Affiliations

University of Glasgow,
Paolo Di Prodi & Bernd Porr
BCCN Göttingen, Germany
Florentin Wörgötter

Authors

Paolo Di Prodi
View author publications
You can also search for this author in PubMed Google Scholar
Bernd Porr
View author publications
You can also search for this author in PubMed Google Scholar
Florentin Wörgötter
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

ISIR, Université Pierre et Marie Curie-Paris 6, 4 Place Jussieu, 75252, Paris cedex 05, France
Stéphane Doncieux , Benoît Girard , Agnès Guillot , Jean-Arcady Meyer & Jean-Baptiste Mouret , , , &
The Mærsk Mc-Kinney Møller Institute, University of Southern Denmark, Campusvej 55, 5230, Odense M, Denmark
John Hallam

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Di Prodi, P., Porr, B., Wörgötter, F. (2010). A Novel Information Measure for Predictive Learning in a Social System Setting. In: Doncieux, S., Girard, B., Guillot, A., Hallam, J., Meyer, JA., Mouret, JB. (eds) From Animals to Animats 11. SAB 2010. Lecture Notes in Computer Science(), vol 6226. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15193-4_48

Download citation

DOI: https://doi.org/10.1007/978-3-642-15193-4_48
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15192-7
Online ISBN: 978-3-642-15193-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics