Learning and Tacit Collusion by Artificial Agents in Cournot Duopoly Games

Kimbrough, Steven O.; Lu, Ming; Murphy, Frederic

doi:10.1007/3-540-26989-4_19

Learning and Tacit Collusion by Artificial Agents in Cournot Duopoly Games

Steven O. Kimbrough³,
Ming Lu⁴ &
Frederic Murphy⁵

Chapter

553 Accesses
5 Citations

Part of the book series: International Handbooks on Information Systems ((INFOSYS))

Abstract

We examine learning by artificial agents in repeated play of Cournot duopoly games. Our learning model is simple and cognitively realistic. The model departs from standard reinforcement learning models, as applied to agents in games, in that it credits the agent with a form of conceptual ascent, whereby the agent is able to learn from a consideration set of strategies spanning more than one period of play. The resulting behavior is markedly different from behavior predicted by classical economics for the single-shot (unrepeated) Cournot duopoly game. In repeated play under our learning regime, agents are able to arrive at a tacit form of collusion and set production levels near to those for a monopolist. We note that Cournot duopoly games are reasonable approximations for many real-world arrangements, including hourly spot markets for electricity.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

B. Allaz and J.-L Vila, Cournot competition, forward markets and efficiency, Journal of Economic Theory 59 (1993), 1–16.
Article ISI Google Scholar
Robert Axelrod, The evolution of cooperation, Basic Books, Inc., New York, NY, 1984.
Google Scholar
R. R. Bush and F. Mosteller, Stochastic models for learning, Wiley, New York, NY, 1955.
Google Scholar
B. Banerjee, R. Mukherjee, and S. Sen, Learning mutual trust, Working Notes of AGENTS-00 Workshop on Deception, Fraud and Trust in Agent Societies, 2000, citeseer.nj.nec.com/banerjee00learning.html, pp. 9–14.
Google Scholar
D. W. Bunn and F. Oliveira, Evaluating individual market power in electricity markets via agent-based simulation, Annals of Operations Research 121 (2003), 57–78.
Article ISI MathSciNet Google Scholar
Colin F. Camerer, Behavioral game theory: Experiments in strategic interaction, Russell Sage Foundation and Princeton University Press, New York, NY and Princeton, NJ, 2003.
Google Scholar
Caroline Claus and Craig Boutilier, The dynamics of reinforcement learning in cooperative multiagent systems, Proceedings of the Fifteenth National Conference on Artificial Intelligence (Menlo Park, CA), AAAI Press/MIT Press, 1998, pp. 746–752.
Google Scholar
Andrew M. Colman, Game theory and its applications in the social and biological sciences, second ed., Routledge, London, UK, 1995.
Google Scholar
A. Cournot, Researches into the mathematical principles of the theory of wealth, Macmillan, New York, NY, 1897, English edition edited by N. Bacon. Originally published in French as Recherches sur Principes Mathématiques de la Théorie des Richesses in 1838.
Google Scholar
Robyn M. Dawes, Social dilemmas, Annual Review of Psychology 31 (1980), 169–193.
Article ISI Google Scholar
Garett O. Dworman, Steven O. Kimbrough, and James D. Laing, Bargaining by artificial agents in two coalition games: A study in genetic programming for electronic commerce, Genetic Programming 1996: Proceedings of the First Annual Genetic Programming Conference, July 28–31, 1996, Stanford University (John R. Koza, David E. Goldberg, David B. Fogel, and Rick L. Riolo, eds.), The MIT Press, 1996, pp. 54–62.
Google Scholar
Ido Erev and Alvin E. Roth, Predicting how people play games: Reinforcement learning in experimental games with unique, mixed strategy equilibria, The American Economic Review 88 (1998), no. 4, 848–881.
Google Scholar
[FKP⁺02]_Christina Fang, Steven O. Kimbrough, Stefano Pace, Annapurna Valluri, and Zhiqiang Zheng, On adaptive emergence of trust behavior in the game of stag hunt, Group Decision and Negotiation 11 (November 2002), no. 6, 449–467.
Article ISI Google Scholar
J. W. Friedman, Oligopoly and the theory of games, North Holland (now Elsevier), 1977.
Google Scholar
J. S. Gans, D. Price, and K. Woods, Contracts and electricity pool prices, Australian Journal of Management 23 (1998), no. 1, 83–96.
Article Google Scholar
S. M. Harvey and W. W. Hogan, California electricity prices and forward market hedging, Technical report: working paper series, Center for Business and Government, John F. Kennedy School of Government, Harvard University, Cambridge, Massachusetts 02138, October 2000.
Google Scholar
Charles A. Holt, An experimental test of the consistent-conjectures hypothesis, The American Economic Review 75 (1985), no. 3, 314–325.
Google Scholar
J. Hu and M. P. Wellman, Multiagent reinforcement learning: Theoretical framework and an algorithm, Fifteenth International Conference on Machine Learning, July 1998, pp. 242–250.
Google Scholar
Steven O. Kimbrough and Ming Lu, A note on Q-learning in the Cournot game, WeB 2003: Proceedings of the Second Workshop in e-Business (Seattle, WA), December 13–14, 2003, Available at http://opimsun.wharton.upenn.edu/~sok/sokpapers/2004/cournot-rl-note-final.doc.
Google Scholar
—, Simple reinforcement learning agents: Pareto beats Nash in an algorithmic game theory study, Information Systems and e-Business (forthcoming 2004).
Google Scholar
Steven O. Kimbrough, Ming Lu, and Ann Kuo, A note on strategic learning in policy space, Formal Modelling in Electronic Commerce: Representation, Inference, and Strategic Interaction (Steven O. Kimbrough and D. J. Wu, eds.), Springer, 2004.
Google Scholar
Leslie Pack Kaelbling, Michael L. Littman, and Andrew W. Moore, Reinforcement learning: A survey, Journal of Artificial Intelligence Research 4 (1996), 237–285.
ISI Google Scholar
John H. Kagel and Alvin E. Roth (eds.), The handbook of experimental economics, Princeton University Press, Princeton, NJ, 1995.
Google Scholar
David M. Kreps, Game theory and economic modeling, Clarendon Press, Oxford, England, 1990.
Google Scholar
C. Le Coq and Henrik Orzen, Do forward markets enhance competition? experimental evidence, Technical report: working paper series, The Economic Research Institute, Stockholm School of Economics, SSE/EFI Working Paper, Department of Economics, Sveavagen, P.O. Box 6501, 113 83 Stockholm, Sweden, August 2002.
Google Scholar
Michael W. Macy and Andreas Flache, Learning dynamics in social dilemmas, Proceedings of the National Academy of Science (PNAS) 99 (2002), no. suppl. 3, 7229–7236.
CAS Google Scholar
Rajatish Mukherjee and Sandip Sen, Towards a pareto-optimal solution in general-sum games, 2004, citeseer.nj.nec.com/591017.html.
Google Scholar
Anatol Rapoport and Albert M. Chammah, Prisoner’s dilemma: A study in conflict and cooperation, The University of Michigan Press, Ann Arbor, MI, 1965.
Google Scholar
Alvin E. Roth and Ido Erev, Learning in extensive-form games: Experimental data and simple dynamic models in the intermediate term, Games and Economic Behavior 8 (1995), 164–212.
Article ISI MathSciNet Google Scholar
Amnon Rapoport, William E. Stein, and Graham J. Burkheimer, Response models for detection of change, D. Reidel Publishing Company, Dordrecht, Holland, 1979.
Google Scholar
Richar S. Sutton and Andrew G. Barto, Reinforcement learning: An introduction, The MIT Press, Cambridge, MA, 1998.
Google Scholar
T. Sandholm and R. Crites, Multiagent reinforcement learning in iterated prisoner’s dilemma, Biosystems 37 (1995), 147–166, Special Issue on the Prisoner’s Dilemma.
ISI Google Scholar
Hal R. Varian, Intermediate microeconomics: A modern approach, W. W. Norton & Company, New York, NY, 2003.
Google Scholar

Download references

Author information

Authors and Affiliations

University of Pennsylvania, Philadelphia, PA, USA
Steven O. Kimbrough
University of Pennsylvania, Philadelphia, PA, USA
Ming Lu
Temple University, Philadelphia, PA, USA
Frederic Murphy

Authors

Steven O. Kimbrough
View author publications
You can also search for this author in PubMed Google Scholar
Ming Lu
View author publications
You can also search for this author in PubMed Google Scholar
Frederic Murphy
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Operations & Information Management, University of Pennsylvania, 565 Jon M. Huntsman Hall 3730 Walnut Street, Philadelphia, PA, 19104-6340, USA
Steven O. Kimbrough
College of Management, Georgia Institute of Technology, 800 West Peachtree Street, NW, Atlanta, GA, 30332-0520, USA
D.J. Wu

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Kimbrough, S.O., Lu, M., Murphy, F. (2005). Learning and Tacit Collusion by Artificial Agents in Cournot Duopoly Games. In: Kimbrough, S.O., Wu, D. (eds) Formal Modelling in Electronic Commerce. International Handbooks on Information Systems. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-26989-4_19

Download citation

DOI: https://doi.org/10.1007/3-540-26989-4_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21431-1
Online ISBN: 978-3-540-26989-2
eBook Packages: Business and EconomicsBusiness and Management (R0)

Publish with us

Policies and ethics