Skip to main content

Learning and Tacit Collusion by Artificial Agents in Cournot Duopoly Games

  • Chapter

Part of the book series: International Handbooks on Information Systems ((INFOSYS))

Abstract

We examine learning by artificial agents in repeated play of Cournot duopoly games. Our learning model is simple and cognitively realistic. The model departs from standard reinforcement learning models, as applied to agents in games, in that it credits the agent with a form of conceptual ascent, whereby the agent is able to learn from a consideration set of strategies spanning more than one period of play. The resulting behavior is markedly different from behavior predicted by classical economics for the single-shot (unrepeated) Cournot duopoly game. In repeated play under our learning regime, agents are able to arrive at a tacit form of collusion and set production levels near to those for a monopolist. We note that Cournot duopoly games are reasonable approximations for many real-world arrangements, including hourly spot markets for electricity.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. B. Allaz and J.-L Vila, Cournot competition, forward markets and efficiency, Journal of Economic Theory 59 (1993), 1–16.

    Article  ISI  Google Scholar 

  2. Robert Axelrod, The evolution of cooperation, Basic Books, Inc., New York, NY, 1984.

    Google Scholar 

  3. R. R. Bush and F. Mosteller, Stochastic models for learning, Wiley, New York, NY, 1955.

    Google Scholar 

  4. B. Banerjee, R. Mukherjee, and S. Sen, Learning mutual trust, Working Notes of AGENTS-00 Workshop on Deception, Fraud and Trust in Agent Societies, 2000, citeseer.nj.nec.com/banerjee00learning.html, pp. 9–14.

    Google Scholar 

  5. D. W. Bunn and F. Oliveira, Evaluating individual market power in electricity markets via agent-based simulation, Annals of Operations Research 121 (2003), 57–78.

    Article  ISI  MathSciNet  Google Scholar 

  6. Colin F. Camerer, Behavioral game theory: Experiments in strategic interaction, Russell Sage Foundation and Princeton University Press, New York, NY and Princeton, NJ, 2003.

    Google Scholar 

  7. Caroline Claus and Craig Boutilier, The dynamics of reinforcement learning in cooperative multiagent systems, Proceedings of the Fifteenth National Conference on Artificial Intelligence (Menlo Park, CA), AAAI Press/MIT Press, 1998, pp. 746–752.

    Google Scholar 

  8. Andrew M. Colman, Game theory and its applications in the social and biological sciences, second ed., Routledge, London, UK, 1995.

    Google Scholar 

  9. A. Cournot, Researches into the mathematical principles of the theory of wealth, Macmillan, New York, NY, 1897, English edition edited by N. Bacon. Originally published in French as Recherches sur Principes Mathématiques de la Théorie des Richesses in 1838.

    Google Scholar 

  10. Robyn M. Dawes, Social dilemmas, Annual Review of Psychology 31 (1980), 169–193.

    Article  ISI  Google Scholar 

  11. Garett O. Dworman, Steven O. Kimbrough, and James D. Laing, Bargaining by artificial agents in two coalition games: A study in genetic programming for electronic commerce, Genetic Programming 1996: Proceedings of the First Annual Genetic Programming Conference, July 28–31, 1996, Stanford University (John R. Koza, David E. Goldberg, David B. Fogel, and Rick L. Riolo, eds.), The MIT Press, 1996, pp. 54–62.

    Google Scholar 

  12. Ido Erev and Alvin E. Roth, Predicting how people play games: Reinforcement learning in experimental games with unique, mixed strategy equilibria, The American Economic Review 88 (1998), no. 4, 848–881.

    Google Scholar 

  13. [FKP+02]_Christina Fang, Steven O. Kimbrough, Stefano Pace, Annapurna Valluri, and Zhiqiang Zheng, On adaptive emergence of trust behavior in the game of stag hunt, Group Decision and Negotiation 11 (November 2002), no. 6, 449–467.

    Article  ISI  Google Scholar 

  14. J. W. Friedman, Oligopoly and the theory of games, North Holland (now Elsevier), 1977.

    Google Scholar 

  15. J. S. Gans, D. Price, and K. Woods, Contracts and electricity pool prices, Australian Journal of Management 23 (1998), no. 1, 83–96.

    Article  Google Scholar 

  16. S. M. Harvey and W. W. Hogan, California electricity prices and forward market hedging, Technical report: working paper series, Center for Business and Government, John F. Kennedy School of Government, Harvard University, Cambridge, Massachusetts 02138, October 2000.

    Google Scholar 

  17. Charles A. Holt, An experimental test of the consistent-conjectures hypothesis, The American Economic Review 75 (1985), no. 3, 314–325.

    Google Scholar 

  18. J. Hu and M. P. Wellman, Multiagent reinforcement learning: Theoretical framework and an algorithm, Fifteenth International Conference on Machine Learning, July 1998, pp. 242–250.

    Google Scholar 

  19. Steven O. Kimbrough and Ming Lu, A note on Q-learning in the Cournot game, WeB 2003: Proceedings of the Second Workshop in e-Business (Seattle, WA), December 13–14, 2003, Available at http://opimsun.wharton.upenn.edu/~sok/sokpapers/2004/cournot-rl-note-final.doc.

    Google Scholar 

  20. —, Simple reinforcement learning agents: Pareto beats Nash in an algorithmic game theory study, Information Systems and e-Business (forthcoming 2004).

    Google Scholar 

  21. Steven O. Kimbrough, Ming Lu, and Ann Kuo, A note on strategic learning in policy space, Formal Modelling in Electronic Commerce: Representation, Inference, and Strategic Interaction (Steven O. Kimbrough and D. J. Wu, eds.), Springer, 2004.

    Google Scholar 

  22. Leslie Pack Kaelbling, Michael L. Littman, and Andrew W. Moore, Reinforcement learning: A survey, Journal of Artificial Intelligence Research 4 (1996), 237–285.

    ISI  Google Scholar 

  23. John H. Kagel and Alvin E. Roth (eds.), The handbook of experimental economics, Princeton University Press, Princeton, NJ, 1995.

    Google Scholar 

  24. David M. Kreps, Game theory and economic modeling, Clarendon Press, Oxford, England, 1990.

    Google Scholar 

  25. C. Le Coq and Henrik Orzen, Do forward markets enhance competition? experimental evidence, Technical report: working paper series, The Economic Research Institute, Stockholm School of Economics, SSE/EFI Working Paper, Department of Economics, Sveavagen, P.O. Box 6501, 113 83 Stockholm, Sweden, August 2002.

    Google Scholar 

  26. Michael W. Macy and Andreas Flache, Learning dynamics in social dilemmas, Proceedings of the National Academy of Science (PNAS) 99 (2002), no. suppl. 3, 7229–7236.

    CAS  Google Scholar 

  27. Rajatish Mukherjee and Sandip Sen, Towards a pareto-optimal solution in general-sum games, 2004, citeseer.nj.nec.com/591017.html.

    Google Scholar 

  28. Anatol Rapoport and Albert M. Chammah, Prisoner’s dilemma: A study in conflict and cooperation, The University of Michigan Press, Ann Arbor, MI, 1965.

    Google Scholar 

  29. Alvin E. Roth and Ido Erev, Learning in extensive-form games: Experimental data and simple dynamic models in the intermediate term, Games and Economic Behavior 8 (1995), 164–212.

    Article  ISI  MathSciNet  Google Scholar 

  30. Amnon Rapoport, William E. Stein, and Graham J. Burkheimer, Response models for detection of change, D. Reidel Publishing Company, Dordrecht, Holland, 1979.

    Google Scholar 

  31. Richar S. Sutton and Andrew G. Barto, Reinforcement learning: An introduction, The MIT Press, Cambridge, MA, 1998.

    Google Scholar 

  32. T. Sandholm and R. Crites, Multiagent reinforcement learning in iterated prisoner’s dilemma, Biosystems 37 (1995), 147–166, Special Issue on the Prisoner’s Dilemma.

    ISI  Google Scholar 

  33. Hal R. Varian, Intermediate microeconomics: A modern approach, W. W. Norton & Company, New York, NY, 2003.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Kimbrough, S.O., Lu, M., Murphy, F. (2005). Learning and Tacit Collusion by Artificial Agents in Cournot Duopoly Games. In: Kimbrough, S.O., Wu, D. (eds) Formal Modelling in Electronic Commerce. International Handbooks on Information Systems. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-26989-4_19

Download citation

Publish with us

Policies and ethics