Regret Matching with Finite Memory

Saran, Rene; Serrano, Roberto

doi:10.1007/s13235-011-0021-8

Regret Matching with Finite Memory

Open access
Published: 09 June 2011

Volume 2, pages 160–175, (2012)
Cite this article

Download PDF

You have full access to this open access article

Dynamic Games and Applications Aims and scope Submit manuscript

Regret Matching with Finite Memory

Download PDF

Rene Saran¹ &
Roberto Serrano^2,3

942 Accesses
2 Citations
Explore all metrics

Abstract

We consider the regret matching process with finite memory. For general games in normal form, it is shown that any recurrent class of the dynamics must be such that the action profiles that appear in it constitute a closed set under the “same or better reply” correspondence (CUSOBR set) that does not contain a smaller product set that is closed under “same or better replies,” i.e., a smaller PCUSOBR set. Two characterizations of the recurrent classes are offered. First, for the class of weakly acyclic games under better replies, each recurrent class is monomorphic and corresponds to each pure Nash equilibrium. Second, for a modified process with random sampling, if the sample size is sufficiently small with respect to the memory bound, the recurrent classes consist of action profiles that are minimal PCUSOBR sets. Our results are used in a robust example that shows that the limiting empirical distribution of play can be arbitrarily far from correlated equilibria for any large but finite choice of the memory bound.

Article PDF

On Repeated Zero-Sum Games with Incomplete Information and Asymptotically Bounded Values

Article 03 March 2017

Subgame-perfection in recursive perfect information games, where each player controls one state

Article Open access 26 October 2015

The worst-case payoff in games with stochastic revision opportunities

Article 19 November 2020

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

References

Basu K, Weibull JW (1991) Strategy subsets closed under rational behavior. Econ Lett 36:141–146
Article MathSciNet MATH Google Scholar
Bernheim BD (1984) Rationalizable strategic behavior. Econometrica 52:1007–1028
Article MathSciNet MATH Google Scholar
Foster DP, Vohra RV (1997) Calibrated learning and correlated equilibrium. Games Econ Behav 21:40–55
Article MathSciNet MATH Google Scholar
Foster DP, Vohra RV (1998) Asymptotic calibration. Biometrika 85:379–390
Article MathSciNet MATH Google Scholar
Friedman JW, Mezzetti C (2001) Learning in games by random sampling. J Econ Theory 98:55–84
Article MathSciNet MATH Google Scholar
Fudenberg D, Levine DK (1995) Universal consistency and cautious fictitious play. J Econ Dyn Control 19:1065–1089
Article MathSciNet MATH Google Scholar
Fudenberg D, Levine DK (1998) The theory of learning in games. MIT Press, Cambridge
MATH Google Scholar
Fudenberg D, Levine DK (1999) Conditional universal consistency. Games Econ Behav 29:104–130
Article MathSciNet MATH Google Scholar
Hannan J (1957) Approximation to Bayes risk in repeated play. In: Dresher M et al. (eds) Contributions to the theory of games III. Princeton University Press, Princeton, pp 97–139
Google Scholar
Hart S (2005) Adaptive heuristics. Econometrica 73:1401–1430
Article MathSciNet MATH Google Scholar
Hart S, Mas-Colell A (2000) A simple adaptive procedure leading to correlated equilibrium. Econometrica 68:1127–1150
Article MathSciNet MATH Google Scholar
Hurkens S (1995) Learning by forgetful players. Games Econ Behav 11:304–329
Article MathSciNet MATH Google Scholar
Josephson J, Matros A (2004) Stochastic imitation in finite games. Games Econ Behav 49:244–259
Article MathSciNet MATH Google Scholar
Lehrer E, Solan E (2009) Approachability with bounded memory. Games Econ Behav 66:995–1004
Article MathSciNet MATH Google Scholar
Marden JR, Arslan G, Shamma JS (2007) Regret based dynamics: convergence in weakly acyclic games. In: AAMAS ’07: proceedings of the 6th international joint conference on autonomous agents and multiagent systems. ACM, New York, pp 194–201
Google Scholar
Monderer D, Shapley LS (1996) Potential games. Games Econ Behav 14:124–143
Article MathSciNet MATH Google Scholar
Pearce DG (1984) Rationalizable strategic behavior and the problem of perfection. Econometrica 52:1029–1050
Article MathSciNet MATH Google Scholar
Ritzberger K, Weibull JW (1995) Evolutionary selection in normal-form games. Econometrica 63:1371–1399
Article MathSciNet MATH Google Scholar
Sandholm WH (2009) Evolutionary game theory. In: Meyers R (ed) Encyclopedia of complexity and systems science. Springer, New York, pp 3176–3205
Google Scholar
Saran R, Serrano R (2010) Ex-post regret learning in games with fixed and random matching: the case of private values. Working Paper, Brown University. URL: http://www.econ.brown.edu/faculty/serrano/pdfs/wp2010-11.pdf
Shapley LS (1964) Some topics in two-person games. In: Dresher M et al. (eds) Advances in game theory. Annals of mathematical studies, vol 52. Princeton University Press, Princeton, pp 1–28
Google Scholar
Viossat Y (2007) The replicator dynamics does not lead to correlated equilibria. Games Econ Behav 59:397–407
Article MathSciNet MATH Google Scholar
Viossat Y (2008) Evolutionary dynamics may eliminate all strategies used in correlated equilibrium. Math Soc Sci 56:27–43
Article MathSciNet MATH Google Scholar
Young HP (1993) The evolution of conventions. Econometrica 61:57–84
Article MathSciNet MATH Google Scholar
Young HP (1998) Individual strategy and social structure. Princeton University Press, Princeton
Google Scholar
Young HP (2004) Strategic learning and its limits. Oxford University Press, Oxford
Book Google Scholar
Zapechelnyuk A (2008) Better-reply dynamics with bounded recall. Math Oper Res 33:869–879
Article MathSciNet MATH Google Scholar

Download references

Author information

Authors and Affiliations

Maastricht University, Maastricht, The Netherlands
Rene Saran
Brown University, Providence, USA
Roberto Serrano
IMDEA Social Sciences Institute, Madrid, Spain
Roberto Serrano

Authors

Rene Saran
View author publications
You can also search for this author in PubMed Google Scholar
Roberto Serrano
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Rene Saran.

Rights and permissions

Open Access This is an open access article distributed under the terms of the Creative Commons Attribution Noncommercial License (https://creativecommons.org/licenses/by-nc/2.0), which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

Reprints and permissions

About this article

Cite this article

Saran, R., Serrano, R. Regret Matching with Finite Memory. Dyn Games Appl 2, 160–175 (2012). https://doi.org/10.1007/s13235-011-0021-8

Download citation

Published: 09 June 2011
Issue Date: March 2012
DOI: https://doi.org/10.1007/s13235-011-0021-8

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Regret Matching with Finite Memory

Abstract

Article PDF

Similar content being viewed by others

On Repeated Zero-Sum Games with Incomplete Information and Asymptotically Bounded Values

Subgame-perfection in recursive perfect information games, where each player controls one state

The worst-case payoff in games with stochastic revision opportunities

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Regret Matching with Finite Memory

Abstract

Article PDF

Similar content being viewed by others

On Repeated Zero-Sum Games with Incomplete Information and Asymptotically Bounded Values

Subgame-perfection in recursive perfect information games, where each player controls one state

The worst-case payoff in games with stochastic revision opportunities

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation