ABSTRACT
We analyze a scenario in which software agents implemented as regret-minimizing algorithms engage in a repeated auction on behalf of their users. We study first-price and second-price auctions, as well as their generalized versions (e.g., as those used for ad auctions). Using both theoretical analysis and simulations, we show that, surprisingly, in second-price auctions the players have incentives to misreport their true valuations to their own learning agents, while in first-price auctions it is a dominant strategy for all players to truthfully report their valuations to their agents.
- Saeed Alaei, Ashwinkumar Badanidiyuru, Mohammad Mahdian, and Sadra Yazdanbod. 2019. Response prediction for low-regret agents. In International Conference on Web and Internet Economics. Springer, 31–44.Google ScholarCross Ref
- Sanjeev Arora, Elad Hazan, and Satyen Kale. 2012. The multiplicative weights update method: a meta-algorithm and applications. Theory of Computing 8, 1 (2012), 121–164.Google ScholarCross Ref
- Peter Auer, Nicolo Cesa-Bianchi, and Paul Fischer. 2002. Finite-time analysis of the multiarmed bandit problem. Machine learning 47, 2 (2002), 235–256.Google ScholarDigital Library
- Peter Auer, Nicolo Cesa-Bianchi, and Claudio Gentile. 2002. Adaptive and self-confident on-line learning algorithms. J. Comput. System Sci. 64, 1 (2002), 48–75.Google ScholarDigital Library
- James P Bailey, Sai Ganesh Nagarajan, and Georgios Piliouras. 2021. Stochastic Multiplicative Weights Updates in Zero-Sum Games. arXiv preprint arXiv:2110.02134(2021).Google Scholar
- James P Bailey and Georgios Piliouras. 2018. Multiplicative weights update in zero-sum games. In Proceedings of the 2018 ACM Conference on Economics and Computation. 321–338.Google ScholarDigital Library
- Santiago R Balseiro and Yonatan Gur. 2019. Learning in repeated auctions with budgets: Regret minimization and equilibrium. Management Science 65, 9 (2019).Google Scholar
- Omer Ben-Porat and Moshe Tennenholtz. 2018. A game-theoretic approach to recommendation systems with strategic content providers. In Proceedings of the 32nd International Conference on Neural Information Processing Systems. 1118–1128.Google Scholar
- Georgios Birmpas, Jiarui Gan, Alexandros Hollender, Francisco J Marmolejo-Cossío, Ninad Rajgopal, and Alexandros A Voudouris. 2020. Optimally Deceiving a Learning Leader in Stackelberg Games. arXiv:2006.06566 (2020).Google Scholar
- Avrim Blum, MohammadTaghi Hajiaghayi, Katrina Ligett, and Aaron Roth. 2008. Regret minimization and the price of total anarchy. In Proceedings of the fortieth annual ACM symposium on Theory of computing. 373–382.Google ScholarDigital Library
- Avrim Blum and Yishay Monsour. 2007. Learning, regret minimization, and equilibria. (2007).Google Scholar
- Christian Borgs, Jennifer Chayes, Nicole Immorlica, Kamal Jain, Omid Etesami, and Mohammad Mahdian. 2007. Dynamics of bid optimization in online advertisement auctions. In Proceedings of the 16th international conference on World Wide Web. 531–540.Google ScholarDigital Library
- Mark Braverman, Jieming Mao, Jon Schneider, and Matt Weinberg. 2018. Selling to a no-regret buyer. In Proceedings of the 2018 ACM Conference on Economics and Computation. 523–538.Google ScholarDigital Library
- George W Brown. 1951. Iterative solution of games by fictitious play. Activity analysis of production and allocation 13, 1(1951), 374–376.Google Scholar
- Yang Cai, Constantinos Daskalakis, and Christos Papadimitriou. 2015. Optimum statistical estimation with strategic data sources. In Conference on Learning Theory. PMLR, 280–296.Google Scholar
- Ioannis Caragiannis, Christos Kaklamanis, Panagiotis Kanellopoulos, Maria Kyropoulou, Brendan Lucier, Renato Paes Leme, and Eva Tardos. 2015. Bounding the inefficiency of outcomes in generalized second price auctions. Journal of Economic Theory 156 (2015), 343–388.Google ScholarCross Ref
- Nicolo Cesa-Bianchi, Claudio Gentile, and Yishay Mansour. 2014. Regret minimization for reserve prices in second-price auctions. IEEE Transactions on Information Theory 61, 1 (2014), 549–564.Google ScholarCross Ref
- Nicolo Cesa-Bianchi and Gábor Lugosi. 2006. Prediction, learning, and games. Cambridge university press.Google Scholar
- Shuchi Chawla and Jason D Hartline. 2013. Auctions with unique equilibria. In Proceedings of the fourteenth ACM conference on Electronic commerce. 181–196.Google ScholarDigital Library
- Yiling Chen, Yang Liu, and Chara Podimata. 2020. Learning Strategy-Aware Linear Classifiers. Advances in Neural Information Processing Systems 33 (2020), 15265–15276.Google Scholar
- Constantinos Daskalakis, Rafael Frongillo, Christos H Papadimitriou, George Pierrakos, and Gregory Valiant. 2010. On learning algorithms for Nash equilibria. In International Symposium on Algorithmic Game Theory. Springer, 114–125.Google ScholarCross Ref
- Constantinos Daskalakis and Vasilis Syrgkanis. 2016. Learning in auctions: Regret is hard, envy is easy. In 2016 ieee 57th annual symposium on foundations of computer science (focs). IEEE, 219–228.Google Scholar
- Xiaotie Deng, Xinyan Hu, Tao Lin, and Weiqiang Zheng. 2021. Nash Convergence of Mean-Based Learning Algorithms in First Price Auctions. arXiv preprint arXiv:2110.03906(2021).Google Scholar
- Yuan Deng, Jieming Mao, Vahab Mirrokni, and Song Zuo. 2021. Towards Efficient Auctions in an Auto-bidding World. In Proceedings of the Web Conference 2021.Google ScholarDigital Library
- Yuan Deng, Jon Schneider, and Balasubramanian Sivan. 2019. Strategizing against No-regret Learners. Advances in Neural Information Processing Systems 32 (2019), 1579–1587.Google Scholar
- Jinshuo Dong, Aaron Roth, Zachary Schutzman, Bo Waggoner, and Zhiwei Steven Wu. 2018. Strategic classification from revealed preferences. In Proceedings of the 2018 ACM Conference on Economics and Computation. 55–70.Google ScholarDigital Library
- Paul Dütting, Felix Fischer, and David C Parkes. 2019. Expressiveness and robustness of first-price position auctions. Mathematics of Operations Research 44, 1 (2019), 196–211.Google ScholarDigital Library
- Benjamin Edelman, Michael Ostrovsky, and Michael Schwarz. 2007. Internet advertising and the generalized second-price auction: Selling billions of dollars worth of keywords. American economic review 97, 1 (2007), 242–259.Google Scholar
- Michal Feldman, Brendan Lucier, and Noam Nisan. 2016. Correlated and coarse equilibria of single-item auctions. In International Conference on Web and Internet Economics. Springer, 131–144.Google ScholarDigital Library
- Zhe Feng, Guru Guruganesh, Christopher Liaw, Aranyak Mehta, and Abhishek Sethi. 2020. Convergence Analysis of No-Regret Bidding Algorithms in Repeated Auctions. arXiv preprint arXiv:2009.06136(2020).Google Scholar
- Jiarui Gan, Qingyu Guo, Long Tran-Thanh, Bo An, and Michael Wooldridge. 2019. Manipulating a learning defender and ways to counteract. In Proceedings of the 33rd International Conference on Neural Information Processing Systems. 8274–8283.Google Scholar
- Jiarui Gan, Haifeng Xu, Qingyu Guo, Long Tran-Thanh, Zinovi Rabinovich, and Michael Wooldridge. 2019. Imitative Follower Deception in Stackelberg Games. In Proceedings of the 2019 ACM Conference on Economics and Computation(EC ’19). 639–657.Google ScholarDigital Library
- Matthew L Gentry, Timothy P Hubbard, Denis Nekipelov, Harry J Paarsch, 2018. Structural econometrics of auctions: A review.Google Scholar
- Ganesh Ghalme, Vineet Nair, Itay Eilat, Inbal Talgam-Cohen, and Nir Rosenfeld. 2021. Strategic Classification in the Dark. 139 (2021), 3672–3681. https://proceedings.mlr.press/v139/ghalme21a.htmlGoogle Scholar
- Michael D Grigoriadis and Leonid G Khachiyan. 1995. A sublinear-time randomized approximation algorithm for matrix games. Operations Research Letters 18, 2 (1995), 53–58.Google ScholarDigital Library
- Nika Haghtalab, Nicole Immorlica, Brendan Lucier, and Jack Z. Wang. 2021. Maximizing Welfare with Incentive-Aware Evaluation Mechanisms., Article 23 (2021), 7 pages.Google Scholar
- James Hannan. 1957. Approximation to Bayes Risk in Repeated Play. In Contributions to the Theory of Games (AM-39), Volume III. Princeton University Press, 97–139.Google Scholar
- Moritz Hardt, Nimrod Megiddo, Christos Papadimitriou, and Mary Wootters. 2016. Strategic classification. In Proceedings of the 2016 ACM conference on innovations in theoretical computer science. 111–122.Google ScholarDigital Library
- Sergiu Hart and Andreu Mas-Colell. 2000. A simple adaptive procedure leading to correlated equilibrium. Econometrica 68, 5 (2000), 1127–1150.Google ScholarCross Ref
- Sergiu Hart and Andreu Mas-Colell. 2013. Simple adaptive strategies: from regret-matching to uncoupled dynamics. Vol. 4. World Scientific.Google Scholar
- Jason Hartline, Vasilis Syrgkanis, and Eva Tardos. 2015. No-Regret Learning in Bayesian Games. Advances in Neural Information Processing Systems 28 (2015), 3061–3069.Google Scholar
- Darrell Hoy, Kamal Jain, and Christopher A Wilkens. 2013. A dynamic axiomatic approach to first-price auctions. (2013), 583–584.Google Scholar
- Bernard J Jansen and Tracy Mullen. 2008. Sponsored search: an overview of the concept, history, and technology. International Journal of Electronic Business 6, 2 (2008), 114–131.Google ScholarCross Ref
- Adam Kalai and Santosh Vempala. 2002. Geometric algorithms for online optimization. In Journal of Computer and System Sciences. Citeseer.Google Scholar
- Adam Kalai and Santosh Vempala. 2005. Efficient algorithms for online decision problems. J. Comput. System Sci. 71, 3 (2005), 291–307.Google ScholarDigital Library
- Yoav Kolumbus and Noam Nisan. 2021. Auctions Between Regret-Minimizing Agents. arXiv preprint arXiv:2110.11855(2021). https://arxiv.org/pdf/2110.11855.pdfGoogle Scholar
- Yoav Kolumbus and Noam Nisan. 2021. How and Why to Manipulate Your Own Agent. arXiv preprint arXiv:2112.07640(2021). https://arxiv.org/pdf/2112.07640.pdfGoogle Scholar
- Vijay Krishna. 2009. Auction theory. Academic press.Google Scholar
- John Langford and Tong Zhang. 2007. Epoch-Greedy algorithm for multi-armed bandits with side information. Advances in Neural Information Processing Systems (NIPS 2007) 20 (2007), 1.Google Scholar
- Sagi Levanon and Nir Rosenfeld. 2021. Strategic Classification Made Practical. In Proceedings of the 38th International Conference on Machine Learning, Marina Meila and Tong Zhang (Eds.). Vol. 139. 6243–6253. https://proceedings.mlr.press/v139/levanon21a.htmlGoogle Scholar
- Paul R Milgrom 1987. Auction theory. In Advances in economic theory: Fifth world congress, Vol. 1. Cambridge University Press Cambridge, 32.Google Scholar
- Mehryar Mohri and Andres Munoz. 2014. Optimal regret minimization in posted-price auctions with strategic buyers. In Advances in Neural Information Processing Systems. 1871–1879.Google Scholar
- Barnabé Monnot and Georgios Piliouras. 2017. Limits and limitations of no-regret learning in games. The Knowledge Engineering Review 32 (2017).Google Scholar
- Denis Nekipelov, Vasilis Syrgkanis, and Eva Tardos. 2015. Econometrics for Learning Agents. In Proceedings of the Sixteenth ACM Conference on Economics and Computation (Portland, Oregon, USA) (EC ’15). Association for Computing Machinery, New York, NY, USA, 1–18. https://doi.org/10.1145/2764468.2764522Google ScholarDigital Library
- Thanh Nguyen and Haifeng Xu. 2019. Imitative Attacker Deception in Stackelberg Security Games.. In IJCAI. 528–534.Google Scholar
- Thanh H Nguyen, Yongzhao Wang, Arunesh Sinha, and Michael P Wellman. 2019. Deception in finitely repeated security games. In Proceedings of the AAAI Conference on Artificial Intelligence.Google ScholarDigital Library
- Noam Nisan 2007. Introduction to mechanism design (for computer scientists). Algorithmic game theory 9 (2007), 209–242.Google Scholar
- Noam Nisan and Gali Noti. 2017. An Experimental Evaluation of Regret-Based Econometrics. In Proceedings of the 26th International Conference on World Wide Web (Perth, Australia) (WWW ’17). International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, CHE, 73–81. https://doi.org/10.1145/3038912.3052621Google ScholarDigital Library
- Noam Nisan and Gali Noti. 2017. A ”Quantal Regret” Method for Structural Econometrics in Repeated Games. In Proceedings of the 2017 ACM Conference on Economics and Computation (Cambridge, Massachusetts, USA) (EC ’17). Association for Computing Machinery, New York, NY, USA, 123. https://doi.org/10.1145/3033274.3085111Google ScholarDigital Library
- Gali Noti and Vasilis Syrgkanis. 2021. Bid prediction in repeated auctions with learning. In Proceedings of the Web Conference 2021. 3953–3964. https://doi.org/10.1145/3442381.3449968Google ScholarDigital Library
- Christos Papadimitriou and Georgios Piliouras. 2018. From nash equilibria to chain recurrent sets: An algorithmic solution concept for game theory. Entropy 20, 10 (2018), 782.Google ScholarCross Ref
- Julia Robinson. 1951. An iterative method of solving a game. Annals of mathematics(1951), 296–301.Google Scholar
- Tim Roughgarden. 2012. The price of anarchy in games of incomplete information. In Proceedings of the 13th ACM Conference on Electronic Commerce. 862–879.Google ScholarDigital Library
- Tim Roughgarden, Vasilis Syrgkanis, and Eva Tardos. 2017. The price of anarchy in auctions. Journal of Artificial Intelligence Research 59 (2017), 59–101.Google ScholarDigital Library
- Tim Roughgarden and Joshua R Wang. 2019. Minimizing regret with multiple reserves. ACM Transactions on Economics and Computation (TEAC) 7, 3 (2019).Google Scholar
- Shai Shalev-Shwartz 2011. Online learning and online convex optimization. Foundations and trends in Machine Learning 4 (2011), 107–194.Google Scholar
- Vasilis Syrgkanis and Eva Tardos. 2013. Composable and Efficient Mechanisms. In Proceedings of the Forty-Fifth Annual ACM Symposium on Theory of Computing (Palo Alto, California, USA) (STOC ’13). Association for Computing Machinery, New York, NY, USA, 211–220. https://doi.org/10.1145/2488608.2488635Google ScholarDigital Library
- Moshe Tennenholtz and Oren Kurland. 2019. Rethinking search engines and recommendation systems: a game theoretic perspective. Commun. ACM 62, 12 (2019), 66–75.Google ScholarDigital Library
- Hal R Varian. 2007. Position auctions. international Journal of industrial Organization 25, 6(2007), 1163–1178.Google Scholar
- William Vickrey. 1961. Counterspeculation, auctions, and competitive sealed tenders. The Journal of finance 16, 1 (1961), 8–37.Google ScholarCross Ref
- H Peyton Young. 2004. Strategic learning and its limits. OUP Oxford.Google Scholar
- Martin Zinkevich. 2003. Online convex programming and generalized infinitesimal gradient ascent. In Proceedings of the 20th international conference on machine learning. 928–936.Google Scholar
Index Terms
- Auctions between Regret-Minimizing Agents
Recommendations
The Effect of Regret on Optimal Bidding in Auctions
<P>While most models of auctions and competitive bidding assume that each bidder's utility for an outcome depends only on his own profit, we allow the utility to also depend on any regret that a bidder suffers after the fact and characterize when and ...
Envy-free auctions for digital goods
EC '03: Proceedings of the 4th ACM conference on Electronic commerceWe study auctions for a commodity in unlimited supply, e.g., a digital good. In particular we consider three desirable properties for auctions: item Competitive: the auction achieves a constant fraction of the optimal revenue even on worst case inputs. ...
Auctions and differential pricing: optimal seller and bidder strategies in second-chance offers
The second chance offer is a common seller practice on eBay. It consists of price discrimination against the losing bidder, who is offered an identical item at the value of his or her highest bid. Prior work has shown that, if the goods are private-...
Comments