The Budgeted Multi-armed Bandit Problem

Madani, Omid; Lizotte, Daniel J.; Greiner, Russell

doi:10.1007/978-3-540-27819-1_46

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3120)

Included in the following conference series:

International Conference on Computational Learning Theory

Abstract

The following coins problem is a version of a multi-armed bandit problem where one has to select from among a set of objects, say classifiers, after an experimentation phase that is constrained by a time or cost budget. The question is how to spend the budget. The problem involves pure exploration only, differentiating it from typical multi-armed bandit problems involving an exploration/exploitation tradeoff [BF85]. It is an abstraction of the following scenarios: choosing from among a set of alternative treatments after a fixed number of clinical trials; determining the best parameter settings for a program given a deadline that only allows a fixed number of runs; or choosing a life partner in the bachelor/bachelorette TV show where time is limited. We are interested in the computational complexity of the coins problem and/or efficient algorithms with approximation guarantees.
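
As a rough illustration of the setting described in the abstract, the sketch below simulates a budgeted, pure-exploration phase over a set of coins and then selects one. The round-robin allocation policy, the uniform Beta(1, 1) prior, and the name round_robin_budgeted_selection are all assumptions made for this example; they are not taken from the paper, which asks how the budget should best be spent rather than prescribing this baseline.

```python
import random


def round_robin_budgeted_selection(head_probs, budget, seed=0):
    """Spend `budget` flips round-robin, then pick the coin with the best posterior mean.

    head_probs: true (hidden) head probabilities, used only to simulate flips.
    Assumes a uniform Beta(1, 1) prior on each coin; the prior and the
    round-robin policy are illustrative choices, not the paper's method.
    """
    rng = random.Random(seed)
    n = len(head_probs)
    heads = [0] * n
    flips = [0] * n

    # Exploration phase: every flip costs one unit of the budget.
    for t in range(budget):
        i = t % n
        flips[i] += 1
        if rng.random() < head_probs[i]:
            heads[i] += 1

    # Selection phase: posterior mean of Beta(1 + heads, 1 + tails).
    posterior_means = [(1 + heads[i]) / (2 + flips[i]) for i in range(n)]
    best = max(range(n), key=lambda i: posterior_means[i])
    return best, posterior_means, flips


if __name__ == "__main__":
    best, means, flips = round_robin_budgeted_selection([0.3, 0.5, 0.8], budget=30)
    print("chosen coin:", best)
    print("posterior means:", [round(m, 3) for m in means])
    print("flips per coin:", flips)
```

Only the final choice is scored here; no reward is collected during the flips, which is what makes the problem pure exploration. A smarter policy would decide each flip adaptively based on the outcomes seen so far.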

References

Berry, D., Fristedt, B.: Bandit Problems: Sequential Allocation of Experiments. Chapman and Hall, New York (1985)
Lizotte, D., Madani, O., Greiner, R.: Budgeted learning of Naive Bayes classifiers. In: UAI 2003 (2003)
Madani, O., Lizotte, D., Greiner, R.: Active model selection (submitted). Technical report, University of Alberta and AICML (2004), http://www.cs.ualberta.ca/~madani/budget.html

Author information

Authors and Affiliations

Yahoo! Research Labs, 74 N. Pasadena Ave, Pasadena, CA, 91101, USA
Omid Madani
Dept. of Computing Science, University of Alberta, Edmonton, T6J 2E8
Daniel J. Lizotte & Russell Greiner

Editor information

Editors and Affiliations

The Centre for Computational Statistics and Machine Learning, Department of Computer Science, University College London, Gower St., London, WC1E 6BT
John Shawe-Taylor
Google, 1600 Amphitheatre Parkway, Mountain View, CA 94043, USA
Yoram Singer

About this paper

Cite this paper

Madani, O., Lizotte, D.J., Greiner, R. (2004). The Budgeted Multi-armed Bandit Problem. In: Shawe-Taylor, J., Singer, Y. (eds) Learning Theory. COLT 2004. Lecture Notes in Computer Science, vol 3120. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-27819-1_46

DOI: https://doi.org/10.1007/978-3-540-27819-1_46
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22282-8
Online ISBN: 978-3-540-27819-1
eBook Packages: Springer Book Archive
