TheN-armed bandit with unimodal structure

Herkenrath, U.

doi:10.1007/BF02056924

TheN-armed bandit with unimodal structure

Published: 01 December 1983

Volume 30, pages 195–210, (1983)
Cite this article

Metrika Aims and scope Submit manuscript

U. Herkenrath¹

52 Accesses
4 Citations
Explore all metrics

Abstract

In this paper we study a special class of bandit problems, which are characterized by a unimodal structure of the expected rewards of the arms. In Section 1, the motivation for studying this problem is explained. In the next two sections, two different decision procedures are analyzed, which are based on a stochastic approximation of the best arm of the bandit. Finally, in Section 4, a special procedure is discussed and some numerical data are presented, which were obtained by applying it to a concreteN-armed bandit with unimodal structure.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Bather, J.: Randomised allocation of treatments in sequential trials. Adv. Appl. Prob.12, 1980, 174–182.
Article MathSciNet MATH Google Scholar
Fabian, V.: Stochastic approximation of minima with improved asymptotic speed. Ann. Math. Statist.38, 1967, 191–200.
Article MathSciNet MATH Google Scholar
Robbins, H.: Some aspects of the sequential design of experiments. Bull. Amer. Math. Soc.58, 1952, 527–535.
Article MathSciNet MATH Google Scholar
Wasan, M.T.: Stochastic Approximation. Cambridge 1969.
Wilde, D.J., andC.S. Beightler: Foundations of Optimization. Englewood Cliffs, NJ, 1967.
Witten, I.H.: The apparent conflict between estimation and control—A survey of the two-armed bandit problem. J. Franklin Instit.301, 1976, 161–189.
Article MathSciNet MATH Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Applied Mathematics, University of Bonn, Wegelerstraße 6, D-5300, Bonn
U. Herkenrath

Authors

U. Herkenrath
View author publications
You can also search for this author in PubMed Google Scholar

Additional information

Herrn Professor Dr. Walter Vogel zu seinem 60. Geburtstag am 22. Juni 1983 gewidmet

Research supported by the Deutsche Forschungsgemeinschaft, SFB 72.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Herkenrath, U. TheN-armed bandit with unimodal structure. Metrika 30, 195–210 (1983). https://doi.org/10.1007/BF02056924

Download citation

Received: 19 January 1982
Revised: 02 July 1982
Published: 01 December 1983
Issue Date: December 1983
DOI: https://doi.org/10.1007/BF02056924

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

TheN-armed bandit with unimodal structure

Abstract

Access this article

Similar content being viewed by others

Multi-armed bandits with dependent arms

On Two Continuum Armed Bandit Problems in High Dimensions

The non-stationary stochastic multi-armed bandit problem

References

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

TheN-armed bandit with unimodal structure

Abstract

Access this article

Similar content being viewed by others

Multi-armed bandits with dependent arms

On Two Continuum Armed Bandit Problems in High Dimensions

The non-stationary stochastic multi-armed bandit problem

References

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation