✦ LIBER ✦

A dynamic programming strategy to balance exploration and exploitation in the bandit problem

✍ Scribed by Olivier Caelen; Gianluca Bontempi

Book ID: 106343155
Publisher: Springer Netherlands
Year: 2010
Tongue: English
Weight: 542 KB
Volume: 60
Category: Article
ISSN: 1012-2443
DOI: 10.1007/s10472-010-9190-1

No coin nor oath required. For personal study only.

📜 SIMILAR VOLUMES

Finding minimax strategy and minimax ris

Finding minimax strategy and minimax risk in a random environment (the two-armed bandit problem)

✍ A. V. Kolnogorov 📂 Article 📅 2011 🏛 SP MAIK Nauka/Interperiodica 🌐 English ⚖ 220 KB

Problems and opportunities in achieving

Problems and opportunities in achieving a common international strategy to explore the Moon and Mars

✍ Richard J.H. Barnes 📂 Article 📅 1993 🏛 Elsevier Science 🌐 English ⚖ 707 KB

Extensions of the dynamic programming me

Extensions of the dynamic programming method in the deterministic and stochastic assembly-line balancing problems

✍ Mordechai I. Henig 📂 Article 📅 1986 🏛 Elsevier Science 🌐 English ⚖ 627 KB

Dynamic programming and a max-min proble

Dynamic programming and a max-min problem in the theory of structures

✍ Nestor Distefano 📂 Article 📅 1972 🏛 Elsevier Science 🌐 English ⚖ 674 KB

A max-min problem in the realm of optimum beam design is formulated and thoroughly investigated from a dynamic programming point of view. It is shown that the conditions of optimality can be directly derived from the Hamilton-Jacobi-Bellman equation of the process. The classical Euler-Lagrange equat

Balancing exploration and exploitation i

Balancing exploration and exploitation in alternative work arrangements: a multiple case study in the professional and management services industry

✍ Jean-Baptiste Litrico; Mary Dean Lee 📂 Article 📅 2008 🏛 John Wiley and Sons 🌐 English ⚖ 216 KB

## Abstract In this inductive study we investigate the local context surrounding professionals choosing to work on a reduced‐load basis. We analyze qualitative data collected from key individuals (spouse, boss, co‐worker, and HR manager) composing a network around several professionals working redu

A dynamic programming methodology in ver

A dynamic programming methodology in very large scale neighborhood search applied to the traveling salesman problem

✍ Özlem Ergun; James B. Orlin 📂 Article 📅 2006 🏛 Elsevier Science 🌐 English ⚖ 215 KB