✦ LIBER ✦

Value iteration in average cost Markov control processes on Borel spaces

✍ Scribed by Raúl Montes-de-Oca; Onésimo Hernández-Lerma

Publisher: Springer Netherlands
Year: 1996
Tongue: English
Weight: 872 KB
Volume: 42
Category: Article
ISSN: 0167-8019
DOI: 10.1007/bf00047169

No coin nor oath required. For personal study only.

✦ Synopsis

This paper deals with discrete-time Markov control processes with Borel state and control spaces, with possibly unbounded costs and noncompact control constraint sets, and the average cost criterion. Conditions are given for the convergence of the value iteration algorithm to the optimal average cost, and for a sequence of finite-horizon optimal policies to have an accumulation point which is average cost optimal.

📜 SIMILAR VOLUMES

The average cost optimality equation for

The average cost optimality equation for Markov control processes on Borel spaces

✍ Raúl Montes-de-Oca 📂 Article 📅 1994 🏛 Elsevier Science 🌐 English ⚖ 353 KB

Average cost optimal policies for Markov

Average cost optimal policies for Markov control processes with Borel state space and unbounded costs

✍ Onésimo Hernández-Lerma; Jean B. Lasserre 📂 Article 📅 1990 🏛 Elsevier Science 🌐 English ⚖ 533 KB

The convergence of value iteration in av

The convergence of value iteration in average cost Markov decision chains

✍ Linn I. Sennott 📂 Article 📅 1996 🏛 Elsevier Science 🌐 English ⚖ 415 KB

A pause control approach to the value it

A pause control approach to the value iteration scheme in average Markov decision processes

✍ Rolando Cavazos-Cadena 📂 Article 📅 1998 🏛 Elsevier Science 🌐 English ⚖ 124 KB

This work concerns average Markov decision chains with denumerable state space. Assuming that the Lyapunov function condition holds, it is shown that the value iteration scheme yields convergent approximations to the solution of the average cost optimality equation. This result is obtained using a p

Discounted Cost Markov Decision Processe

Discounted Cost Markov Decision Processes on Borel Spaces: The Linear Programming Formulation

✍ O. Hernandezlerma; D. Hernandezhernandez 📂 Article 📅 1994 🏛 Elsevier Science 🌐 English ⚖ 557 KB

This paper is concerned with the linear programming formulation of Markov decision processes (or stochastic dynamic programs) with Borel state and action spaces and the discounted cost criterion. The one-stage cost function may be unbounded. A linear program and its dual are introduced, for which is

Average optimality in dynamic programmin

Average optimality in dynamic programming on Borel spaces — unbounded costs and controls

✍ Onésimo Hernández-Lerma 📂 Article 📅 1991 🏛 Elsevier Science 🌐 English ⚖ 367 KB