Value iteration in average cost Markov c
✍
Raúl Montes-de-Oca; Onésimo Hernández-Lerma
📂
Article
📅
1996
🏛
Springer Netherlands
🌐
English
⚖ 872 KB
This paper deals with discrete-time Markov control processes with Borel state and control spaces, with possibly unbounded costs and noncompact control constraint sets, and the average cost criterion. Conditions are given for the convergence of the value iteration algorithm to the optimal average cos