✦ LIBER ✦

Bounding reward measures of Markov models using the Markov decision processes

✍ Scribed by Peter Buchholz

Publisher: John Wiley and Sons
Year: 2011
Tongue: English
Weight: 388 KB
Volume: 18
Category: Article
ISSN: 1070-5325
DOI: 10.1002/nla.792

No coin nor oath required. For personal study only.

✦ Synopsis

SUMMARY

For a Markov reward process, where upper and lower bounds for the transition rates and rewards are known, a new approach to bound the expected reward is presented. Based on a previous paper where sharp bounds have been defined for the problem, but only an inefficient and unstable algorithm is proposed, this paper presents algorithms to compute the bounds by interpreting the problem as a Markov Decision Process. In this way, the well known value and policy iteration algorithms can be adopted to compute reward bounds in a stable and fairly efficient way. Different numerical techniques are presented for computing the reward bounds. Copyright © 2011 John Wiley & Sons, Ltd.

📜 SIMILAR VOLUMES

Bounding the equilibrium distribution of

Bounding the equilibrium distribution of Markov population models

✍ Tuǧrul Dayar; Holger Hermanns; David Spieler; Verena Wolf 📂 Article 📅 2011 🏛 John Wiley and Sons 🌐 English ⚖ 705 KB

Measuring the quality of life through Ma

Measuring the quality of life through Markov reward processes: Analysis and inference

✍ Guglielmo D'Amico 📂 Article 📅 2009 🏛 John Wiley and Sons 🌐 English ⚖ 140 KB

The Convergence of Value Iteration in Di

The Convergence of Value Iteration in Discounted Markov Decision Processes

✍ D.J. White; W.T. Scherer 📂 Article 📅 1994 🏛 Elsevier Science 🌐 English ⚖ 435 KB

Experimental optimization of a real time

Experimental optimization of a real time fed-batch fermentation process using Markov decision process

✍ Victor M. Saucedo; M. Nazmul Karim 📂 Article 📅 1997 🏛 John Wiley and Sons 🌐 English ⚖ 304 KB 👁 2 views

This article describes a methodology that implements a Markov decision process (MDP) optimization technique in a real time fed-batch experiment. Biological systems can be better modeled under the stochastic framework and MDP is shown to be a suitable technique for their optimization. A nonlinear inp

On-line monitoring of pharmaceutical pro

On-line monitoring of pharmaceutical production processes using Hidden Markov Model

✍ Hui Zhang; Zhuangde Jiang; J.Y. Pi; H.K. Xu; R. Du 📂 Article 📅 2009 🏛 John Wiley and Sons 🌐 English ⚖ 505 KB

This article presents a new method for on-line monitoring of pharmaceutical production process, especially the powder blending process. The new method consists of two parts: extracting features from the Near Infrared (NIR) spectroscopy signals and recognizing patterns from the features. Features are

Measure of the multiple self-intersectio

Measure of the multiple self-intersection set of a markov process

✍ Simeon M. Berman 📂 Article 📅 1990 🏛 John Wiley and Sons 🌐 English ⚖ 908 KB