✦ LIBER ✦

Equivalence notions and model minimization in Markov decision processes

✍ Scribed by Robert Givan; Thomas Dean; Matthew Greig

Publisher: Elsevier Science
Year: 2003
Tongue: English
Weight: 566 KB
Volume: 147
Category: Article
ISSN: 0004-3702
DOI: 10.1016/s0004-3702(02)00376-4

No coin nor oath required. For personal study only.

✦ Synopsis

Many stochastic planning problems can be represented using Markov Decision Processes (MDPs). A difficulty with using these MDP representations is that the common algorithms for solving them run in time polynomial in the size of the state space, where this size is extremely large for most real-world planning problems of interest. Recent AI research has addressed this problem by representing the MDP in a factored form. Factored MDPs, however, are not amenable to traditional solution methods that call for an explicit enumeration of the state space. One familiar way to solve MDP problems with very large state spaces is to form a reduced (or aggregated) MDP with the same properties as the original MDP by combining "equivalent" states. In this paper, we discuss applying this approach to solving factored MDP problems-we avoid enumerating the state space by describing large blocks of "equivalent" states in factored form, with the block descriptions being inferred directly from the original factored representation. The resulting reduced MDP may have exponentially fewer states than the original factored MDP, and can then be solved using traditional methods. The reduced MDP found depends on the notion of equivalence between states used in the aggregation. The notion of equivalence chosen will be fundamental in designing and analyzing algorithms for reducing MDPs. Optimally, these algorithms will be able to find the smallest possible reduced MDP for any given input MDP and notion of equivalence (i.e., find the "minimal model" for the input MDP). Unfortunately, the classic notion of state equivalence from non-deterministic finite state machines generalized to MDPs does not prove useful. We present here a notion of equivalence that is based upon the notion of bisimulation from the literature on concurrent processes. Our generalization of bisimulation to stochastic processes yields a non-trivial notion of state equivalence that guarantees the optimal policy for the reduced model immediately induces a corresponding optimal policy for the original model. With this notion of state equivalence, we design and analyze

📜 SIMILAR VOLUMES

Equivalence classes for optimizing risk

Equivalence classes for optimizing risk models in Markov decision processes

✍ Yoshio Ohtsubo; Kenji Toyonaga 📂 Article 📅 2004 🏛 Springer 🌐 English ⚖ 295 KB

Minimizing a Threshold Probability in Di

Minimizing a Threshold Probability in Discounted Markov Decision Processes

✍ D.J. White 📂 Article 📅 1993 🏛 Elsevier Science 🌐 English ⚖ 399 KB

Minimizing Risk Models in Markov Decisio

Minimizing Risk Models in Markov Decision Processes with Policies Depending on Target Values

✍ Congbin Wu; Yuanlie Lin 📂 Article 📅 1999 🏛 Elsevier Science 🌐 English ⚖ 143 KB

Notes on equivalent stationary policies

Notes on equivalent stationary policies in Markov decision processes with total rewards

✍ Eugene A. Feinberg; Isaac M. Sonin 📂 Article 📅 1996 🏛 Springer 🌐 English ⚖ 846 KB

A First Course in Stochastic Models (Tij

A First Course in Stochastic Models (Tijms/Stochastic Models) || Semi-Markov Decision Processes

✍ Tijms, Henk C. 📂 Article 📅 2004 🏛 John Wiley & Sons, Ltd 🌐 English ⚖ 208 KB 👁 1 views

The Discounted Method and Equivalence of

The Discounted Method and Equivalence of Average Criteria for Risk-Sensitive Markov Decision Processes on Borel Spaces

✍ Rolando Cavazos-Cadena; Francisco Salem-Silva 📂 Article 📅 2009 🏛 Springer 🌐 English ⚖ 577 KB