๐”– Bobbio Scriptorium
โœฆ   LIBER   โœฆ

A solving method of an mdp with a constraint by genetic algorithms

โœ Scribed by K Hirayama; H Kawai


Publisher
Elsevier Science
Year
2000
Tongue
English
Weight
550 KB
Volume
31
Category
Article
ISSN
0895-7177

No coin nor oath required. For personal study only.

โœฆ Synopsis


consider a discrete time Markov decision process (MDP) with a finite state space, a finite action space, and two kinds of immediate rewards. The problem is to maximize the time average reward generated by one reward stream, subject to the other reward not being smaller than a prescribed value. An MDP with a reward constraint can be solved by linear programming in the range of mixed policies. On the other hand, when we restrict ourselves to pure policies, the problem is a combinatorial problem, for which a solution has not been discovered. In this paper, we propose an approach by Genetic Algorithms (GAS) in order to obtain an effective search process and to obtain a near optimal, possibly optimal pure stationary policy. A numerical example is given to examine the efficiency of the approach proposed.


๐Ÿ“œ SIMILAR VOLUMES


96/06109 Solving the unit commitment pro
๐Ÿ“‚ Article ๐Ÿ“… 1996 ๐Ÿ› Elsevier Science โš– 194 KB

The paper presents a study of the dynamic behaviour of a static frequency converter driving a 300 MVA synchronous generator which is used in a pumped storage power plant of Taiwan Power Company. ## 96/061W A modified linear programming method for distributlon system reconfiguration Abur, A.

Restoration of gray images based on a ge
โœ Yen-Wei Chen; Zensho Nakao; Kouichi Arakaki; Xue Fang; Shinichi Tamura ๐Ÿ“‚ Article ๐Ÿ“… 1999 ๐Ÿ› Elsevier Science ๐ŸŒ English โš– 604 KB

Genetic algorithms are used for restoration of gray images. The restoration problem is modeled as an optimization problem, whose cost function is minimized based on mechanics of natural selection and natural genetics. Because the complicated a priori constraints can be easily incorporated by the app