𝔖 Bobbio Scriptorium
✦   LIBER   ✦

BOUNDED-PARAMETER PARTIALLY OBSERVABLE MARKOV DECISION PROCESSES: FRAMEWORK AND ALGORITHM

✍ Scribed by NI, YAODONG; LIU, ZHI-QIANG


Book ID
121835237
Publisher
World Scientific Publishing Company
Year
2013
Tongue
English
Weight
630 KB
Volume
21
Category
Article
ISSN
0218-4885

No coin nor oath required. For personal study only.


πŸ“œ SIMILAR VOLUMES


[Adaptation, Learning, and Optimization]
✍ Wiering, Marco; van Otterlo, Martijn πŸ“‚ Article πŸ“… 2012 πŸ› Springer Berlin Heidelberg 🌐 German βš– 624 KB

Reinforcement learning encompasses both a science of adaptive behavior of rational beings in uncertain environments and a computational methodology for finding optimal behaviors for challenging problems in control, optimization and adaptive behavior of intelligent agents. As a field, reinforcement l