Reinforcement Learning and Approximate Dynamic Programming for Feedback Control

✍ Scribed by Frank L. Lewis, Derong Liu

Publisher: Wiley-IEEE Press
Year: 2012
Tongue: English
Leaves: 633
Series: IEEE Press Series on Computational Intelligence
Edition: 1
Category: Library

No coin nor oath required. For personal study only.

✦ Synopsis

Reinforcement learning (RL) and adaptive dynamic programming (ADP) has been one of the most critical research fields in science and engineering for modern complex systems. This book describes the latest RL and ADP techniques for decision and control in human engineered systems, covering both single player decision and control and multi-player games. Edited by the pioneers of RL and ADP research, the book brings together ideas and methods from many fields and provides an important and timely guidance on controlling a wide variety of systems, such as robots, industrial processes, and economic decision-making.

✦ Table of Contents

Title page......Page 1
Contents......Page 5
Preface......Page 18
1. Reinforcement Learning and Approximate Dynamic Programming (RLADP)-Foundations, Common Misconceptions, and the Challenges Ahead......Page 26
2. Stable Adaptive Neural Control of Partially Observable Dynamic Systems......Page 54
3. Optimal Control of Unknown Nonlinear Discrete-Time Systems Using the Iterative Globalized Dual Heuristic Programming Algorithm......Page 75
4. Learning and Optimization in Hierarchical Adaptive Critic Design......Page 101
5. Single Network Adaptive Critics Networks-Development, Analysis, and Applications......Page 121
6. Linearly Solvable Optimal Control......Page 142
7. Approximating Optimal Control withValue Gradient Learning......Page 165
8. A Constrained Backpropagation Approach to Function Approximation and Approximate Dynamic Programming......Page 185
9. Toward Design of Nonlinear ADP Learning Controllers with Performance Assurance......Page 205
10. Reinforcement Learning Control with Time-Dependent Agent Dynamics......Page 226
11. Online Optimal Control of Nonaffine Nonlinear Discrete-Time Systems without Using Value and Policy Iterations......Page 244
12. An Actor-Critic-Identifier Architecture for Adaptive Approximate Optimal Control......Page 281
13. Robust Adaptive Dynamic Programming......Page 304
14. Hybrid Learning in Stochastic Games and Its Application in Network Security......Page 327
15. Integral Reinforcement Learning for Online Computation of Nash Strategies of Nonzero-Sum Differential Games......Page 352
16. Online Learning Algorithms for Optimal Control and Dynamic Games......Page 372
17. Lambda-Policy Iteration: A Review and a New Implementation......Page 401
18. Optimal Learning and Approximate Dynamic Programming......Page 430
19. An Introduction to Event-Based Optimization: Theory and Applications......Page 452
20. Bounds for Markov Decision Processes......Page 472
21. Approximate Dynamic Programming and Backpropagation on Timescales......Page 494
22. A Survey of Optimistic Planning in Markov Decision Processes......Page 514
23. Adaptive Feature Pursuit: Online Adaptation of Features in Reinforcement Learning......Page 537
24. Feature Selection for Neuro-Dynamic Programming......Page 555
25. Approximate Dynamic Programming for Optimizing Oil Production......Page 580
26. A Learning Strategy for Source Tracking in Unstructured Environments......Page 602
Index......Page 621

📜 SIMILAR VOLUMES

Reinforcement Learning and Approximate D

📁 Reinforcement Learning and Approximate Dynamic Programming for Feedback Control

✍ IEEE Press.;John Wiley;Sons.;Lewis, Frank L.;Liu, Derong 📂 Library 📅 2013 🏛 IEEE Press, John Wiley & Sons, Inc., Publication 🌐 English

Reinforcement Learning and Dynamic Progr

📁 Reinforcement Learning and Dynamic Programming Using Function Approximators (Automation and Control Engineering)

✍ Lucian Busoniu, Robert Babuska, Bart De Schutter, Damien Ernst 📂 Library 📅 2010 🌐 English

From household appliances to applications in robotics, engineered systems involving complex dynamics can only be as effective as the algorithms that control them. While Dynamic Programming (DP) has provided researchers with a way to optimally solve decision and control problems involving complex dyn

Reinforcement Learning for Optimal Feedb

📁 Reinforcement Learning for Optimal Feedback Control

✍ Rushikesh Kamalapurkar, Patrick Walters, Joel Rosenfeld, Warren Dixon 📂 Library 📅 2018 🏛 Springer International Publishing 🌐 English

Reinforcement Learning for Optimal Feedback Control develops model-based and data-driven reinforcement learning methods for solving optimal control problems in nonlinear deterministic dynamical systems. In order to achieve learning under uncertainty, data-driven methods for identifying sys

Output Feedback Reinforcement Learning C

📁 Output Feedback Reinforcement Learning Control for Linear Systems

✍ Syed Ali Asad Rizvi, Zongli Lin 📂 Library 📅 2022 🏛 Birkhäuser 🌐 English

This monograph explores the analysis and design of model-free optimal control systems based on reinforcement learning (RL) theory, presenting new methods that overcome recent challenges faced by RL. New developments in the design of sensor data efficient RL algorithms are demonstrated that no

Handbook of Learning and Approximate Dyn

📁 Handbook of Learning and Approximate Dynamic Programming

✍ Jennie Si, Andy Barto, Warren Powell, Donald Wunsch(auth.) 📂 Library 📅 2004 🏛 Wiley-IEEE Press 🌐 English

<ul><li>A complete resource to Approximate Dynamic Programming (ADP), including on-line simulation code <li>Provides a tutorial that readers can use to start implementing the learning algorithms provided in the book <li>Includes ideas, directions, and recent results on current research issues and ad

Reinforcement Learning Aided Performance

📁 Reinforcement Learning Aided Performance Optimization of Feedback Control Systems

✍ Changsheng Hua 📂 Library 📅 2021 🏛 Springer Vieweg 🌐 English

Changsheng Hua proposes two approaches, an input/output recovery approach and a performance index-based approach for robustness and performance optimization of feedback control systems. For their data-driven implementation in deterministic and stochastic systems, the author develops Q-learning an