๐”– Bobbio Scriptorium
โœฆ   LIBER   โœฆ

Backward Q-learning: The combination of Sarsa algorithm and Q-learning

โœ Scribed by Wang, Yin-Hao; Li, Tzuu-Hseng S.; Lin, Chih-Jui


Book ID
121742094
Publisher
Elsevier Science
Year
2013
Tongue
English
Weight
935 KB
Volume
26
Category
Article
ISSN
0952-1976

No coin nor oath required. For personal study only.


๐Ÿ“œ SIMILAR VOLUMES


New algorithms of the Q-learning type
โœ Shalabh Bhatnagar; K. Mohan Babu ๐Ÿ“‚ Article ๐Ÿ“… 2008 ๐Ÿ› Elsevier Science ๐ŸŒ English โš– 284 KB

We propose two algorithms for Q-learning that use the two-timescale stochastic approximation methodology. The first of these updates Q-values of all feasible state-action pairs at each instant while the second updates Q-values of states with actions chosen according to the 'current' randomized polic