✦ LIBER ✦

Backward Q-learning: The combination of Sarsa algorithm and Q-learning

✍ Scribed by Wang, Yin-Hao; Li, Tzuu-Hseng S.; Lin, Chih-Jui

Book ID: 121742094
Publisher: Elsevier Science
Year: 2013
Tongue: English
Weight: 935 KB
Volume: 26
Category: Article
ISSN: 0952-1976
DOI: 10.1016/j.engappai.2013.06.016

📜 SIMILAR VOLUMES

New algorithms of the Q-learning type

✍ Shalabh Bhatnagar; K. Mohan Babu 📂 Article 📅 2008 🏛 Elsevier Science 🌐 English ⚖ 284 KB

We propose two algorithms for Q-learning that use the two-timescale stochastic approximation methodology. The first of these updates Q-values of all feasible state-action pairs at each instant while the second updates Q-values of states with actions chosen according to the 'current' randomized policy...

Absolute Expediency of Q- and S-Model Learning Algorithms

✍ Lakshmivarahan, S.; Thathachar, M. A. L. 📂 Article 📅 1976 🏛 Institute of Electrical and Electronics Engineers ⚖ 1022 KB

Encyclopedia of the Sciences of Learning || Q-Learning

✍ Seel, Norbert M. 📂 Article 📅 2012 🏛 Springer US ⚖ 246 KB

The analysis and performance evaluation of the pheromone-Q-learning algorithm

✍ N. Monekosso; P. Remagnino 📂 Article 📅 2004 🏛 John Wiley and Sons 🌐 English ⚖ 367 KB

A new Q-learning algorithm based on the metropolis criterion

✍ Maozu Guo; Yang Liu; Malec, J. 📂 Article 📅 2004 🏛 IEEE 🌐 English ⚖ 200 KB

A comparison of learning performance in two-dimensional Q-learning by the difference of Q-values alignment

✍ Kathy Thi Aung; Takayasu Fuchida 📂 Article 📅 2012 🏛 Springer Japan 🌐 English ⚖ 408 KB