𝔖 Bobbio Scriptorium
✦   LIBER   ✦

Linear stochastic approximation driven by slowly varying Markov chains

✍ Scribed by Vijay R. Konda; John N. Tsitsiklis


Publisher
Elsevier Science
Year
2003
Tongue
English
Weight
241 KB
Volume
50
Category
Article
ISSN
0167-6911

No coin nor oath required. For personal study only.

✦ Synopsis


We study a linear stochastic approximation algorithm that arises in the context of reinforcement learning. The algorithm employs a decreasing step-size, and is driven by Markov noise with time-varying statistics. We show that under suitable conditions, the algorithm can track the changes in the statistics of the Markov noise, as long as these changes are slower than the rate at which the step-size of the algorithm goes to zero.