Domain decomposition: Parallel multileve
Type: Article
Year: 1997
Publisher: Elsevier Science
Language: English
Size: 98 KB
policies without measuring merits (P. Dayan and S.P. Singh). Memory-based stochastic optimization (A.W. Moore and J. Schneider). Temporal difference learning in continuous time and space (K. Doya). Reinforcement learning by probability matching (P.N. Sabes and M.I. Jordan). Author index.