𝔖 Bobbio Scriptorium
✦   LIBER   ✦

Reinforcement learning for robot soccer

✍ Scribed by Martin Riedmiller; Thomas Gabel; Roland Hafner; Sascha Lange


Publisher
Springer US
Year
2009
Tongue
English
Weight
949 KB
Volume
27
Category
Article
ISSN
0929-5593

No coin nor oath required. For personal study only.


πŸ“œ SIMILAR VOLUMES


Modular Q-learning based multi-agent coo
✍ Kui-Hong Park; Yong-Jae Kim; Jong-Hwan Kim πŸ“‚ Article πŸ“… 2001 πŸ› Elsevier Science 🌐 English βš– 532 KB

In a multi-agent system, action selection is important for the cooperation and coordination among agents. As the environment is dynamic and complex, modular Q-learning, which is one of the reinforcement learning schemes, is employed in assigning a proper action to an agent in the multi-agent system.

Learning to avoid collisions: A reinforc
✍ B.J.A. KrΓΆse; J.W.M. van Dam πŸ“‚ Article πŸ“… 1992 πŸ› Elsevier Science βš– 595 KB

The paper describes a self-learning control system for a mobile robot. Based on sensor information the control system has to provide a steering signal in such a way that collisions are avoided. Since in our case no 'examples' are available, the system learns on the basis of an external reinforcement

A reinforcement learning adaptive fuzzy
✍ Chuan-Kai Lin πŸ“‚ Article πŸ“… 2003 πŸ› Elsevier Science 🌐 English βš– 233 KB

In this paper, a new reinforcement learning scheme is developed for a class of serial-link robot arms. Traditional reinforcement learning is the problem faced by an agent that must learn behavior through trial-and-error interactions with a dynamic environment. In the proposed reinforcement learning

Robot arm reaching through neural invers
✍ Pedro Martı́n; JosΓ© del R. MillΓ‘n πŸ“‚ Article πŸ“… 2000 πŸ› Elsevier Science 🌐 English βš– 601 KB

We present a neural method that computes the inverse kinematics of any kind of robot manipulators, both redundant and non-redundant. Inverse kinematics solutions are obtained through the inversion of a neural network that has been previously trained to approximate the manipulator forward kinematics.