Acquisition of stand-up behavior by a re
β
Jun Morimoto; Kenji Doya
π
Article
π
2001
π
Elsevier Science
π
English
β 564 KB
In this paper, we propose a hierarchical reinforcement learning architecture that realizes practical learning speed in real hardware control tasks. In order to enable learning in a practical number of trials, we introduce a low-dimensional representation of the state of the robot for higher-level pl