𝔖 Bobbio Scriptorium
✦   LIBER   ✦

Constructive reinforcement learning

✍ Scribed by Jose Hernandez-Orallo


Publisher
John Wiley and Sons
Year
2000
Tongue
English
Weight
163 KB
Volume
15
Category
Article
ISSN
0884-8173

No coin nor oath required. For personal study only.

✦ Synopsis


This paper presents an operative measure of reinforcement for constructive learning methods, i.e., eager learning methods using highly expressive or universal representation languages. These evaluation tools allow further insight into the study of the growth of knowledge, theory revision, and abduction. The final approach is based on an apportionment of credit with respect to the "course" that the evidence takes through the learned theory. Our measure of reinforcement is shown to be justified by cross-validation and by its connection with other successful evaluation criteria, such as the minimum description length (MDL) principle. Finally, the relation to the classical view of reinforcement, in which the actions of an intelligent system can be rewarded or penalized, is studied, and we discuss whether this should affect our distribution of reinforcement. The most important result of this paper is that the way we distribute reinforcement over knowledge yields a rated ontology instead of a single prior distribution. This detailed information can therefore be exploited to guide the search space of inductive learning algorithms. Likewise, knowledge revision can be directed to the part of the theory that is not justified by the evidence.
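The apportionment-of-credit idea in the synopsis can be pictured with a minimal sketch. This is not the paper's algorithm: the theory representation, the rule names, and the equal-split rule below are all assumptions made purely for illustration. Each piece of evidence contributes one unit of reinforcement, divided among the rules on the "course" it takes through the theory, so rules that cover more evidence accumulate a higher rating.

```python
# Illustrative sketch (hypothetical, not the paper's method): distribute
# reinforcement over the rules of a learned theory according to the
# "course" each piece of evidence takes through it.
from collections import defaultdict

def apportion_credit(evidence_courses):
    """evidence_courses: one rule-name sequence per example, listing the
    rules used to cover that example. Each example carries one unit of
    reinforcement, split equally among the rules on its course."""
    credit = defaultdict(float)
    for course in evidence_courses:
        if not course:
            continue  # uncovered example: no rule earns credit
        share = 1.0 / len(course)
        for rule in course:
            credit[rule] += share
    return dict(credit)

# Hypothetical three-rule theory: one example uses r1 and r2 together,
# one is covered by r1 alone, and one by r3 alone.
courses = [["r1", "r2"], ["r1"], ["r3"]]
print(apportion_credit(courses))
# → {'r1': 1.5, 'r2': 0.5, 'r3': 1.0}
```

The resulting per-rule ratings form the "rated ontology" the synopsis mentions: rather than a single prior over whole theories, each part of the theory carries its own degree of support from the evidence, which can then guide search or target revision at the weakly supported parts.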


📜 SIMILAR VOLUMES


Distributed reinforcement learning
✍ Gerhard Weiß 📂 Article 📅 1995 🏛 Elsevier Science 🌐 English ⚖ 623 KB
Networked reinforcement learning
✍ Makito Oku; Kazuyuki Aihara 📂 Article 📅 2008 🏛 Springer Japan 🌐 English ⚖ 433 KB
Two steps reinforcement learning
✍ Fernando Fernández; Daniel Borrajo 📂 Article 📅 2008 🏛 John Wiley and Sons 🌐 English ⚖ 977 KB

When applying reinforcement learning in domains with very large or continuous state spaces, the experience obtained by the learning agent in its interaction with the environment must be generalized. Generalization methods are usually based on the approximation of the value functions used to comp