Performance functions and reinforcement learning for trading systems and portfolios

✍ Scribed by John Moody; Lizhong Wu; Yuansong Liao; Matthew Saffell

Book ID: 101285838
Publisher: John Wiley and Sons
Year: 1998
Tongue: English
Weight: 433 KB
Volume: 17
Category: Article
ISSN: 0277-6693
DOI: 10.1002/(sici)1099-131x(1998090)17:5/6<441::aid-for707>3.0.co;2-#

✦ Synopsis

We propose to train trading systems and portfolios by optimizing objective functions that directly measure trading and investment performance. Rather than basing a trading system on forecasts or training via a supervised learning algorithm using labelled trading data, we train our systems using recurrent reinforcement learning (RRL) algorithms. The performance functions that we consider for reinforcement learning are profit or wealth, economic utility, the Sharpe ratio and our proposed differential Sharpe ratio. The trading and portfolio management systems require prior decisions as input in order to properly take into account the effects of transactions costs, market impact, and taxes. This temporal dependence on system state requires the use of reinforcement versions of standard recurrent learning algorithms. We present empirical results in controlled experiments that demonstrate the efficacy of some of our methods for optimizing trading systems and portfolios. For a long/short trader, we find that maximizing the differential Sharpe ratio yields more consistent results than maximizing profits, and that both methods outperform a trading system based on forecasts that minimize MSE. We find that portfolio traders trained to maximize the differential Sharpe ratio achieve better risk-adjusted returns than those trained to maximize profit. Finally, we provide simulation results for an S&P 500/TBill asset allocation system that demonstrate the presence of out-of-sample predictability in the monthly S&P 500 stock index for the 25-year period 1970 through 1994.
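
The synopsis names the differential Sharpe ratio but does not give its formula. As a rough illustration only, the sketch below uses the form commonly cited for this quantity: exponential moving estimates A and B of the first and second moments of returns, with the per-step value D_t = (B_{t-1} ΔA_t - ½ A_{t-1} ΔB_t) / (B_{t-1} - A_{t-1}²)^{3/2}. The function name, the adaptation rate `eta`, the zero initial moments, and the choice to return 0 while the variance estimate is not yet positive are all assumptions made for this example, not details taken from the article.

```python
def differential_sharpe(returns, eta=0.01, a0=0.0, b0=0.0):
    """Per-step differential Sharpe ratio for a sequence of returns.

    Assumed form (not stated in the synopsis):
      dA_t = R_t - A_{t-1},  dB_t = R_t**2 - B_{t-1}
      D_t  = (B_{t-1}*dA_t - 0.5*A_{t-1}*dB_t) / (B_{t-1} - A_{t-1}**2)**1.5
      A_t  = A_{t-1} + eta*dA_t,  B_t = B_{t-1} + eta*dB_t
    """
    a, b = a0, b0          # moving estimates of E[R] and E[R^2]
    values = []
    for r in returns:
        delta_a = r - a
        delta_b = r * r - b
        variance = b - a * a
        if variance > 0.0:
            d = (b * delta_a - 0.5 * a * delta_b) / variance ** 1.5
        else:
            # Assumption: emit 0 until the variance estimate is positive.
            d = 0.0
        values.append(d)
        # Update the moment estimates only after D_t has been computed,
        # so D_t depends on A_{t-1} and B_{t-1}.
        a += eta * delta_a
        b += eta * delta_b
    return values


if __name__ == "__main__":
    # Hypothetical returns, purely for illustration.
    print(differential_sharpe([0.01, -0.02, 0.03], eta=0.5))
```

Because each D_t depends only on the current return and the previous moment estimates, a quantity of this kind can be evaluated one step at a time, which is what makes it a candidate objective for the online, recurrent training the synopsis describes.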

📜 SIMILAR VOLUMES

Opportunities for multiagent systems and multiagent reinforcement learning in traffic control

✍ Ana L. C. Bazzan 📂 Article 📅 2008 🏛 Springer US 🌐 English ⚖ 372 KB

Evolutionary learning, reinforcement learning, and fuzzy rules for knowledge acquisition in agent-based systems

✍ Bonarini, A. 📂 Article 📅 2001 🏛 IEEE 🌐 English ⚖ 176 KB

Reinforcement learning for the adaptive control of nonlinear systems

✍ Zomaya, A.Y. 📂 Article 📅 1994 🏛 Institute of Electrical and Electronics Engineers ⚖ 666 KB

Comparing Policy Gradient and Value Function Based Reinforcement Learning Methods in Simulated Electrical Power Trade

✍ Lincoln, R.; Galloway, S.; Stephen, B.; Burt, G. 📂 Article 📅 2012 🏛 IEEE 🌐 English ⚖ 281 KB

Drive-reinforcement learning and hierarchical networks of control systems as models of nervous system function

✍ A.Harry Klopf 📂 Article 📅 1997 🏛 Elsevier Science 🌐 English ⚖ 239 KB

A unified framework for reinforcement learning, co-learning and meta-learning how to coordinate in collaborative multi-agent systems

✍ Predrag T. Tošić; Ricardo Vilalta 📂 Article 📅 2010 🏛 Elsevier 🌐 English ⚖ 353 KB