Enhancing Q-Learning for Optimal Asset Allocation

Abstract

This paper enhances the Q-Iearning algorithm for optimal asset alloca(cid:173) tion proposed in (Neuneier, 1996 [6]). The new formulation simplifies the approach by using only one value-function for many assets and al(cid:173) lows model-free policy-iteration. After testing the new algorithm on real data, the possibility of risk management within the framework of Markov decision problems is analyzed. The proposed methods allows the construction of a multi-period portfolio management system which takes into account transaction costs, the risk preferences of the investor, and several constraints on the allocation.

Cite

Text

Neuneier. "Enhancing Q-Learning for Optimal Asset Allocation." Neural Information Processing Systems, 1997.

Markdown

[Neuneier. "Enhancing Q-Learning for Optimal Asset Allocation." Neural Information Processing Systems, 1997.](https://mlanthology.org/neurips/1997/neuneier1997neurips-enhancing/)

BibTeX

@inproceedings{neuneier1997neurips-enhancing,
  title     = {{Enhancing Q-Learning for Optimal Asset Allocation}},
  author    = {Neuneier, Ralph},
  booktitle = {Neural Information Processing Systems},
  year      = {1997},
  pages     = {936-942},
  url       = {https://mlanthology.org/neurips/1997/neuneier1997neurips-enhancing/}
}