Enhancing Portfolio Optimization via Heuristic-Guided Inverse Reinforcement Learning with Multi-Objective Reward and Graph-Based Policy Learning

Abstract

Portfolio optimization faces persistent challenges in adapting to dynamic markets because of static assumptions and high-dimensional decision spaces. Although reinforcement learning (RL) has emerged as a potential solution, conventional reward engineering often fails to capture complex market dynamics. Recent advances in deep RL and graph neural networks have attempted to enhance market microstructure modeling, but these methods still struggle to integrate financial knowledge systematically. To address these issues, we propose a novel heuristic-guided inverse reinforcement learning framework for portfolio optimization. Specifically, our framework provides an interpretable expert strategy generation mechanism that accounts for sector diversification and correlation constraints. A multi-objective reward optimization method then adaptively balances returns and risks. Furthermore, the framework uses heterogeneous graph policy learning with hierarchical attention mechanisms to explicitly model inter-stock relationships. Finally, we conduct extensive experiments on real-world financial market data, which show that our framework outperforms several state-of-the-art deep learning and RL baselines in terms of risk-adjusted returns. We also provide case studies that showcase the framework's ability to balance return maximization and risk containment. Our code is publicly available at https://github.com/ChloeWenyiZhang/SmartFolio/.
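As a rough illustration of what balancing returns against risks in a single reward can look like, the sketch below scalarizes the two objectives with a fixed weight. It is not the method from the paper, which adapts the balance through multi-objective reward optimization. The function name `scalarized_reward`, the `risk_aversion` parameter, and the choice of portfolio variance as the risk term are all assumptions made for this example.

```python
def scalarized_reward(weights, asset_returns, cov, risk_aversion=0.5):
    """Hypothetical reward: portfolio return minus a weighted variance penalty.

    weights        -- portfolio weights, one per asset
    asset_returns  -- per-asset returns for the period
    cov            -- covariance matrix of asset returns (list of rows)
    risk_aversion  -- assumed fixed trade-off weight; larger values penalize risk more
    """
    n = len(weights)
    # Return objective: weighted sum of asset returns.
    portfolio_return = sum(w * r for w, r in zip(weights, asset_returns))
    # Risk objective: portfolio variance w^T * cov * w.
    portfolio_variance = sum(
        weights[i] * cov[i][j] * weights[j] for i in range(n) for j in range(n)
    )
    return portfolio_return - risk_aversion * portfolio_variance


# Illustrative inputs only; these numbers are made up for the example.
example_weights = [0.5, 0.5]
example_returns = [0.02, 0.04]
example_cov = [[0.04, 0.0], [0.0, 0.16]]
example_reward = scalarized_reward(example_weights, example_returns, example_cov)
```

With `risk_aversion=0` the reward reduces to the plain portfolio return, and raising it shifts the trade-off toward risk containment. A fixed weight is the simplest possible choice and is used here only to make the trade-off concrete.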

Cite

Text

Zhang et al. "Enhancing Portfolio Optimization via Heuristic-Guided Inverse Reinforcement Learning with Multi-Objective Reward and Graph-Based Policy Learning." International Joint Conference on Artificial Intelligence, 2025. doi:10.24963/IJCAI.2025/1054

Markdown

[Zhang et al. "Enhancing Portfolio Optimization via Heuristic-Guided Inverse Reinforcement Learning with Multi-Objective Reward and Graph-Based Policy Learning." International Joint Conference on Artificial Intelligence, 2025.](https://mlanthology.org/ijcai/2025/zhang2025ijcai-enhancing/) doi:10.24963/IJCAI.2025/1054

BibTeX

@inproceedings{zhang2025ijcai-enhancing,
  title     = {{Enhancing Portfolio Optimization via Heuristic-Guided Inverse Reinforcement Learning with Multi-Objective Reward and Graph-Based Policy Learning}},
  author    = {Zhang, Wenyi and Jia, Renjun and Wang, Yanhao and Cheng, Dawei and Zhao, Minghao and Chen, Cen},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {9483--9491},
  doi       = {10.24963/IJCAI.2025/1054},
  url       = {https://mlanthology.org/ijcai/2025/zhang2025ijcai-enhancing/}
}