Enhancing Portfolio Optimization via Heuristic-Guided Inverse Reinforcement Learning with Multi-Objective Reward and Graph-Based Policy Learning

Zhang, Wenyi; Jia, Renjun; Wang, Yanhao; Cheng, Dawei; Zhao, Minghao; Chen, Cen

doi:10.24963/IJCAI.2025/1054

IJCAI 2025, pp. 9483-9491

Abstract

Portfolio optimization encounters persistent challenges in adapting to dynamic markets due to static assumptions and high-dimensional decision spaces. Although reinforcement learning (RL) has emerged as a potential solution, conventional reward engineering often fails to capture complex market dynamics. Recent advances in deep RL and graph neural networks have attempted to enhance market microstructure modeling. However, these methods still struggle with the systematic integration of financial knowledge. To address the above issues, we propose a novel heuristic-guided inverse reinforcement learning framework for portfolio optimization. Specifically, our framework provides an interpretable expert strategy generation mechanism that takes into account sector diversification and correlation constraints. Then, a multi-objective reward optimization method is adopted to adaptively strike a balance between returns and risks. Furthermore, it also utilizes heterogeneous graph policy learning with hierarchical attention mechanisms to explicitly model inter-stock relationships. Finally, we conduct extensive experiments on real-world financial market data to demonstrate that our framework outperforms several state-of-the-art deep learning and RL baselines in terms of risk-adjusted returns. We provide case studies to showcase the ability of our framework to balance return maximization and risk containment. Our code is publicly available at https://github.com/ChloeWenyiZhang/SmartFolio/.

Cite

Text

Zhang et al. "Enhancing Portfolio Optimization via Heuristic-Guided Inverse Reinforcement Learning with Multi-Objective Reward and Graph-Based Policy Learning." International Joint Conference on Artificial Intelligence, 2025. doi:10.24963/IJCAI.2025/1054

Markdown

[Zhang et al. "Enhancing Portfolio Optimization via Heuristic-Guided Inverse Reinforcement Learning with Multi-Objective Reward and Graph-Based Policy Learning." International Joint Conference on Artificial Intelligence, 2025.](https://mlanthology.org/ijcai/2025/zhang2025ijcai-enhancing/) doi:10.24963/IJCAI.2025/1054

BibTeX

@inproceedings{zhang2025ijcai-enhancing,
  title     = {{Enhancing Portfolio Optimization via Heuristic-Guided Inverse Reinforcement Learning with Multi-Objective Reward and Graph-Based Policy Learning}},
  author    = {Zhang, Wenyi and Jia, Renjun and Wang, Yanhao and Cheng, Dawei and Zhao, Minghao and Chen, Cen},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {9483--9491},
  doi       = {10.24963/IJCAI.2025/1054},
  url       = {https://mlanthology.org/ijcai/2025/zhang2025ijcai-enhancing/}
}