Q-Function Decomposition with Intervention Semantics for Factored Action Spaces

Junkyu Lee, Tian Gao, Elliot Nelson, Miao Liu, Debarun Bhattacharjya, Songtao Lu

AISTATS 2025 pp. 1027-1035

/aistats/2025/lee2025aistats-qfunction/

Abstract

Many practical reinforcement learning environments have a discrete factored action space that induces a large combinatorial set of actions, thereby posing significant challenges. Existing approaches leverage the regular structure of the action space and resort to a linear decomposition of Q-functions, which avoids enumerating all combinations of factored actions. In this paper, we consider Q-functions defined over a lower dimensional projected subspace of the original action space, and study the condition for the unbiasedness of decomposed Q-functions using causal effect estimation from the no unobserved confounder setting in causal statistics. This leads to a general scheme which we call action decomposed reinforcement learning that uses the projected Q-functions to approximate the Q-function in standard model-free reinforcement learning algorithms. The proposed approach is shown to improve sample complexity in a model-based reinforcement learning setting. We demonstrate improvements in sample efficiency compared to state-of-the-art baselines in online continuous control environments and a real-world offline sepsis treatment environment.

PDF AISTATS OpenReview Semantic Scholar

Cite

Text

Lee et al. "Q-Function Decomposition with Intervention Semantics for Factored Action Spaces." Proceedings of The 28th International Conference on Artificial Intelligence and Statistics, 2025.

Markdown

[Lee et al. "Q-Function Decomposition with Intervention Semantics for Factored Action Spaces." Proceedings of The 28th International Conference on Artificial Intelligence and Statistics, 2025.](https://mlanthology.org/aistats/2025/lee2025aistats-qfunction/)

BibTeX

@inproceedings{lee2025aistats-qfunction,
  title     = {{Q-Function Decomposition with Intervention Semantics for Factored Action Spaces}},
  author    = {Lee, Junkyu and Gao, Tian and Nelson, Elliot and Liu, Miao and Bhattacharjya, Debarun and Lu, Songtao},
  booktitle = {Proceedings of The 28th International Conference on Artificial Intelligence and Statistics},
  year      = {2025},
  pages     = {1027-1035},
  volume    = {258},
  url       = {https://mlanthology.org/aistats/2025/lee2025aistats-qfunction/}
}