Scalable Offline Model-Based RL with Action Chunks

Park, Kwanyoung; Park, Seohong; Lee, Youngwoon; Levine, Sergey

Scalable Offline Model-Based RL with Action Chunks

Kwanyoung Park, Seohong Park, Youngwoon Lee, Sergey Levine

ICLR 2026

/iclr/2026/park2026iclr-scalable/

Abstract

In this paper, we study whether model-based reinforcement learning (RL), in particular model-based value expansion, can provide a scalable recipe for tackling complex, long-horizon tasks in offline RL. Model-based value expansion fits an on-policy value function using length-$n$ imaginary rollouts generated by the current policy and a learned dynamics model. While larger $n$ reduces bias in value bootstrapping, it amplifies accumulated model errors over long horizons, degrading future predictions. We address this trade-off with an *action-chunk* model that predicts a future state from a sequence of actions (an "action chunk") instead of a single action, which reduces compounding errors. In addition, instead of directly training a policy to maximize rewards, we employ rejection sampling from an expressive behavioral action-chunk policy, which prevents model exploitation from out-of-distribution actions. We call this recipe **Model-Based RL with Action Chunks (MAC)**. Through experiments on highly challenging tasks with large-scale datasets of up to $100$M transitions, we show that MAC achieves the best performance among offline model-based RL algorithms, especially on challenging long-horizon tasks.

PDF ICLR OpenReview Semantic Scholar

Cite

Text

Park et al. "Scalable Offline Model-Based RL with Action Chunks." International Conference on Learning Representations, 2026.

Markdown

[Park et al. "Scalable Offline Model-Based RL with Action Chunks." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/park2026iclr-scalable/)

BibTeX

@inproceedings{park2026iclr-scalable,
  title     = {{Scalable Offline Model-Based RL with Action Chunks}},
  author    = {Park, Kwanyoung and Park, Seohong and Lee, Youngwoon and Levine, Sergey},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/park2026iclr-scalable/}
}