Shift Before You Learn: Enabling Low-Rank Representations in Reinforcement Learning

Abstract

Low-rank structure is a common implicit assumption in many modern reinforcement learning (RL) algorithms. For instance, reward-free and goal-conditioned RL methods often presume that the successor measure admits a low-rank representation. In this work, we challenge this assumption by first remarking that the successor measure itself is not approximately low-rank. Instead, we demonstrate that a low-rank structure naturally emerges in the shifted successor measure, which captures the system dynamics after bypassing a few initial transitions. We provide finite-sample performance guarantees for the entry-wise estimation of a low-rank approximation of the shifted successor measure from sampled entries. Our analysis reveals that both the approximation and estimation errors are primarily governed by a newly introduced quantitity: the spectral recoverability of the corresponding matrix. To bound this parameter, we derive a new class of functional inequalities for Markov chains that we call Type II Poincaré inequalities and from which we can quantify the amount of shift needed for effective low-rank approximation and estimation. This analysis shows in particular that the required shift depends on decay of the high-order singular values of the shifted successor measure and is hence typically small in practice. Additionally, we establish a connection between the necessary shift and the local mixing properties of the underlying dynamical system, which provides a natural way of selecting the shift. Finally, we validate our theoretical findings with experiments, and demonstrate that shifting the successor measure indeed leads to improved performance in goal-conditioned RL.

Cite

Text

Dubail et al. "Shift Before You Learn: Enabling Low-Rank Representations in Reinforcement Learning." Advances in Neural Information Processing Systems, 2025.

Markdown

[Dubail et al. "Shift Before You Learn: Enabling Low-Rank Representations in Reinforcement Learning." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/dubail2025neurips-shift/)

BibTeX

@inproceedings{dubail2025neurips-shift,
  title     = {{Shift Before You Learn: Enabling Low-Rank Representations in Reinforcement Learning}},
  author    = {Dubail, Bastien and Stojanovic, Stefan and Proutiere, Alexandre},
  booktitle = {Advances in Neural Information Processing Systems},
  year      = {2025},
  url       = {https://mlanthology.org/neurips/2025/dubail2025neurips-shift/}
}