Policy Contrastive Decoding for Robotic Foundation Models

Wu, Shihan; Luo, Xu; Zhang, Ji; Xie, Junlin; Song, Jingkuan; Shen, Heng Tao; Gao, Lianli

Policy Contrastive Decoding for Robotic Foundation Models

Shihan Wu, Xu Luo, Ji Zhang, Junlin Xie, Jingkuan Song, Heng Tao Shen, Lianli Gao

ICLR 2026

/iclr/2026/wu2026iclr-policy/

Abstract

Generalist robot policies, or robotic foundation models, hold immense potential to enable flexible, general-purpose and dexterous robotic systems. Despite their advancements, our empirical experiments reveal that existing robot policies are prone to learning spurious correlations from pre-training trajectories, adversely affecting their generalization capabilities during inference. To tackle this, we propose a novel Policy Contrastive Decoding (PCD) approach, which redirects the robot policy’s focus toward object-relevant visual clues by contrasting action probability distributions derived from original and object-masked visual inputs. As a training-free method, our PCD can be used as a plugin to improve different types of robot policies without needing to finetune or access model weights. We conduct extensive experiments on top of three open-source robot policies, including the autoregressive policy OpenVLA and the diffusion-based policies Octo and Pi-0. The obtained results in both simulation and real-world environments prove PCD’s flexibility and effectiveness, e.g., PCD enhances the state-of-the-art policy $\pi_0$ by 8.9% in the simulation environment and by 108% in the real-world environment. Our code is publicly available at: https://github.com/pcd-robot/PCD.

PDF ICLR OpenReview Semantic Scholar

Cite

Text

Wu et al. "Policy Contrastive Decoding for Robotic Foundation Models." International Conference on Learning Representations, 2026.

Markdown

[Wu et al. "Policy Contrastive Decoding for Robotic Foundation Models." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/wu2026iclr-policy/)

BibTeX

@inproceedings{wu2026iclr-policy,
  title     = {{Policy Contrastive Decoding for Robotic Foundation Models}},
  author    = {Wu, Shihan and Luo, Xu and Zhang, Ji and Xie, Junlin and Song, Jingkuan and Shen, Heng Tao and Gao, Lianli},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/wu2026iclr-policy/}
}