Self-Improving Loops for Visual Robotic Planning

Luo, Calvin; Zeng, Zilai; Jia, Mingxi; Du, Yilun; Sun, Chen

Self-Improving Loops for Visual Robotic Planning

Calvin Luo, Zilai Zeng, Mingxi Jia, Yilun Du, Chen Sun

ICLR 2026

/iclr/2026/luo2026iclr-selfimproving/

Abstract

Video generative models trained on expert demonstrations have been utilized as performant text-conditioned visual planners for solving robotic tasks. However, generalization to unseen tasks remains a challenge. Whereas improved generalization may be facilitated by leveraging learned prior knowledge from additional pre-collected offline data sources, such as web-scale video datasets, in the era of experience we aim to design agents that can continuously improve in an online manner from self-collected behaviors. In this work we thus propose the Self-Improving Loops for Visual Robotic Planning (SILVR), where an in-domain video model iteratively updates itself on self-produced trajectories, and steadily improves its performance for a specified task of interest. We apply SILVR to a diverse suite of MetaWorld tasks, as well as two manipulation tasks on a real robot arm, and find that performance improvements continuously emerge over multiple iterations for novel tasks unseen during initial in-domain video model training. We demonstrate that SILVR is robust in the absence of human-provided ground-truth reward functions or expert-quality demonstrations, and is preferable to alternate approaches that utilize online experience in terms of performance and sample efficiency.

PDF ICLR OpenReview Semantic Scholar

Cite

Text

Luo et al. "Self-Improving Loops for Visual Robotic Planning." International Conference on Learning Representations, 2026.

Markdown

[Luo et al. "Self-Improving Loops for Visual Robotic Planning." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/luo2026iclr-selfimproving/)

BibTeX

@inproceedings{luo2026iclr-selfimproving,
  title     = {{Self-Improving Loops for Visual Robotic Planning}},
  author    = {Luo, Calvin and Zeng, Zilai and Jia, Mingxi and Du, Yilun and Sun, Chen},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/luo2026iclr-selfimproving/}
}