When a Robot Is More Capable than a Human: Learning from Constrained Demonstrators

Li, Xinhu; Jain, Ayush; Yang, Zhaojing; Korkmaz, Yigit; Biyik, Erdem

When a Robot Is More Capable than a Human: Learning from Constrained Demonstrators

Xinhu Li, Ayush Jain, Zhaojing Yang, Yigit Korkmaz, Erdem Biyik

ICLR 2026

/iclr/2026/li2026iclr-robot/

Abstract

Learning from demonstrations enables experts to teach robots complex tasks using interfaces such as kinesthetic teaching, joystick control, and sim-to-real transfer. However, these interfaces often constrain the expert's ability to demonstrate optimal behavior due to indirect control, setup restrictions, and hardware safety. For example, a joystick can move a robotic arm only in a 2D plane, even though the robot operates in a higher-dimensional space. As a result, the demonstrations collected by constrained experts lead to suboptimal performance of the learned policies. This raises a key question: Can a robot learn a better policy than the one demonstrated by a constrained expert? We address this by allowing the agent to go beyond direct imitation of expert actions and explore shorter and more efficient trajectories. We use the demonstrations to infer a state-only reward signal that measures task progress, and self-label reward for unknown states using temporal interpolation. Our approach outperforms common imitation learning in both sample efficiency and task completion time. On a real WidowX robotic arm, it completes the task in 11 seconds, 10x faster than behavioral cloning.

PDF ICLR OpenReview Semantic Scholar

Cite

Text

Li et al. "When a Robot Is More Capable than a Human: Learning from Constrained Demonstrators." International Conference on Learning Representations, 2026.

Markdown

[Li et al. "When a Robot Is More Capable than a Human: Learning from Constrained Demonstrators." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/li2026iclr-robot/)

BibTeX

@inproceedings{li2026iclr-robot,
  title     = {{When a Robot Is More Capable than a Human: Learning from Constrained Demonstrators}},
  author    = {Li, Xinhu and Jain, Ayush and Yang, Zhaojing and Korkmaz, Yigit and Biyik, Erdem},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/li2026iclr-robot/}
}