Sim2Real Viewpoint Invariant Visual Servoing by Recurrent Control

Abstract

Humans are remarkably proficient at controlling their limbs and tools from a wide range of viewpoints. In robotics, this ability is referred to as visual servoing: moving a tool or end-point to a desired location using primarily visual feedback. In this paper, we propose learning viewpoint invariant visual servoing skills in a robot manipulation task. We train a deep recurrent controller that can automatically determine which actions move the end-effector of a robotic arm to a desired object. This problem is fundamentally ambiguous: under severe variation in viewpoint, it may be impossible to determine the actions in a single feedforward operation. Instead, our visual servoing approach uses its memory of past movements to understand how the actions affect the robot motion from the current viewpoint, correcting mistakes and gradually moving closer to the target. This ability is in stark contrast to previous visual servoing methods, which assume known dynamics or require a calibration phase. We learn our recurrent controller using simulated data, synthetic demonstrations and reinforcement learning. We then describe how the resulting model can be transferred to a real-world robot by disentangling perception from control and only adapting the visual layers. The adapted model can servo to previously unseen objects from novel viewpoints on a real-world Kuka IIWA robotic arm. For supplementary videos, see: https://www.youtube.com/watch?v=oLgM2Bnb7fo
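The ambiguity described in the abstract can be illustrated with a toy example. The sketch below is not the paper's method (which trains a deep recurrent network); it is a hand-written 2D stand-in under strong assumptions: the unknown viewpoint is modeled as a single planar rotation `theta`, and the controller's "memory" is just an estimate `theta_hat` recovered from the previous action and the displacement it produced in the image. All names (`rotate`, `servo`, `theta_hat`, `gain`) are hypothetical and chosen for illustration only.

```python
import math


def rotate(v, theta):
    """Rotate a 2D vector v by theta radians."""
    c, s = math.cos(theta), math.sin(theta)
    return (c * v[0] - s * v[1], s * v[0] + c * v[1])


def servo(theta, use_memory, steps=30, gain=0.5):
    """Toy servoing loop; returns the final distance to the target.

    theta      -- unknown viewpoint rotation between commanded actions and
                  the motion observed in the image (an assumption of this toy)
    use_memory -- if True, re-estimate the rotation from the last action and
                  its observed effect; if False, always assume no rotation
    """
    pos = (0.0, 0.0)
    target = (1.0, 1.0)
    theta_hat = 0.0  # current belief about the viewpoint rotation

    for _ in range(steps):
        err = (target[0] - pos[0], target[1] - pos[1])
        desired = (gain * err[0], gain * err[1])  # desired image-space step

        # Undo the believed rotation so the observed motion matches `desired`.
        action = rotate(desired, -theta_hat)

        # The environment applies the true (hidden) rotation.
        observed = rotate(action, theta)
        pos = (pos[0] + observed[0], pos[1] + observed[1])

        # Memory: compare what was commanded with what was seen.
        if use_memory and math.hypot(action[0], action[1]) > 1e-12:
            theta_hat = (math.atan2(observed[1], observed[0])
                         - math.atan2(action[1], action[0]))

    return math.hypot(target[0] - pos[0], target[1] - pos[1])


if __name__ == "__main__":
    view = math.radians(150)  # a severe viewpoint change
    print("memoryless error: ", servo(view, use_memory=False))
    print("with memory error:", servo(view, use_memory=True))
```

In this toy setting the memoryless controller moves away from the target under a 150 degree viewpoint change, while the variant that uses its last action and observation corrects itself after one mistaken step. A learned recurrent controller would have to infer this kind of relationship implicitly from image sequences rather than from an explicit angle estimate.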

Cite

Text

Sadeghi et al. "Sim2Real Viewpoint Invariant Visual Servoing by Recurrent Control." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018. doi:10.1109/CVPR.2018.00493

Markdown

[Sadeghi et al. "Sim2Real Viewpoint Invariant Visual Servoing by Recurrent Control." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018.](https://mlanthology.org/cvpr/2018/sadeghi2018cvpr-sim2real/) doi:10.1109/CVPR.2018.00493

BibTeX

@inproceedings{sadeghi2018cvpr-sim2real,
  title     = {{Sim2Real Viewpoint Invariant Visual Servoing by Recurrent Control}},
  author    = {Sadeghi, Fereshteh and Toshev, Alexander and Jang, Eric and Levine, Sergey},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2018},
  doi       = {10.1109/CVPR.2018.00493},
  url       = {https://mlanthology.org/cvpr/2018/sadeghi2018cvpr-sim2real/}
}