Unsupervised Object Keypoint Learning Using Local Spatial Predictability
Abstract
We propose PermaKey, a novel approach to representation learning based on object keypoints. It leverages the predictability of local image regions from spatial neighborhoods to identify salient regions that correspond to object parts, which are then converted to keypoints. Unlike prior approaches, it utilizes predictability to discover object keypoints, an intrinsic property of objects. This ensures that it does not overly bias keypoints to focus on characteristics that are not unique to objects, such as movement, shape, colour etc. We demonstrate the efficacy of PermaKey on Atari where it learns keypoints corresponding to the most salient object parts and is robust to certain visual distractors. Further, on downstream RL tasks in the Atari domain we demonstrate how agents equipped with our keypoints outperform those using competing alternatives, even on challenging environments with moving backgrounds or distractor objects.
Cite
Text
Gopalakrishnan et al. "Unsupervised Object Keypoint Learning Using Local Spatial Predictability." International Conference on Learning Representations, 2021.Markdown
[Gopalakrishnan et al. "Unsupervised Object Keypoint Learning Using Local Spatial Predictability." International Conference on Learning Representations, 2021.](https://mlanthology.org/iclr/2021/gopalakrishnan2021iclr-unsupervised/)BibTeX
@inproceedings{gopalakrishnan2021iclr-unsupervised,
title = {{Unsupervised Object Keypoint Learning Using Local Spatial Predictability}},
author = {Gopalakrishnan, Anand and van Steenkiste, Sjoerd and Schmidhuber, Jürgen},
booktitle = {International Conference on Learning Representations},
year = {2021},
url = {https://mlanthology.org/iclr/2021/gopalakrishnan2021iclr-unsupervised/}
}