Occlusion-Net: 2D/3D Occluded Keypoint Localization Using Graph Networks
Abstract
We present Occlusion-Net, a framework to predict 2D and 3D locations of occluded keypoints for objects, in a largely self-supervised manner. We use an off-the-shelf detector as input (like MaskRCNN) that is trained only on visible key point annotations. This is the only supervision used in this work. A graph encoder network then explicitly classifies invisible edges and a graph decoder network corrects the occluded keypoint locations from the initial detector. Central to this work is a trifocal tensor loss that provides indirect self-supervision for occluded keypoint locations that are visible in other views of the object. The 2D keypoints are then passed into a 3D graph network that estimates the 3D shape and camera pose using the self-supervised re-projection loss. At test time, our approach successfully localizes keypoints in a single view under a diverse set of severe occlusion settings. We demonstrate and evaluate our approach on synthetic CAD data as well as a large image set capturing vehicles at many busy city intersections. As an interesting aside, we compare the accuracy of human labels of invisible keypoints against those obtained from geometric trifocal-tensor loss.
Cite
Text
Reddy et al. "Occlusion-Net: 2D/3D Occluded Keypoint Localization Using Graph Networks." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019. doi:10.1109/CVPR.2019.00750Markdown
[Reddy et al. "Occlusion-Net: 2D/3D Occluded Keypoint Localization Using Graph Networks." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019.](https://mlanthology.org/cvpr/2019/reddy2019cvpr-occlusionnet/) doi:10.1109/CVPR.2019.00750BibTeX
@inproceedings{reddy2019cvpr-occlusionnet,
title = {{Occlusion-Net: 2D/3D Occluded Keypoint Localization Using Graph Networks}},
author = {Reddy, N. Dinesh and Vo, Minh and Narasimhan, Srinivasa G.},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2019},
doi = {10.1109/CVPR.2019.00750},
url = {https://mlanthology.org/cvpr/2019/reddy2019cvpr-occlusionnet/}
}