Occlusion-Net: 2D/3D Occluded Keypoint Localization Using Graph Networks

Reddy, N. Dinesh; Vo, Minh; Narasimhan, Srinivasa G.

doi:10.1109/CVPR.2019.00750

Occlusion-Net: 2D/3D Occluded Keypoint Localization Using Graph Networks

N. Dinesh Reddy, Minh Vo, Srinivasa G. Narasimhan

CVPR 2019

doi:10.1109/CVPR.2019.00750 /cvpr/2019/reddy2019cvpr-occlusionnet/

Abstract

We present Occlusion-Net, a framework to predict 2D and 3D locations of occluded keypoints for objects, in a largely self-supervised manner. We use an off-the-shelf detector as input (like MaskRCNN) that is trained only on visible key point annotations. This is the only supervision used in this work. A graph encoder network then explicitly classifies invisible edges and a graph decoder network corrects the occluded keypoint locations from the initial detector. Central to this work is a trifocal tensor loss that provides indirect self-supervision for occluded keypoint locations that are visible in other views of the object. The 2D keypoints are then passed into a 3D graph network that estimates the 3D shape and camera pose using the self-supervised re-projection loss. At test time, our approach successfully localizes keypoints in a single view under a diverse set of severe occlusion settings. We demonstrate and evaluate our approach on synthetic CAD data as well as a large image set capturing vehicles at many busy city intersections. As an interesting aside, we compare the accuracy of human labels of invisible keypoints against those obtained from geometric trifocal-tensor loss.

PDF CVPR Semantic Scholar

Cite

Text

Reddy et al. "Occlusion-Net: 2D/3D Occluded Keypoint Localization Using Graph Networks." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019. doi:10.1109/CVPR.2019.00750

Markdown

[Reddy et al. "Occlusion-Net: 2D/3D Occluded Keypoint Localization Using Graph Networks." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019.](https://mlanthology.org/cvpr/2019/reddy2019cvpr-occlusionnet/) doi:10.1109/CVPR.2019.00750

BibTeX

@inproceedings{reddy2019cvpr-occlusionnet,
  title     = {{Occlusion-Net: 2D/3D Occluded Keypoint Localization Using Graph Networks}},
  author    = {Reddy, N. Dinesh and Vo, Minh and Narasimhan, Srinivasa G.},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2019},
  doi       = {10.1109/CVPR.2019.00750},
  url       = {https://mlanthology.org/cvpr/2019/reddy2019cvpr-occlusionnet/}
}