Depth Privileged Object Detection in Indoor Scenes via Deformation Hallucination

Abstract

RGB-D object detection has achieved significant advance, because depth provides complementary geometric information to RGB images. Considering depth images are unavailable in some scenarios, we focus on depth privileged object detection in indoor scenes, where the depth images are only available in the training phase. Under this setting, one prevalent research line is modality hallucination, in which depth image and depth feature are the common choices for hallucinating. In contrast, we choose to hallucinate depth deformation, which is explicit geometric information and efficient to hallucinate. Specifically, we employ the deformable convolution layer with augmented offsets as our deformation module and regard the offsets as geometric deformation, because the offsets enable flexibly sampling over the object and transforming to a canonical shape for ease of detection. In addition, we design a quality-based mechanism to avoid negative transfer of depth deformation. Experimental results and analyses on NYUDv2 and SUN RGB-D demonstrate the effectiveness of our method against the state-of-the-art methods for depth privileged object detection.

Cite

Text

Zhang et al. "Depth Privileged Object Detection in Indoor Scenes via Deformation Hallucination." AAAI Conference on Artificial Intelligence, 2021. doi:10.1609/AAAI.V35I4.16459

Markdown

[Zhang et al. "Depth Privileged Object Detection in Indoor Scenes via Deformation Hallucination." AAAI Conference on Artificial Intelligence, 2021.](https://mlanthology.org/aaai/2021/zhang2021aaai-depth/) doi:10.1609/AAAI.V35I4.16459

BibTeX

@inproceedings{zhang2021aaai-depth,
  title     = {{Depth Privileged Object Detection in Indoor Scenes via Deformation Hallucination}},
  author    = {Zhang, Zhijie and Liu, Yan and Chen, Junjie and Niu, Li and Zhang, Liqing},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2021},
  pages     = {3456-3464},
  doi       = {10.1609/AAAI.V35I4.16459},
  url       = {https://mlanthology.org/aaai/2021/zhang2021aaai-depth/}
}