Speed up Object Detection on Gigapixel-Level Images with Patch Arrangement

Abstract

With the emergence of super-high-resolution (e.g., gigapixel-level) images, efficient object detection on such images has become an important problem. Most existing work on efficient object detection in high-resolution images focuses on generating local patches where objects may exist and then running the detector on every patch independently. However, when the image resolution reaches the gigapixel level, these methods incur a huge time cost because of the large number of patches to process. In contrast, we devise a novel patch arrangement framework for fast object detection on gigapixel-level images. Under this framework, a Patch Arrangement Network (PAN) is proposed to accelerate detection by determining which patches can be packed together into a compact canvas. Specifically, PAN consists of (1) a Patch Filter Module (PFM) and (2) a Patch Packing Module (PPM). PFM filters patch candidates by learning to select patches between two granularities. Subsequently, PPM determines how to pack the remaining patches into a smaller number of canvases and, at the same time, generates an ideal layout of the patches on each canvas. These canvases are then fed to the detector to obtain the final results. Experiments show that our method can improve inference speed on gigapixel-level images by 5 times while maintaining strong detection performance.
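
The idea of packing patches into a small number of canvases can be illustrated with a simple, non-learned stand-in. The sketch below is not PAN: it swaps the learned Patch Packing Module for a greedy shelf-packing heuristic, assumes fixed-size canvases and axis-aligned rectangular patches, and performs no filtering or rescaling. The names `Patch`, `Placement`, `pack_patches`, and `to_image_coords` are hypothetical and chosen only for this illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Patch:
    x: int  # left edge in the source image
    y: int  # top edge in the source image
    w: int
    h: int


@dataclass(frozen=True)
class Placement:
    patch: Patch
    canvas: int  # index of the canvas the patch was placed on
    cx: int      # left edge on the canvas
    cy: int      # top edge on the canvas


def pack_patches(patches, canvas_w, canvas_h):
    """Greedy shelf packing (assumed heuristic, not the learned PPM).

    Patches are sorted by height, then placed left to right on horizontal
    shelves; a new canvas is opened only when no existing canvas has room.
    Returns the placements and the number of canvases used.
    """
    placements = []
    canvases = []  # one list of shelves per canvas; shelf = [top, height, used_width]
    for p in sorted(patches, key=lambda q: q.h, reverse=True):
        if p.w > canvas_w or p.h > canvas_h:
            raise ValueError("patch is larger than the canvas")
        placed = False
        for ci, shelves in enumerate(canvases):
            # First try to extend an existing shelf on this canvas.
            for shelf in shelves:
                if p.h <= shelf[1] and shelf[2] + p.w <= canvas_w:
                    placements.append(Placement(p, ci, shelf[2], shelf[0]))
                    shelf[2] += p.w
                    placed = True
                    break
            if placed:
                break
            # Otherwise try to open a new shelf below the last one.
            top = shelves[-1][0] + shelves[-1][1]
            if top + p.h <= canvas_h:
                shelves.append([top, p.h, p.w])
                placements.append(Placement(p, ci, 0, top))
                placed = True
                break
        if not placed:
            # No canvas had room: open a new canvas with a single shelf.
            canvases.append([[0, p.h, p.w]])
            placements.append(Placement(p, len(canvases) - 1, 0, 0))
    return placements, len(canvases)


def to_image_coords(placement, box):
    """Map a detection box (x1, y1, x2, y2) from canvas to source-image coordinates."""
    dx = placement.patch.x - placement.cx
    dy = placement.patch.y - placement.cy
    x1, y1, x2, y2 = box
    return (x1 + dx, y1 + dy, x2 + dx, y2 + dy)


# Four 50x50 patches from distant image regions fit on one 100x100 canvas,
# so the detector would run once instead of four times.
patches = [Patch(1000, 2000, 50, 50), Patch(9000, 300, 50, 50),
           Patch(40000, 7000, 50, 50), Patch(123, 456, 50, 50)]
placements, n_canvases = pack_patches(patches, canvas_w=100, canvas_h=100)
```

Under these assumptions, the saving comes from the detector processing `n_canvases` inputs rather than one input per patch, and `to_image_coords` recovers where each detection lies in the original image.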

Cite

Text

Fan et al. "Speed up Object Detection on Gigapixel-Level Images with Patch Arrangement." Conference on Computer Vision and Pattern Recognition, 2022. doi:10.1109/CVPR52688.2022.00461

Markdown

[Fan et al. "Speed up Object Detection on Gigapixel-Level Images with Patch Arrangement." Conference on Computer Vision and Pattern Recognition, 2022.](https://mlanthology.org/cvpr/2022/fan2022cvpr-speed/) doi:10.1109/CVPR52688.2022.00461

BibTeX

@inproceedings{fan2022cvpr-speed,
  title     = {{Speed up Object Detection on Gigapixel-Level Images with Patch Arrangement}},
  author    = {Fan, Jiahao and Liu, Huabin and Yang, Wenjie and See, John and Zhang, Aixin and Lin, Weiyao},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2022},
  pages     = {4653--4661},
  doi       = {10.1109/CVPR52688.2022.00461},
  url       = {https://mlanthology.org/cvpr/2022/fan2022cvpr-speed/}
}