Matrix Nets: A New Deep Architecture for Object Detection

Abstract

We present Matrix Nets (xNets), a new deep architecture for object detection. xNets map objects with different sizes and aspect ratios into layers where the sizes and the aspect ratios of the objects within their layers are nearly uniform. Hence, xNets provide a scale and aspect ratio aware architecture. We leverage xNets to enhance key-points based object detection. Our architecture achieves mAP of 47.8 on MS COCO, which is higher than any other single-shot detector while using half the number of parameters and training 3x faster than the next best architecture.

Cite

Text

Rashwan et al. "Matrix Nets: A New Deep Architecture for Object Detection." IEEE/CVF International Conference on Computer Vision Workshops, 2019. doi:10.1109/ICCVW.2019.00252

Markdown

[Rashwan et al. "Matrix Nets: A New Deep Architecture for Object Detection." IEEE/CVF International Conference on Computer Vision Workshops, 2019.](https://mlanthology.org/iccvw/2019/rashwan2019iccvw-matrix/) doi:10.1109/ICCVW.2019.00252

BibTeX

@inproceedings{rashwan2019iccvw-matrix,
  title     = {{Matrix Nets: A New Deep Architecture for Object Detection}},
  author    = {Rashwan, Abdullah and Kalra, Agastya and Poupart, Pascal},
  booktitle = {IEEE/CVF International Conference on Computer Vision Workshops},
  year      = {2019},
  pages     = {2025-2028},
  doi       = {10.1109/ICCVW.2019.00252},
  url       = {https://mlanthology.org/iccvw/2019/rashwan2019iccvw-matrix/}
}