Adaptive RoI with Pretrained Models for Automated Retail Checkout

Abstract

In this paper, we present a solution for automatic checkout in a retail store as a part of AI City Challenge 2023 Track 4. We propose a methodology which involves usage of pretrained Yolov5 models to detect person and media pipe models to detect hands of the person. This information is utilized to compute the Region of Interest (RoI) which is adaptive in nature. Afterwards, a custom trained object detection model is used detect products in the frame. We then use a tracker to track the products across video frames to avoid duplicated counting. The method is evaluated on the AI City challenge 2023 – Track 4 and gets the F1 score 0.6571 on the test A set, which places us on 6th place on the public leader board. The code is made public and available on GitHub.1

Cite

Text

Dhonde et al. "Adaptive RoI with Pretrained Models for Automated Retail Checkout." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2023. doi:10.1109/CVPRW59228.2023.00582

Markdown

[Dhonde et al. "Adaptive RoI with Pretrained Models for Automated Retail Checkout." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2023.](https://mlanthology.org/cvprw/2023/dhonde2023cvprw-adaptive/) doi:10.1109/CVPRW59228.2023.00582

BibTeX

@inproceedings{dhonde2023cvprw-adaptive,
  title     = {{Adaptive RoI with Pretrained Models for Automated Retail Checkout}},
  author    = {Dhonde, Anudeep and Guntur, Prabhudev and Palani, Vinitha},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2023},
  pages     = {5507-5510},
  doi       = {10.1109/CVPRW59228.2023.00582},
  url       = {https://mlanthology.org/cvprw/2023/dhonde2023cvprw-adaptive/}
}