Stream-Based Active Distillation for Scalable Model Deployment
Abstract
This paper proposes a scalable technique for developing lightweight yet powerful models for object detection in videos using self-training with knowledge distillation. This approach involves training a compact student model using pseudo-labels generated by a computationally complex but generic teacher model, which can help to reduce the need for massive amounts of data and computational power. However, model-based annotations in large-scale applications may propagate errors or biases. To address these issues, our paper introduces Stream-Based Active Distillation (SBAD) to endow pre-trained students with effective and efficient fine-tuning methods that are robust to teacher imperfections. The proposed pipeline: (i) adapts a pre-trained student model to a specific use case, based on a set of frames whose pseudo-labels are predicted by the teacher, and (ii) selects on-the-fly, along a streamed video, the images that should be considered to fine-tune the student model. Various selection strategies are compared, demonstrating: 1) the effectiveness of implementing distillation with pseudo-labels, and 2) the importance of selecting images for which the pre-trained student detects with a high confidence.
Cite
Text
Manjah et al. "Stream-Based Active Distillation for Scalable Model Deployment." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2023. doi:10.1109/CVPRW59228.2023.00528Markdown
[Manjah et al. "Stream-Based Active Distillation for Scalable Model Deployment." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2023.](https://mlanthology.org/cvprw/2023/manjah2023cvprw-streambased/) doi:10.1109/CVPRW59228.2023.00528BibTeX
@inproceedings{manjah2023cvprw-streambased,
title = {{Stream-Based Active Distillation for Scalable Model Deployment}},
author = {Manjah, Dani and Cacciarelli, Davide and Benkedadra, Mohamed and Standaert, Baptiste and De Hertaing, Gauthier Rotsart and Macq, Benoît and Galland, Stéphane and De Vleeschouwer, Christophe},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
year = {2023},
pages = {4999-5007},
doi = {10.1109/CVPRW59228.2023.00528},
url = {https://mlanthology.org/cvprw/2023/manjah2023cvprw-streambased/}
}