CLActive: Episodic Memories for Rapid Active Learning

Abstract

Active Learning aims to solve the problem of alleviating labelling costs for large-scale datasets by selecting a subset of data to effectively train on. Deep Active Learning (DAL) techniques typically involve repeated training of a model for sample acquisition over the entire subset of labelled data available in each round. This can be prohibitively expensive to run in real-world scenarios with large and constantly growing data. Some work has been done to address this – notably, Selection-Via-Proxy (SVP) proposed the use of a separate, smaller proxy model for acquisition. We explore further optimizations to the standard DAL setup and propose CLActive: an optimization procedure that brings significant speedups which maintains a constant training time for the selection model across rounds and retains information from past rounds using Experience Replay. We demonstrate large improvements in total train-time compared to the fully-trained baselines and SVP. We achieve up to 89$\times$, 7$\times$, 61$\times$ speedups over the fully-trained baseline at 50% of dataset collection in CIFAR, Imagenet and Amazon Review datasets, respectively, with little accuracy loss. We also show that CLActive is robust against catastrophic forgetting in a challenging class-incremental active-learning setting. Overall, we believe that CLActive can effectively enable rapid prototyping and deployment of deep AL algorithms in real-world use cases across a variety of settings.

Cite

Text

Munagala et al. "CLActive: Episodic Memories for Rapid Active Learning." Proceedings of The 1st Conference on Lifelong Learning Agents, 2022.

Markdown

[Munagala et al. "CLActive: Episodic Memories for Rapid Active Learning." Proceedings of The 1st Conference on Lifelong Learning Agents, 2022.](https://mlanthology.org/collas/2022/munagala2022collas-clactive/)

BibTeX

@inproceedings{munagala2022collas-clactive,
  title     = {{CLActive: Episodic Memories for Rapid Active Learning}},
  author    = {Munagala, Sri Aurobindo and Subramanian, Sidhant and Karthik, Shyamgopal and Prabhu, Ameya and Namboodiri, Anoop},
  booktitle = {Proceedings of The 1st Conference on Lifelong Learning Agents},
  year      = {2022},
  pages     = {430-440},
  volume    = {199},
  url       = {https://mlanthology.org/collas/2022/munagala2022collas-clactive/}
}