CLActive: Episodic Memories for Rapid Active Learning
Abstract
Active Learning aims to solve the problem of alleviating labelling costs for large-scale datasets by selecting a subset of data to effectively train on. Deep Active Learning (DAL) techniques typically involve repeated training of a model for sample acquisition over the entire subset of labelled data available in each round. This can be prohibitively expensive to run in real-world scenarios with large and constantly growing data. Some work has been done to address this – notably, Selection-Via-Proxy (SVP) proposed the use of a separate, smaller proxy model for acquisition. We explore further optimizations to the standard DAL setup and propose CLActive: an optimization procedure that brings significant speedups which maintains a constant training time for the selection model across rounds and retains information from past rounds using Experience Replay. We demonstrate large improvements in total train-time compared to the fully-trained baselines and SVP. We achieve up to 89$\times$, 7$\times$, 61$\times$ speedups over the fully-trained baseline at 50% of dataset collection in CIFAR, Imagenet and Amazon Review datasets, respectively, with little accuracy loss. We also show that CLActive is robust against catastrophic forgetting in a challenging class-incremental active-learning setting. Overall, we believe that CLActive can effectively enable rapid prototyping and deployment of deep AL algorithms in real-world use cases across a variety of settings.
Cite
Text
Munagala et al. "CLActive: Episodic Memories for Rapid Active Learning." Proceedings of The 1st Conference on Lifelong Learning Agents, 2022.Markdown
[Munagala et al. "CLActive: Episodic Memories for Rapid Active Learning." Proceedings of The 1st Conference on Lifelong Learning Agents, 2022.](https://mlanthology.org/collas/2022/munagala2022collas-clactive/)BibTeX
@inproceedings{munagala2022collas-clactive,
title = {{CLActive: Episodic Memories for Rapid Active Learning}},
author = {Munagala, Sri Aurobindo and Subramanian, Sidhant and Karthik, Shyamgopal and Prabhu, Ameya and Namboodiri, Anoop},
booktitle = {Proceedings of The 1st Conference on Lifelong Learning Agents},
year = {2022},
pages = {430-440},
volume = {199},
url = {https://mlanthology.org/collas/2022/munagala2022collas-clactive/}
}