Accelerated Deep Active Learning with Graph-Based Sub- Sampling

Abstract

Past years have witnessed the fast and thorough development of active learning, a human-in-the-loop semi-supervised learning that helps reduce the burden of expensive data annotation. Diverse techniques have been proposed to improve the efficiency of label acquisition. However, the existing techniques are mostly intractable at scale on massive unlabeled instances. In particular, the query time and model retraining time of large scale image-data models is usually linear or even quadratic in the size of the unlabeled pool set and its dimension. The main reason for this intractability is the iterative need to scan the pool set at least once in order to select the best samples for label annotation. To alleviate this computational burden we propose efficient Diffusion Graph Active Learning (DGAL). DGAL is used on a pre-computed Variational-Auto-Encoders (VAE) latent space to restrict the pool set to a much smaller candidates set. The sub-sample is then used in deep architectures, to reduce the query time, via an additional standard active learning baseline criterion. DGAL demonstrates a query time versus accuracy trade-off that is two or more orders of magnitude acceleration over state-of-the-art methods. Moreover, we demonstrate the important exploration-exploitation trade-off in DGAL that allows the restricted set to capture the most impactful samples for active learning at each iteration.

Cite

Text

Kushnir and Xu. "Accelerated Deep Active Learning with Graph-Based Sub- Sampling." Transactions on Machine Learning Research, 2024.

Markdown

[Kushnir and Xu. "Accelerated Deep Active Learning with Graph-Based Sub- Sampling." Transactions on Machine Learning Research, 2024.](https://mlanthology.org/tmlr/2024/kushnir2024tmlr-accelerated/)

BibTeX

@article{kushnir2024tmlr-accelerated,
  title     = {{Accelerated Deep Active Learning with Graph-Based Sub- Sampling}},
  author    = {Kushnir, Dan and Xu, Shiyun},
  journal   = {Transactions on Machine Learning Research},
  year      = {2024},
  url       = {https://mlanthology.org/tmlr/2024/kushnir2024tmlr-accelerated/}
}