Inconsistency-Based Data-Centric Active Open-Set Annotation

Abstract

Active learning, a method to reduce labeling effort for training deep neural networks, is often limited by the assumption that all unlabeled data belong to known classes. This closed-world assumption fails in practical scenarios with unknown classes in the data, leading to active open-set annotation challenges. Existing methods struggle with this uncertainty. We introduce NEAT, a novel, computationally efficient, data-centric active learning approach for open-set data. NEAT differentiates and labels known classes from a mix of known and unknown classes, using a clusterability criterion and a consistency mea- sure that detects inconsistencies between model predictions and feature distribution. In contrast to recent learning-centric solutions, NEAT shows superior performance in active open- set annotation, as our experiments confirm. Additional details on the further evaluation metrics, implementation, and archi- tecture of our method can be found in the public document at https://arxiv.org/pdf/2401.04923.pdf.

Cite

Text

Mao et al. "Inconsistency-Based Data-Centric Active Open-Set Annotation." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I5.28213

Markdown

[Mao et al. "Inconsistency-Based Data-Centric Active Open-Set Annotation." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/mao2024aaai-inconsistency/) doi:10.1609/AAAI.V38I5.28213

BibTeX

@inproceedings{mao2024aaai-inconsistency,
  title     = {{Inconsistency-Based Data-Centric Active Open-Set Annotation}},
  author    = {Mao, Ruiyu and Xu, Ouyang and Guo, Yunhui},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {4180-4188},
  doi       = {10.1609/AAAI.V38I5.28213},
  url       = {https://mlanthology.org/aaai/2024/mao2024aaai-inconsistency/}
}