Inconsistency-Based Data-Centric Active Open-Set Annotation
Abstract
Active learning, a method to reduce labeling effort for training deep neural networks, is often limited by the assumption that all unlabeled data belong to known classes. This closed-world assumption fails in practical scenarios with unknown classes in the data, leading to active open-set annotation challenges. Existing methods struggle with this uncertainty. We introduce NEAT, a novel, computationally efficient, data-centric active learning approach for open-set data. NEAT differentiates and labels known classes from a mix of known and unknown classes, using a clusterability criterion and a consistency mea- sure that detects inconsistencies between model predictions and feature distribution. In contrast to recent learning-centric solutions, NEAT shows superior performance in active open- set annotation, as our experiments confirm. Additional details on the further evaluation metrics, implementation, and archi- tecture of our method can be found in the public document at https://arxiv.org/pdf/2401.04923.pdf.
Cite
Text
Mao et al. "Inconsistency-Based Data-Centric Active Open-Set Annotation." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I5.28213Markdown
[Mao et al. "Inconsistency-Based Data-Centric Active Open-Set Annotation." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/mao2024aaai-inconsistency/) doi:10.1609/AAAI.V38I5.28213BibTeX
@inproceedings{mao2024aaai-inconsistency,
title = {{Inconsistency-Based Data-Centric Active Open-Set Annotation}},
author = {Mao, Ruiyu and Xu, Ouyang and Guo, Yunhui},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2024},
pages = {4180-4188},
doi = {10.1609/AAAI.V38I5.28213},
url = {https://mlanthology.org/aaai/2024/mao2024aaai-inconsistency/}
}