Finding Frequent Entities in Continuous Data

Abstract

In many applications that involve processing high-dimensional data, it is important to identify a small set of entities that account for a significant fraction of detections. Rather than formalize this as a clustering problem, in which all detections must be grouped into hard or soft categories, we formalize it as an instance of the frequent items or heavy hitters problem, which finds groups of tightly clustered objects that have a high density in the feature space. We show that the heavy hitters formulation generates solutions that are more accurate and effective than the clustering formulation. In addition, we present a novel online algorithm for heavy hitters, called HAC, which addresses problems in continuous space, and demonstrate its effectiveness on real video and household domains.

Cite

Text

Alet et al. "Finding Frequent Entities in Continuous Data." International Joint Conference on Artificial Intelligence, 2018. doi:10.24963/IJCAI.2018/275

Markdown

[Alet et al. "Finding Frequent Entities in Continuous Data." International Joint Conference on Artificial Intelligence, 2018.](https://mlanthology.org/ijcai/2018/alet2018ijcai-finding/) doi:10.24963/IJCAI.2018/275

BibTeX

@inproceedings{alet2018ijcai-finding,
  title     = {{Finding Frequent Entities in Continuous Data}},
  author    = {Alet, Ferran and Chitnis, Rohan and Kaelbling, Leslie Pack and Lozano-Pérez, Tomás},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2018},
  pages     = {1992-1999},
  doi       = {10.24963/IJCAI.2018/275},
  url       = {https://mlanthology.org/ijcai/2018/alet2018ijcai-finding/}
}