Fair Clustering in the Sliding Window Model

Abstract

We study streaming algorithms for proportionally fair clustering, a notion originally suggested by Chierichetti et al. (2017), in the sliding window model. We show that although there exist efficient streaming algorithms in the insertion-only model, surprisingly no algorithm can achieve finite ratio without violating the fairness constraint in sliding window. Hence, the problem of fair clustering is a rare separation between the insertion-only streaming model and the sliding window model. On the other hand, we show that if the fairness constraint is relaxed by a multiplicative $(1+\varepsilon)$ factor, there exists a $(1 + \varepsilon)$-approximate sliding window algorithm that uses $\text{poly}(k\varepsilon^{-1}\log n)$ space. This achieves essentially the best parameters (up to degree in the polynomial) provided the aforementioned lower bound. We also implement a number of empirical evaluations on real datasets to complement our theoretical results.

Cite

Text

Cohen-Addad et al. "Fair Clustering in the Sliding Window Model." International Conference on Learning Representations, 2025.

Markdown

[Cohen-Addad et al. "Fair Clustering in the Sliding Window Model." International Conference on Learning Representations, 2025.](https://mlanthology.org/iclr/2025/cohenaddad2025iclr-fair/)

BibTeX

@inproceedings{cohenaddad2025iclr-fair,
  title     = {{Fair Clustering in the Sliding Window Model}},
  author    = {Cohen-Addad, Vincent and Jiang, Shaofeng H.-C. and Yang, Qiaoyuan and Zhang, Yubo and Zhou, Samson},
  booktitle = {International Conference on Learning Representations},
  year      = {2025},
  url       = {https://mlanthology.org/iclr/2025/cohenaddad2025iclr-fair/}
}