Fair Associative Co-Clustering
Abstract
Co-clustering is a powerful data mining tool that extracts summary information from a data matrix, by simultaneously computing row and column clusters that provide a compact representation of the data. However, if the matrix contains data about individuals, the co-clustering results may be influenced by the societal biases that are reproduced in the data. Consequently, subsequent tasks such as recommendation systems may also be influenced by these biases, thereby compromising the fairness and integrity of the overall knowledge discovery or machine learning process. Despite the extensive research on fairness considerations in clustering, this issue has not been addressed in the context of co-clustering algorithms. In addressing this critical gap in the literature, this paper proposes a novel fair co-clustering algorithm. The proposed algorithm is based on an associative measure derived from the Goodman-Kruskal’s tau , which has demonstrated good convergence properties. This ensures optimal clustering and fairness performance by implementing an in-process rebalancing mechanism inspired by the fair assignment problem. An extensive experimental validation is provided to demonstrate the efficacy of our approach, also in comparison to a state-of-the-art method that uses co-clustering for fair recommendation.
Cite
Text
Peiretti and Pensa. "Fair Associative Co-Clustering." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2025. doi:10.1007/978-3-032-05962-8_17Markdown
[Peiretti and Pensa. "Fair Associative Co-Clustering." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2025.](https://mlanthology.org/ecmlpkdd/2025/peiretti2025ecmlpkdd-fair/) doi:10.1007/978-3-032-05962-8_17BibTeX
@inproceedings{peiretti2025ecmlpkdd-fair,
title = {{Fair Associative Co-Clustering}},
author = {Peiretti, Federico and Pensa, Ruggero G.},
booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
year = {2025},
pages = {282-300},
doi = {10.1007/978-3-032-05962-8_17},
url = {https://mlanthology.org/ecmlpkdd/2025/peiretti2025ecmlpkdd-fair/}
}