Bipartite Correlation Clustering: Maximizing Agreements

Abstract

In Bipartite Correlation Clustering (BCC) we are given a complete bipartite graph $G$ with `+' and `-' edges, and we seek a vertex clustering that maximizes the number of agreements: the number of all `+' edges within clusters plus all `-' edges cut across clusters. BCC is known to be NP-hard. We present a novel approximation algorithm for $k$-BCC, a variant of BCC with an upper bound $k$ on the number of clusters. Our algorithm outputs a $k$-clustering that provably achieves a number of agreements within a multiplicative ${(1-\delta)}$-factor from the optimal, for any desired accuracy $\delta$. It relies on solving a combinatorially constrained bilinear maximization on the bi-adjacency matrix of $G$. It runs in time exponential in $k$ and $\delta^{-1}$, but linear in the size of the input. Further, we show that, in the (unconstrained) BCC setting, an ${(1-\delta)}$-approximation can be achieved by $O(\delta^{-1})$ clusters regardless of the size of the graph. In turn, our $k$-BCC algorithm implies an Efficient PTAS for the BCC objective of maximizing agreements.

Cite

Text

Asteris et al. "Bipartite Correlation Clustering: Maximizing Agreements." International Conference on Artificial Intelligence and Statistics, 2016.

Markdown

[Asteris et al. "Bipartite Correlation Clustering: Maximizing Agreements." International Conference on Artificial Intelligence and Statistics, 2016.](https://mlanthology.org/aistats/2016/asteris2016aistats-bipartite/)

BibTeX

@inproceedings{asteris2016aistats-bipartite,
  title     = {{Bipartite Correlation Clustering: Maximizing Agreements}},
  author    = {Asteris, Megasthenis and Kyrillidis, Anastasios and Papailiopoulos, Dimitris S. and Dimakis, Alexandros G.},
  booktitle = {International Conference on Artificial Intelligence and Statistics},
  year      = {2016},
  pages     = {121-129},
  url       = {https://mlanthology.org/aistats/2016/asteris2016aistats-bipartite/}
}