Data Clustering with a Relational Push-Pull Model

Abstract

We present a new generative model for relational data in which relations between objects can have ei- ther a binding or a separating effect. For example, in a group of students separated into gender clusters, a "dating" relation would appear most frequently between the clusters, but a "roommate" relation would appear more often within clusters. In visualizing these rela- tions, one can imagine that the "dating" relation effec- tively pushes clusters apart, while the "roommate" re- lation pulls clusters into tighter formations. A unique aspect of the model is that an edge's existence is depen- dent on both the clusters to which the two connected objects belong and the features of the connected objects. We use simulated annealing to search for optimal val- ues of the unknown model parameters, where the ob- jective function is a Bayesian score derived from the generative model. Results describing the performance of the model are shown with artificial data as well as a subset of the Internet Movie Database. The results show that discovering a relation's tendency to either push or pull is critical to discovering a consistent clus- tering.

Cite

Text

Anthony and desJardins. "Data Clustering with a Relational Push-Pull Model." AAAI Conference on Artificial Intelligence, 2007. doi:10.1109/icdmw.2007.61

Markdown

[Anthony and desJardins. "Data Clustering with a Relational Push-Pull Model." AAAI Conference on Artificial Intelligence, 2007.](https://mlanthology.org/aaai/2007/anthony2007aaai-data/) doi:10.1109/icdmw.2007.61

BibTeX

@inproceedings{anthony2007aaai-data,
  title     = {{Data Clustering with a Relational Push-Pull Model}},
  author    = {Anthony, Adam and desJardins, Marie},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2007},
  pages     = {1840-1841},
  doi       = {10.1109/icdmw.2007.61},
  url       = {https://mlanthology.org/aaai/2007/anthony2007aaai-data/}
}