Statistical Predicate Invention

Abstract

We propose statistical predicate invention as a key problem for statistical relational learning. SPI is the problem of discovering new concepts, properties and relations in structured data, and generalizes hidden variable discovery in statistical models and predicate invention in ILP. We propose an initial model for SPI based on second-order Markov logic, in which predicates as well as arguments can be variables, and the domain of discourse is not fully known in advance. Our approach iteratively refines clusters of symbols based on the clusters of symbols they appear in atoms with (e.g., it clusters relations by the clusters of the ob jects they relate). Since different clusterings are better for predicting different subsets of the atoms, we allow multiple cross-cutting clusterings. We show that this approach outperforms Markov logic structure learning and the recently introduced infinite relational model on a number of relational datasets.

Cite

Text

Kok and Domingos. "Statistical Predicate Invention." International Conference on Machine Learning, 2007. doi:10.1145/1273496.1273551

Markdown

[Kok and Domingos. "Statistical Predicate Invention." International Conference on Machine Learning, 2007.](https://mlanthology.org/icml/2007/kok2007icml-statistical/) doi:10.1145/1273496.1273551

BibTeX

@inproceedings{kok2007icml-statistical,
  title     = {{Statistical Predicate Invention}},
  author    = {Kok, Stanley and Domingos, Pedro M.},
  booktitle = {International Conference on Machine Learning},
  year      = {2007},
  pages     = {433-440},
  doi       = {10.1145/1273496.1273551},
  url       = {https://mlanthology.org/icml/2007/kok2007icml-statistical/}
}