Relational Learning with Statistical Predicate Invention: Better Models for Hypertext

Abstract

We present a new approach to learning hypertext classifiers that combines a statistical text-learning method with a relational rule learner. This approach is well suited to learning in hypertext domains because its statistical component allows it to characterize text in terms of word frequencies, whereas its relational component is able to describe how neighboring documents are related to each other by hyperlinks that connect them. We evaluate our approach by applying it to tasks that involve learning definitions for (i) classes of pages, (ii) particular relations that exist between pairs of pages, and (iii) locating a particular class of information in the internal structure of pages. Our experiments demonstrate that this new approach is able to learn more accurate classifiers than either of its constituent methods alone.

Cite

Text

Craven and Slattery. "Relational Learning with Statistical Predicate Invention: Better Models for Hypertext." Machine Learning, 2001. doi:10.1023/A:1007676901476

Markdown

[Craven and Slattery. "Relational Learning with Statistical Predicate Invention: Better Models for Hypertext." Machine Learning, 2001.](https://mlanthology.org/mlj/2001/craven2001mlj-relational/) doi:10.1023/A:1007676901476

BibTeX

@article{craven2001mlj-relational,
  title     = {{Relational Learning with Statistical Predicate Invention: Better Models for Hypertext}},
  author    = {Craven, Mark and Slattery, Seán},
  journal   = {Machine Learning},
  year      = {2001},
  pages     = {97-119},
  doi       = {10.1023/A:1007676901476},
  volume    = {43},
  url       = {https://mlanthology.org/mlj/2001/craven2001mlj-relational/}
}