Utilizing Context in Generative Bayesian Models for Linked Corpus

Abstract

In an interlinked corpus of documents, the context in which a citation appears provides extra information about the cited document. However, associating terms in the context to the cited document remains an open problem. We propose a novel document generation approach that statistically incorporates the context in which a document links to another document. We quantitatively show that the proposed generation scheme explains the linking phenomenon better than previous approaches. The context information along with the actual content of the document provides significant improvements over the previous approaches for various real world evaluation tasks such as link prediction and log-likelihood estimation on unseen content. The proposed method is more scalable to large collection of documents compared to the previous approaches.

Cite

Text

Kataria et al. "Utilizing Context in Generative Bayesian Models for Linked Corpus." AAAI Conference on Artificial Intelligence, 2010. doi:10.1609/AAAI.V24I1.7520

Markdown

[Kataria et al. "Utilizing Context in Generative Bayesian Models for Linked Corpus." AAAI Conference on Artificial Intelligence, 2010.](https://mlanthology.org/aaai/2010/kataria2010aaai-utilizing/) doi:10.1609/AAAI.V24I1.7520

BibTeX

@inproceedings{kataria2010aaai-utilizing,
  title     = {{Utilizing Context in Generative Bayesian Models for Linked Corpus}},
  author    = {Kataria, Saurabh and Mitra, Prasenjit and Bhatia, Sumit},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2010},
  pages     = {1340-1345},
  doi       = {10.1609/AAAI.V24I1.7520},
  url       = {https://mlanthology.org/aaai/2010/kataria2010aaai-utilizing/}
}