Language Models as Knowledge Embeddings

Abstract

Knowledge embeddings (KE) represent a knowledge graph (KG) by embedding entities and relations into continuous vector spaces. Existing methods are mainly structure-based or description-based. Structure-based methods learn representations that preserve the inherent structure of KGs. They cannot well represent abundant long-tail entities in real-world KGs with limited structural information. Description-based methods leverage textual information and language models. Prior approaches in this direction barely outperform structure-based ones, and suffer from problems like expensive negative sampling and restrictive description demand. In this paper, we propose LMKE, which adopts Language Models to derive Knowledge Embeddings, aiming at both enriching representations of long-tail entities and solving problems of prior description-based methods. We formulate description-based KE learning with a contrastive learning framework to improve efficiency in training and evaluation. Experimental results show that LMKE achieves state-of-the-art performance on KE benchmarks of link prediction and triple classification, especially for long-tail entities.

Cite

Text

Wang et al. "Language Models as Knowledge Embeddings." International Joint Conference on Artificial Intelligence, 2022. doi:10.24963/IJCAI.2022/318

Markdown

[Wang et al. "Language Models as Knowledge Embeddings." International Joint Conference on Artificial Intelligence, 2022.](https://mlanthology.org/ijcai/2022/wang2022ijcai-language/) doi:10.24963/IJCAI.2022/318

BibTeX

@inproceedings{wang2022ijcai-language,
  title     = {{Language Models as Knowledge Embeddings}},
  author    = {Wang, Xintao and He, Qianyu and Liang, Jiaqing and Xiao, Yanghua},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2022},
  pages     = {2291-2297},
  doi       = {10.24963/IJCAI.2022/318},
  url       = {https://mlanthology.org/ijcai/2022/wang2022ijcai-language/}
}