Task-Specific Representation Learning for Web-Scale Entity Disambiguation

Abstract

Named entity disambiguation (NED) is a central problem in information extraction. The goal is to link entities in a knowledge graph (KG) to their mention spans in unstructured text. Each distinct mention span (such as John Smith, Jordan, or Apache) represents a multi-class classification task. NED can therefore be modeled as a multitask problem with tens of millions of tasks for realistic KGs. We initiate an investigation into neural representations, network architectures, and training protocols for multitask NED. Specifically, we propose a task-sensitive representation learning framework that learns mention-dependent representations, followed by a common classifier. Parameter learning in our framework can be decomposed into solving multiple smaller problems involving overlapping groups of tasks. We prove bounds for excess risk, which provide additional insight into the problem of multitask representation learning. While remaining practical in terms of training memory and time requirements, our approach outperforms recent strong baselines on four benchmark data sets.

Cite

Text

Kar et al. "Task-Specific Representation Learning for Web-Scale Entity Disambiguation." AAAI Conference on Artificial Intelligence, 2018. doi:10.1609/AAAI.V32I1.12066

Markdown

[Kar et al. "Task-Specific Representation Learning for Web-Scale Entity Disambiguation." AAAI Conference on Artificial Intelligence, 2018.](https://mlanthology.org/aaai/2018/kar2018aaai-task/) doi:10.1609/AAAI.V32I1.12066

BibTeX

@inproceedings{kar2018aaai-task,
  title     = {{Task-Specific Representation Learning for Web-Scale Entity Disambiguation}},
  author    = {Kar, Rijula and Reddy, Susmija and Bhattacharya, Sourangshu and Dasgupta, Anirban and Chakrabarti, Soumen},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2018},
  pages     = {5812--5819},
  doi       = {10.1609/AAAI.V32I1.12066},
  url       = {https://mlanthology.org/aaai/2018/kar2018aaai-task/}
}