GraphER: Token-Centric Entity Resolution with Graph Convolutional Neural Networks

Abstract

Entity resolution (ER) aims to identify entity records that refer to the same real-world entity, which is a critical problem in data cleaning and integration. Most of the existing models are attribute-centric, that is, matching entity pairs by comparing similarities of pre-aligned attributes, which require the schemas of records to be identical and are too coarse-grained to capture subtle key information within a single attribute. In this paper, we propose a novel graph-based ER model GraphER. Our model is token-centric: the final matching results are generated by directly aggregating token-level comparison features, in which both the semantic and structural information has been softly embedded into token embeddings by training an Entity Record Graph Convolutional Network (ER-GCN). To the best of our knowledge, our work is the first effort to do token-centric entity resolution with the help of GCN in entity resolution task. Extensive experiments on two real-world datasets demonstrate that our model stably outperforms state-of-the-art models.

Cite

Text

Li et al. "GraphER: Token-Centric Entity Resolution with Graph Convolutional Neural Networks." AAAI Conference on Artificial Intelligence, 2020. doi:10.1609/AAAI.V34I05.6330

Markdown

[Li et al. "GraphER: Token-Centric Entity Resolution with Graph Convolutional Neural Networks." AAAI Conference on Artificial Intelligence, 2020.](https://mlanthology.org/aaai/2020/li2020aaai-grapher/) doi:10.1609/AAAI.V34I05.6330

BibTeX

@inproceedings{li2020aaai-grapher,
  title     = {{GraphER: Token-Centric Entity Resolution with Graph Convolutional Neural Networks}},
  author    = {Li, Bing and Wang, Wei and Sun, Yifang and Zhang, Linhan and Ali, Muhammad Asif and Wang, Yi},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2020},
  pages     = {8172-8179},
  doi       = {10.1609/AAAI.V34I05.6330},
  url       = {https://mlanthology.org/aaai/2020/li2020aaai-grapher/}
}