GraphER: Token-Centric Entity Resolution with Graph Convolutional Neural Networks
Abstract
Entity resolution (ER) aims to identify entity records that refer to the same real-world entity, which is a critical problem in data cleaning and integration. Most of the existing models are attribute-centric, that is, matching entity pairs by comparing similarities of pre-aligned attributes, which require the schemas of records to be identical and are too coarse-grained to capture subtle key information within a single attribute. In this paper, we propose a novel graph-based ER model GraphER. Our model is token-centric: the final matching results are generated by directly aggregating token-level comparison features, in which both the semantic and structural information has been softly embedded into token embeddings by training an Entity Record Graph Convolutional Network (ER-GCN). To the best of our knowledge, our work is the first effort to do token-centric entity resolution with the help of GCN in entity resolution task. Extensive experiments on two real-world datasets demonstrate that our model stably outperforms state-of-the-art models.
Cite
Text
Li et al. "GraphER: Token-Centric Entity Resolution with Graph Convolutional Neural Networks." AAAI Conference on Artificial Intelligence, 2020. doi:10.1609/AAAI.V34I05.6330Markdown
[Li et al. "GraphER: Token-Centric Entity Resolution with Graph Convolutional Neural Networks." AAAI Conference on Artificial Intelligence, 2020.](https://mlanthology.org/aaai/2020/li2020aaai-grapher/) doi:10.1609/AAAI.V34I05.6330BibTeX
@inproceedings{li2020aaai-grapher,
title = {{GraphER: Token-Centric Entity Resolution with Graph Convolutional Neural Networks}},
author = {Li, Bing and Wang, Wei and Sun, Yifang and Zhang, Linhan and Ali, Muhammad Asif and Wang, Yi},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2020},
pages = {8172-8179},
doi = {10.1609/AAAI.V34I05.6330},
url = {https://mlanthology.org/aaai/2020/li2020aaai-grapher/}
}