TableRank: A Ranking Algorithm for Table Search and Retrieval
Abstract
Tables are ubiquitous in web pages and scientific documents. With the explosive development of the web, tables have be-come a valuable information repository. Therefore, effec-tively and efficiently searching tables becomes a challenge. Existing search engines do not provide satisfactory search re-sults largely because the current ranking schemes are inade-quate for table search and automatic table understanding and extraction are rather difficult in general. In this work, we de-sign and evaluate a novel table ranking algorithm – TableRank to improve the performance of our table search engine Table-Seer. Given a keyword based table query, TableRank facili-ties TableSeer to return the most relevant tables by tailoring the classic vector space model. TableRank adopts an innova-tive term weighting scheme by aggregating multiple weight-ing factors from three levels: term, table and document. The experimental results show that our table search engine out-performs existing search engines on table search. In addition, incorporating multiple weighting factors can significantly im-prove the ranking results.
Cite
Text
Liu et al. "TableRank: A Ranking Algorithm for Table Search and Retrieval." AAAI Conference on Artificial Intelligence, 2007.Markdown
[Liu et al. "TableRank: A Ranking Algorithm for Table Search and Retrieval." AAAI Conference on Artificial Intelligence, 2007.](https://mlanthology.org/aaai/2007/liu2007aaai-tablerank/)BibTeX
@inproceedings{liu2007aaai-tablerank,
title = {{TableRank: A Ranking Algorithm for Table Search and Retrieval}},
author = {Liu, Ying and Bai, Kun and Mitra, Prasenjit and Giles, C. Lee},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2007},
pages = {317-322},
url = {https://mlanthology.org/aaai/2007/liu2007aaai-tablerank/}
}