Toward Efficient Navigation of Massive-Scale Geo-Textual Streams
Abstract
With the popularization of portable devices, numerous applications continuously produce huge streams of geo-tagged textual data, thus posing challenges to index geo-textual streaming data efficiently, which is an important task in both data management and AI applications, e.g., real-time data streams mining and targeted advertising. This, however, is not possible with the state-of-the-art indexing methods as they focus on search optimizations of static datasets, and have high index maintenance cost. In this paper, we present NQ-tree, which combines new structure designs and self-tuning methods to navigate between update and search efficiency. Our contributions include: (1) the design of multiple stores each with a different emphasis on write-friendness and read-friendness; (2) utilizing data compression techniques to reduce the I/O cost; (3) exploiting both spatial and keyword information to improve the pruning efficiency; (4) proposing an analytical cost model, and using an online self-tuning method to achieve efficient accesses to different workloads. Experiments on two real-world datasets show that NQ-tree outperforms two well designed baselines by up to 10×.
Cite
Text
Yang et al. "Toward Efficient Navigation of Massive-Scale Geo-Textual Streams." International Joint Conference on Artificial Intelligence, 2019. doi:10.24963/IJCAI.2019/672Markdown
[Yang et al. "Toward Efficient Navigation of Massive-Scale Geo-Textual Streams." International Joint Conference on Artificial Intelligence, 2019.](https://mlanthology.org/ijcai/2019/yang2019ijcai-efficient/) doi:10.24963/IJCAI.2019/672BibTeX
@inproceedings{yang2019ijcai-efficient,
title = {{Toward Efficient Navigation of Massive-Scale Geo-Textual Streams}},
author = {Yang, Chengcheng and Chen, Lisi and Shang, Shuo and Zhu, Fan and Liu, Li and Shao, Ling},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2019},
pages = {4838-4845},
doi = {10.24963/IJCAI.2019/672},
url = {https://mlanthology.org/ijcai/2019/yang2019ijcai-efficient/}
}