HIEDS: A Generic and Efficient Approach to Hierarchical Dataset Summarization
Abstract
The rapid growth of open data on the Web promotes the development of data portals that facilitate finding useful datasets. To help users quickly inspect a dataset found in a portal, we propose to summarize its contents and generate a hierarchical grouping of entities connected by relations. Our generic approach, called HIEDS, considers coverage of dataset, height of hierarchy, cohesion within groups, overlap between groups, and homogeneity of groups, and integrates these configurable factors into a combinatorial optimization problem to solve. We present an efficient solution, to serve users with dynamically configured summaries with acceptable latency. We systematically experiment with our approach on real-world RDF datasets. PDF
Cite
Text
Cheng et al. "HIEDS: A Generic and Efficient Approach to Hierarchical Dataset Summarization." International Joint Conference on Artificial Intelligence, 2016.Markdown
[Cheng et al. "HIEDS: A Generic and Efficient Approach to Hierarchical Dataset Summarization." International Joint Conference on Artificial Intelligence, 2016.](https://mlanthology.org/ijcai/2016/cheng2016ijcai-hieds/)BibTeX
@inproceedings{cheng2016ijcai-hieds,
title = {{HIEDS: A Generic and Efficient Approach to Hierarchical Dataset Summarization}},
author = {Cheng, Gong and Jin, Cheng and Qu, Yuzhong},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2016},
pages = {3705-3711},
url = {https://mlanthology.org/ijcai/2016/cheng2016ijcai-hieds/}
}