Usage-Centric Benchmarking of RDF Triple Stores

Abstract

A central component in many applications is the underlying data management layer. In Data-Web applications, the central component of this layer is the triple store. It is thus evident that finding the most adequate store for the application to develop is of crucial importance for individual projects as well as for data integration on the Data Web in general. In this paper, we propose a generic benchmark creation procedure for SPARQL, which we apply to the DBpedia knowledge base. In contrast to previous approaches, our benchmark is based on queries that were actually issued by humans and applications against existing RDF data not resembling a relational schema. In addition, our approach does not only take the query string but also the features of the queries into consideration during the benchmark generation process. Our generic procedure for benchmark creation is based on query-log mining, SPARQL feature analysis and clustering. After presenting the method underlying our benchmark generation algorithm, we use the generated benchmark to compare the popular triple store implementations Virtuoso, Sesame, Jena-TDB, and BigOWLIM.

Cite

Text

Morsey et al. "Usage-Centric Benchmarking of RDF Triple Stores." AAAI Conference on Artificial Intelligence, 2012. doi:10.1609/AAAI.V26I1.8448

Markdown

[Morsey et al. "Usage-Centric Benchmarking of RDF Triple Stores." AAAI Conference on Artificial Intelligence, 2012.](https://mlanthology.org/aaai/2012/morsey2012aaai-usage/) doi:10.1609/AAAI.V26I1.8448

BibTeX

@inproceedings{morsey2012aaai-usage,
  title     = {{Usage-Centric Benchmarking of RDF Triple Stores}},
  author    = {Morsey, Mohamed and Lehmann, Jens and Auer, Sören and Ngomo, Axel-Cyrille Ngonga},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2012},
  pages     = {2134-2140},
  doi       = {10.1609/AAAI.V26I1.8448},
  url       = {https://mlanthology.org/aaai/2012/morsey2012aaai-usage/}
}