Start Small, Think Big: On Hyperparameter Optimization for Large-Scale Knowledge Graph Embeddings

Abstract

Knowledge graph embedding (KGE) models are an effective and popular approach to represent and reason with multi-relational data. Prior studies have shown that KGE models are sensitive to hyperparameter settings, however, and that suitable choices are dataset-dependent. In this paper, we explore hyperparameter optimization (HPO) for very large knowledge graphs, where the cost of evaluating individual hyperparameter configurations is excessive. Prior studies often avoided this cost by using various heuristics; e.g., by training on a subgraph or by using fewer epochs. We systematically discuss and evaluate the quality and cost savings of such heuristics and other low-cost approximation techniques. Based on our findings, we introduce GraSH , an efficient multi-fidelity HPO algorithm for large-scale KGEs that combines both graph and epoch reduction techniques and runs in multiple rounds of increasing fidelities. We conducted an experimental study and found that GraSH obtains state-of-the-art results on large graphs at a low cost (three complete training runs in total). Source code and auxiliary material at https://github.com/uma-pi1/GraSH .

Cite

Text

Kochsiek et al. "Start Small, Think Big: On Hyperparameter Optimization for Large-Scale Knowledge Graph Embeddings." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2022. doi:10.1007/978-3-031-26390-3_9

Markdown

[Kochsiek et al. "Start Small, Think Big: On Hyperparameter Optimization for Large-Scale Knowledge Graph Embeddings." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2022.](https://mlanthology.org/ecmlpkdd/2022/kochsiek2022ecmlpkdd-start/) doi:10.1007/978-3-031-26390-3_9

BibTeX

@inproceedings{kochsiek2022ecmlpkdd-start,
  title     = {{Start Small, Think Big: On Hyperparameter Optimization for Large-Scale Knowledge Graph Embeddings}},
  author    = {Kochsiek, Adrian and Niesel, Fritz and Gemulla, Rainer},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2022},
  pages     = {138-154},
  doi       = {10.1007/978-3-031-26390-3_9},
  url       = {https://mlanthology.org/ecmlpkdd/2022/kochsiek2022ecmlpkdd-start/}
}