Knowledge Elicitation Using Deep Metric Learning and Psychometric Testing

Abstract

Knowledge present in a domain is well expressed as relationships between corresponding concepts. For example, in zoology, animal species form complex hierarchies; in genomics, the different (parts of) molecules are organized in groups and subgroups based on their functions; plants, molecules, and astronomical objects all form complex taxonomies. Nevertheless, when applying supervised machine learning (ML) in such domains, we commonly reduce the complex and rich knowledge to a fixed set of labels, and induce a model shows good generalization performance with respect to these labels. The main reason for such a reductionist approach is the difficulty in eliciting the domain knowledge from the experts. Developing a label structure with sufficient fidelity and providing comprehensive multi-label annotation can be exceedingly labor-intensive in many real-world applications. In this paper, we provide a method for efficient hierarchical knowledge elicitation (HKE) from experts working with high-dimensional data such as images or videos. Our method is based on psychometric testing and active deep metric learning. The developed models embed the high-dimensional data in a metric space where distances are semantically meaningful, and the data can be organized in a hierarchical structure. We provide empirical evidence with a series of experiments on a synthetically generated dataset of simple shapes, and Cifar 10 and Fashion-MNIST benchmarks that our method is indeed successful in uncovering hierarchical structures.

Cite

Text

Yin et al. "Knowledge Elicitation Using Deep Metric Learning and Psychometric Testing." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2020. doi:10.1007/978-3-030-67661-2_10

Markdown

[Yin et al. "Knowledge Elicitation Using Deep Metric Learning and Psychometric Testing." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2020.](https://mlanthology.org/ecmlpkdd/2020/yin2020ecmlpkdd-knowledge/) doi:10.1007/978-3-030-67661-2_10

BibTeX

@inproceedings{yin2020ecmlpkdd-knowledge,
  title     = {{Knowledge Elicitation Using Deep Metric Learning and Psychometric Testing}},
  author    = {Yin, Lu and Menkovski, Vlado and Pechenizkiy, Mykola},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2020},
  pages     = {154-169},
  doi       = {10.1007/978-3-030-67661-2_10},
  url       = {https://mlanthology.org/ecmlpkdd/2020/yin2020ecmlpkdd-knowledge/}
}