Multiple Instance Learning with Bag-Level Randomized Trees

Abstract

Knowledge discovery in databases with a flexible structure poses a great challenge to machine learning community. Multiple Instance Learning (MIL) aims at learning from samples (called bags) represented by multiple feature vectors (called instances) as opposed to single feature vectors characteristic for the traditional data representation. This relaxation turns out to be useful in formulating many machine learning problems including classification of molecules, cancer detection from tissue images or identification of malicious network communications. However, despite the recent progress in this area, the current set of MIL tools still seems to be very application specific and/or burdened with many tuning parameters or processing steps. In this paper, we propose a simple, yet effective tree-based algorithm for solving MIL classification problems. Empirical evaluation against 28 classifiers on 29 publicly available benchmark datasets shows a high level performance of the proposed solution even with its default parameter settings. Data related to this paper are available at: https://github.com/komartom/MIDatasets.jl . Code related to this paper is available at: https://github.com/komartom/BLRT.jl .

Cite

Text

Komárek and Somol. "Multiple Instance Learning with Bag-Level Randomized Trees." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2018. doi:10.1007/978-3-030-10925-7_16

Markdown

[Komárek and Somol. "Multiple Instance Learning with Bag-Level Randomized Trees." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2018.](https://mlanthology.org/ecmlpkdd/2018/komarek2018ecmlpkdd-multiple/) doi:10.1007/978-3-030-10925-7_16

BibTeX

@inproceedings{komarek2018ecmlpkdd-multiple,
  title     = {{Multiple Instance Learning with Bag-Level Randomized Trees}},
  author    = {Komárek, Tomás and Somol, Petr},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2018},
  pages     = {259-272},
  doi       = {10.1007/978-3-030-10925-7_16},
  url       = {https://mlanthology.org/ecmlpkdd/2018/komarek2018ecmlpkdd-multiple/}
}