NetSDM: Semantic Data Mining with Network Analysis

Abstract

Semantic data mining (SDM) is a form of relational data mining that uses annotated data together with complex semantic background knowledge to learn rules that can be easily interpreted. The drawback of SDM is a high computational complexity of existing SDM algorithms, resulting in long run times even when applied to relatively small data sets. This paper proposes an effective SDM approach, named NetSDM, which first transforms the available semantic background knowledge into a network format, followed by network analysis based node ranking and pruning to significantly reduce the size of the original background knowledge. The experimental evaluation of the NetSDM methodology on acute lymphoblastic leukemia and breast cancer data demonstrates that NetSDM achieves radical time efficiency improvements and that learned rules are comparable or better than the rules obtained by the original SDM algorithms.

Cite

Text

Kralj et al. "NetSDM: Semantic Data Mining with Network Analysis." Journal of Machine Learning Research, 2019.

Markdown

[Kralj et al. "NetSDM: Semantic Data Mining with Network Analysis." Journal of Machine Learning Research, 2019.](https://mlanthology.org/jmlr/2019/kralj2019jmlr-netsdm/)

BibTeX

@article{kralj2019jmlr-netsdm,
  title     = {{NetSDM: Semantic Data Mining with Network Analysis}},
  author    = {Kralj, Jan and Robnik-Sikonja, Marko and Lavrac, Nada},
  journal   = {Journal of Machine Learning Research},
  year      = {2019},
  pages     = {1-50},
  volume    = {20},
  url       = {https://mlanthology.org/jmlr/2019/kralj2019jmlr-netsdm/}
}