Generalization-Based Similarity for Conceptual Clustering
Abstract
Knowledge extraction represents an important issue that concerns the ability to identify valid, potentially useful and understandable patterns from large data collections. Such a task becomes more difficult if the domain of application cannot be represented by means of an attribute-value representation. Thus, a more powerful representation language, such as First-Order Logic, is necessary. Due to the complexity of handling First-Order Logic formulæ, where the presence of relations causes various portions of one description to be possibly mapped in different ways onto another description, few works presenting techniques for comparing descriptions are available in the literature for this kind of representation. Nevertheless, the ability to assess similarity between first-order descriptions has many applications, ranging from description selection to flexible matching, from instance-based learning to clustering. This paper tackles the case of Conceptual Clustering, where a new approach to similarity evaluation, based on both syntactic and semantic features, is exploited to support the task of grouping together similar items according to their relational description. After presenting a framework for Horn Clauses (including criteria, a function and composition techniques for similarity assessment), classical clustering algorithms are exploited to carry out the grouping task. Experimental results on real-world datasets prove the effectiveness of the proposal.
Cite
Text
Ferilli et al. "Generalization-Based Similarity for Conceptual Clustering." European Conference on Machine Learning, 2007. doi:10.1007/978-3-540-68416-9_2
Markdown
[Ferilli et al. "Generalization-Based Similarity for Conceptual Clustering." European Conference on Machine Learning, 2007.](https://mlanthology.org/ecmlpkdd/2007/ferilli2007ecml-generalizationbased/) doi:10.1007/978-3-540-68416-9_2
BibTeX
@inproceedings{ferilli2007ecml-generalizationbased,
title = {{Generalization-Based Similarity for Conceptual Clustering}},
author = {Ferilli, Stefano and Basile, Teresa Maria Altomare and Di Mauro, Nicola and Biba, Marenglen and Esposito, Floriana},
booktitle = {European Conference on Machine Learning},
year = {2007},
pages = {13-26},
doi = {10.1007/978-3-540-68416-9_2},
url = {https://mlanthology.org/ecmlpkdd/2007/ferilli2007ecml-generalizationbased/}
}