Learning Trees and Rules with Set-Valued Features
Abstract
In most learning systems examples are represented as fixed-length "feature vectors", the components of which are either real numbers or nominal values. We propose an extension of the featurevector representation that allows the value of a feature to be a set of strings; for instance, to represent a small white and black dog with the nominal features size and species and the setvalued feature color, one might use a feature vector with size=small, species=canis-familiaris and color=fwhite, blackg. Since we make no assumptions about the number of possible set elements, this extension of the traditional feature-vector representation is closely connected to Blum's "infinite attribute" representation. We argue that many decision tree and rule learning algorithms can be easily extended to setvalued features. We also show by example that many real-world learning problems can be efficiently and naturally represented with set-valued features; in particular, text categorization problems and probl...
Cite
Text
Cohen. "Learning Trees and Rules with Set-Valued Features." AAAI Conference on Artificial Intelligence, 1996.Markdown
[Cohen. "Learning Trees and Rules with Set-Valued Features." AAAI Conference on Artificial Intelligence, 1996.](https://mlanthology.org/aaai/1996/cohen1996aaai-learning/)BibTeX
@inproceedings{cohen1996aaai-learning,
title = {{Learning Trees and Rules with Set-Valued Features}},
author = {Cohen, William W.},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {1996},
pages = {709-716},
url = {https://mlanthology.org/aaai/1996/cohen1996aaai-learning/}
}