Sequential Approaches for Learning Datum-Wise Sparse Representations
Abstract
In supervised classification, data representation is usually considered at the dataset level: one looks for the “best” representation of data assuming it to be the same for all the data in the data space. We propose a different approach where the representations used for classification are tailored to each datum in the data space. One immediate goal is to obtain sparse datum-wise representations : our approach learns to build a representation specific to each datum that contains only a small subset of the features, thus allowing classification to be fast and efficient. This representation is obtained by way of a sequential decision process that sequentially chooses which features to acquire before classifying a particular point; this process is learned through algorithms based on Reinforcement Learning. The proposed method performs well on an ensemble of medium-sized sparse classification problems. It offers an alternative to global sparsity approaches, and is a natural framework for sequential classification problems. The method extends easily to a whole family of sparsity-related problem which would otherwise require developing specific solutions. This is the case in particular for cost-sensitive and limited-budget classification, where feature acquisition is costly and is often performed sequentially. Finally, our approach can handle non-differentiable loss functions or combinatorial optimization encountered in more complex feature selection problems.
Cite
Text
Dulac-Arnold et al. "Sequential Approaches for Learning Datum-Wise Sparse Representations." Machine Learning, 2012. doi:10.1007/S10994-012-5306-7Markdown
[Dulac-Arnold et al. "Sequential Approaches for Learning Datum-Wise Sparse Representations." Machine Learning, 2012.](https://mlanthology.org/mlj/2012/dulacarnold2012mlj-sequential/) doi:10.1007/S10994-012-5306-7BibTeX
@article{dulacarnold2012mlj-sequential,
title = {{Sequential Approaches for Learning Datum-Wise Sparse Representations}},
author = {Dulac-Arnold, Gabriel and Denoyer, Ludovic and Preux, Philippe and Gallinari, Patrick},
journal = {Machine Learning},
year = {2012},
pages = {87-122},
doi = {10.1007/S10994-012-5306-7},
volume = {89},
url = {https://mlanthology.org/mlj/2012/dulacarnold2012mlj-sequential/}
}