Online Feature Selection by Adaptive Sub-Gradient Methods

Abstract

The overall goal of online feature selection is to iteratively select, from high-dimensional streaming data, a small, “budgeted” number of features for constructing accurate predictors. In this paper, we address the online feature selection problem using novel truncation techniques for two online sub-gradient methods: Adaptive Regularized Dual Averaging (ARDA) and Adaptive Mirror Descent (AMD). The corresponding truncation-based algorithms are called B-ARDA and B-AMD, respectively. The key aspect of our truncation techniques is to take into account the magnitude of feature values in the current predictor, together with their frequency in the history of predictions. A detailed regret analysis for both algorithms is provided. Experiments on six high-dimensional datasets indicate that both B-ARDA and B-AMD outperform two advanced online feature selection algorithms, OFS and SOFS, especially when the number of selected features is small. Compared to sparse online learning algorithms that use $\ell _1$ regularization, B-ARDA is superior to $\ell _1$ -ARDA, and B-AMD is superior to Ada-Fobos. Code related to this paper is available at: https://github.com/LUCKY-ting/online-feature-selection .

Cite

Text

Zhai et al. "Online Feature Selection by Adaptive Sub-Gradient Methods." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2018. doi:10.1007/978-3-030-10928-8_26

Markdown

[Zhai et al. "Online Feature Selection by Adaptive Sub-Gradient Methods." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2018.](https://mlanthology.org/ecmlpkdd/2018/zhai2018ecmlpkdd-online/) doi:10.1007/978-3-030-10928-8_26

BibTeX

@inproceedings{zhai2018ecmlpkdd-online,
  title     = {{Online Feature Selection by Adaptive Sub-Gradient Methods}},
  author    = {Zhai, Tingting and Wang, Hao and Koriche, Frédéric and Gao, Yang},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2018},
  pages     = {430-446},
  doi       = {10.1007/978-3-030-10928-8_26},
  url       = {https://mlanthology.org/ecmlpkdd/2018/zhai2018ecmlpkdd-online/}
}