Adaptive Regularization of Weight Vectors
Abstract
We present AROW, an online learning algorithm for binary and multiclass problems that combines large margin training, confidence weighting, and the capacity to handle non-separable data. AROW performs adaptive regularization of the prediction function upon seeing each new instance, allowing it to perform especially well in the presence of label noise. We derive mistake bounds for the binary and multiclass settings that are similar in form to the second order perceptron bound. Our bounds do not assume separability. We also relate our algorithm to recent confidence-weighted online learning techniques. Empirical evaluations show that AROW achieves state-of-the-art performance on a wide range of binary and multiclass tasks, as well as robustness in the face of non-separable data.
Cite
Text
Crammer et al. "Adaptive Regularization of Weight Vectors." Machine Learning, 2013. doi:10.1007/S10994-013-5327-XMarkdown
[Crammer et al. "Adaptive Regularization of Weight Vectors." Machine Learning, 2013.](https://mlanthology.org/mlj/2013/crammer2013mlj-adaptive/) doi:10.1007/S10994-013-5327-XBibTeX
@article{crammer2013mlj-adaptive,
title = {{Adaptive Regularization of Weight Vectors}},
author = {Crammer, Koby and Kulesza, Alex and Dredze, Mark},
journal = {Machine Learning},
year = {2013},
pages = {155-187},
doi = {10.1007/S10994-013-5327-X},
volume = {91},
url = {https://mlanthology.org/mlj/2013/crammer2013mlj-adaptive/}
}