Accurate Max-Margin Training for Structured Output Spaces
Abstract
Tsochantaridis et al 2005 proposed two formulations for maximum margin training of structured spaces: margin scaling and slack scaling. While margin scaling has been extensively used since it requires the same kind of MAP inference as normal structured prediction, slack scaling is believed to be more accurate and better-behaved. We present an efficient variational approximation to the slack scaling method that solves its inference bottleneck while retaining its accuracy advantage over margin scaling. We further argue that existing scaling approaches do not separate the true labeling comprehensively while generating violating constraints. We propose a new max-margin trainer PosLearn that generates violators to ensure separation at each position of a decomposable loss function. Empirical results on real datasets illustrate that PosLearn can reduce test error by up to 25%. Further, PosLearn violators can be generated more efficiently than slack violators; for many structured tasks the time required is just twice that of MAP inference.
Cite
Text
Sarawagi and Gupta. "Accurate Max-Margin Training for Structured Output Spaces." International Conference on Machine Learning, 2008. doi:10.1145/1390156.1390268Markdown
[Sarawagi and Gupta. "Accurate Max-Margin Training for Structured Output Spaces." International Conference on Machine Learning, 2008.](https://mlanthology.org/icml/2008/sarawagi2008icml-accurate/) doi:10.1145/1390156.1390268BibTeX
@inproceedings{sarawagi2008icml-accurate,
title = {{Accurate Max-Margin Training for Structured Output Spaces}},
author = {Sarawagi, Sunita and Gupta, Rahul},
booktitle = {International Conference on Machine Learning},
year = {2008},
pages = {888-895},
doi = {10.1145/1390156.1390268},
url = {https://mlanthology.org/icml/2008/sarawagi2008icml-accurate/}
}