HC-Search: Learning Heuristics and Cost Functions for Structured Prediction

Abstract

Structured prediction is the problem of learning a function from structured inputs to structured outputs with prototypical examples being part-of-speech tagging and image labeling. Inspired by the recent successes of search-based structured prediction, we introduce a new framework for structured prediction called {\em HC-Search}. Given a structured input, the framework uses a search procedure guided by a learned heuristic H to uncover high quality candidate outputs and then uses a separate learned cost function C to select a final prediction among those outputs. We can decompose the regret of the overall approach into the loss due to H not leading to high quality outputs, and the loss due to C not selecting the best among the generated outputs. Guided by this decomposition, we minimize the overall regret in a greedy stage-wise manner by first training H to quickly uncover high quality outputs via imitation learning, and then training C to correctly rank the outputs generated via H according to their true losses. Experiments on several benchmark domains show that our approach significantly outperforms the state-of-the-art methods.

Cite

Text

Doppa et al. "HC-Search: Learning Heuristics and Cost Functions for Structured Prediction." AAAI Conference on Artificial Intelligence, 2013. doi:10.1609/AAAI.V27I1.8664

Markdown

[Doppa et al. "HC-Search: Learning Heuristics and Cost Functions for Structured Prediction." AAAI Conference on Artificial Intelligence, 2013.](https://mlanthology.org/aaai/2013/doppa2013aaai-hc/) doi:10.1609/AAAI.V27I1.8664

BibTeX

@inproceedings{doppa2013aaai-hc,
  title     = {{HC-Search: Learning Heuristics and Cost Functions for Structured Prediction}},
  author    = {Doppa, Janardhan Rao and Fern, Alan and Tadepalli, Prasad},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2013},
  pages     = {253-259},
  doi       = {10.1609/AAAI.V27I1.8664},
  url       = {https://mlanthology.org/aaai/2013/doppa2013aaai-hc/}
}