A Reinforcement Learning Framework for Combinatorial Optimization

Boyan, Justin A.

AAAI 1996 pp. 1379

Abstract

The combination of reinforcement learning methods with neural networks has found success on a growing number of large-scale applications, including backgammon move selection (Tesauro 1992), elevator control (Crites & Barto 1996), and job-shop scheduling (Zhang & Dietterich 1995). In this work, we modify and generalize the scheduling paradigm used by Zhang and Dietterich to produce a general reinforcement-learning-based framework for combinatorial optimization. The problem of combinatorial optimization is simply stated: given a finite state space $X$ and an objective function $f: X \to \mathbb{R}$, find an optimal state $x^* = \arg\max_{x \in X} f(x)$. Typically, $X$ is huge, and finding an optimal $x^*$ is intractable. However, there are many effective heuristic algorithms that attempt to exploit
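As a minimal illustration of the problem statement only (not of the paper's framework), the sketch below finds the optimal state by exhaustive enumeration on a toy instance. The state space of 4-bit tuples, the `TARGET` pattern, the objective `f`, and the helper `brute_force_argmax` are all hypothetical choices made for this example.

```python
from itertools import product

# Hypothetical toy instance: X is the set of all 4-bit tuples.
TARGET = (1, 0, 1, 1)


def f(x):
    """Toy objective: number of positions where x agrees with TARGET."""
    return sum(1 for a, b in zip(x, TARGET) if a == b)


def brute_force_argmax(states, objective):
    """Return a state maximizing the objective by checking every state."""
    return max(states, key=objective)


X = list(product((0, 1), repeat=4))  # finite state space, |X| = 2**4
x_star = brute_force_argmax(X, f)
print(x_star, f(x_star))
```

Enumeration is feasible here because the toy space has only 16 states. With n bits the space has 2**n states, which is why exhaustive search stops being practical as X grows.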

Cite

Text

Boyan. "A Reinforcement Learning Framework for Combinatorial Optimization." AAAI Conference on Artificial Intelligence, 1996.

Markdown

[Boyan. "A Reinforcement Learning Framework for Combinatorial Optimization." AAAI Conference on Artificial Intelligence, 1996.](https://mlanthology.org/aaai/1996/boyan1996aaai-reinforcement/)

BibTeX

@inproceedings{boyan1996aaai-reinforcement,
  title     = {{A Reinforcement Learning Framework for Combinatorial Optimization}},
  author    = {Boyan, Justin A.},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {1996},
  pages     = {1379},
  url       = {https://mlanthology.org/aaai/1996/boyan1996aaai-reinforcement/}
}