A Reinforcement Learning Framework for Combinatorial Optimization
Abstract
The combination of reinforcement learning methods with neural networks has found success on a growing number of large-scale applications, including backgam-mon move selection (Tesauro 1992), elevator control (Crites & Barto 1996), and job-shop scheduling (Zhang & Dietterich 1995). In this work, we modify and generalize the scheduling paradigm used by Zhang and Di-etterich to produce a general reinforcement-learning-based framework for combinatorial optimization. The problem of combinatorial optimization is simply stated: given a finite state space X and an objective function f: X--+ 532, find an optimal state z * = argmax, eX f(x). Typically, X is huge, and finding an optimal x * is intractable. However, there are many effective heuristic algorithms that attempt to exploit
Cite
Text
Boyan. "A Reinforcement Learning Framework for Combinatorial Optimization." AAAI Conference on Artificial Intelligence, 1996.Markdown
[Boyan. "A Reinforcement Learning Framework for Combinatorial Optimization." AAAI Conference on Artificial Intelligence, 1996.](https://mlanthology.org/aaai/1996/boyan1996aaai-reinforcement/)BibTeX
@inproceedings{boyan1996aaai-reinforcement,
title = {{A Reinforcement Learning Framework for Combinatorial Optimization}},
author = {Boyan, Justin A.},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {1996},
pages = {1379},
url = {https://mlanthology.org/aaai/1996/boyan1996aaai-reinforcement/}
}