Confidence Based Dual Reinforcement Q-Routing: An Adaptive Online Network Routing Algorithm

Abstract

This paper describes and evaluates the Confidence-based Dual Reinforcement QRouting algorithm (CDRQ-Routing) for adaptive packet routing in communication networks. CDRQ-Routing is based on an application of the Q-learning framework to network routing, as first proposed by Littman and Boyan (1993). The main contribution of CDRQ-routing is an increased quantity and an improved quality of exploration. Compared to Q-Routing, the state-of-the-art adaptive Bellman-Ford Routing algorithm, and the non-adaptive shortest path method, CDRQ-Routing learns superior policies significantly faster. Moreover, the overhead due to exploration is shown to be insignificant compared to the improvements achieved, which makes CDRQ-Routing a practical method for real communication networks. 1 Introduction In a communication network information is transferred from one node to another as data packets [ Tanenbaum, 1989 ] . The process of sending a packet P (s; d) from its source node s to its destination node d ...

Cite

Text

Kumar and Miikkulainen. "Confidence Based Dual Reinforcement Q-Routing: An Adaptive Online Network Routing Algorithm." International Joint Conference on Artificial Intelligence, 1999.

Markdown

[Kumar and Miikkulainen. "Confidence Based Dual Reinforcement Q-Routing: An Adaptive Online Network Routing Algorithm." International Joint Conference on Artificial Intelligence, 1999.](https://mlanthology.org/ijcai/1999/kumar1999ijcai-confidence/)

BibTeX

@inproceedings{kumar1999ijcai-confidence,
  title     = {{Confidence Based Dual Reinforcement Q-Routing: An Adaptive Online Network Routing Algorithm}},
  author    = {Kumar, Shailesh and Miikkulainen, Risto},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {1999},
  pages     = {758-763},
  url       = {https://mlanthology.org/ijcai/1999/kumar1999ijcai-confidence/}
}