Approximating Value Equivalence in Interactive Dynamic Influence Diagrams Using Behavioral Coverage
Abstract
Interactive dynamic influence diagrams~(I-DIDs) provide an explicit way of modeling how a subject agent solves decision making problems in the presence of other agents in a common setting. To optimize its decisions, the subject agent needs to predict the other agents' behavior, that is generally obtained by solving their candidate models. This becomes extremely difficult since the model space may be rather large, and grows when the other agents act and observe over the time. A recent proposal for solving I-DIDs lies in a concept of value equivalence (VE) that shows potential advances on significantly reducing the model space. In this paper, we establish a principled framework to implement the VE techniques and propose an approximate method to compute VE of candidate models. The development offers ample opportunity of exploiting VE to further improve the scalability of I-DID solutions. We theoretically analyze properties of the approximate techniques and show empirical results in multiple problem domains. PDF
Cite
Text
Conroy et al. "Approximating Value Equivalence in Interactive Dynamic Influence Diagrams Using Behavioral Coverage." International Joint Conference on Artificial Intelligence, 2016.Markdown
[Conroy et al. "Approximating Value Equivalence in Interactive Dynamic Influence Diagrams Using Behavioral Coverage." International Joint Conference on Artificial Intelligence, 2016.](https://mlanthology.org/ijcai/2016/conroy2016ijcai-approximating/)BibTeX
@inproceedings{conroy2016ijcai-approximating,
title = {{Approximating Value Equivalence in Interactive Dynamic Influence Diagrams Using Behavioral Coverage}},
author = {Conroy, Ross and Zeng, Yifeng and Tang, Jing},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2016},
pages = {201-207},
url = {https://mlanthology.org/ijcai/2016/conroy2016ijcai-approximating/}
}