Continuous Value Function Approximation for Sequential Bidding Policies
Abstract
Market-based mechanisms such as auctions are being studied as an appropriate means for resource allocation in distributed and multiagent decision problems. When agents value resources in combination rather than in isolation. they must often deliberate about appropriate bidding strategies for a sequence of auctions offering resources of interest. We briefly describe a discrete dynamic programming model for constructing appropriate bidding policies for resources exhibiting both complementarities substitutability. We then introduce a continuous approximation of this model, assuming that money (or the numeraire good) is infinitely divisible. Though this has the potential to reduce the computational cost of computing policies, value functions in the transformed problem do not have a convenient closed form representation. We develop grid-based approximations for such value functions, representing value functions using piecewise linear approximations. We show that these methods can offer significant computational savings with relatively small cost in solution quality.
Cite
Text
Boutilier et al. "Continuous Value Function Approximation for Sequential Bidding Policies." Conference on Uncertainty in Artificial Intelligence, 1999.Markdown
[Boutilier et al. "Continuous Value Function Approximation for Sequential Bidding Policies." Conference on Uncertainty in Artificial Intelligence, 1999.](https://mlanthology.org/uai/1999/boutilier1999uai-continuous/)BibTeX
@inproceedings{boutilier1999uai-continuous,
title = {{Continuous Value Function Approximation for Sequential Bidding Policies}},
author = {Boutilier, Craig and Goldszmidt, Moisés and Sabata, Bikash},
booktitle = {Conference on Uncertainty in Artificial Intelligence},
year = {1999},
pages = {81-90},
url = {https://mlanthology.org/uai/1999/boutilier1999uai-continuous/}
}