Factorization Ranking Model for Move Prediction in the Game of Go

Abstract

In this paper, we investigate the move prediction problem in the game of Go by proposing a new ranking model named Factorization Bradley Terry (FBT) model. This new model considers the move prediction problem as group competitions while also taking the interaction between features into account. A FBT model is able to provide a probability distribution that expresses a preference over moves. Therefore it can be easily compiled into an evaluation function and applied in a modern Go program. We propose a Stochastic Gradient Decent (SGD) algorithm to train a FBT model using expert game records, and provide two methods for fast computation of the gradient in order to speed up the training process. Experimental results show that our FBT model outperforms the state-of-the-art move prediction system of Latent Factor Ranking (LFR).

Cite

Text

Xiao and Müller. "Factorization Ranking Model for Move Prediction in the Game of Go." AAAI Conference on Artificial Intelligence, 2016. doi:10.1609/AAAI.V30I1.10180

Markdown

[Xiao and Müller. "Factorization Ranking Model for Move Prediction in the Game of Go." AAAI Conference on Artificial Intelligence, 2016.](https://mlanthology.org/aaai/2016/xiao2016aaai-factorization/) doi:10.1609/AAAI.V30I1.10180

BibTeX

@inproceedings{xiao2016aaai-factorization,
  title     = {{Factorization Ranking Model for Move Prediction in the Game of Go}},
  author    = {Xiao, Chenjun and Müller, Martin},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2016},
  pages     = {1359-1365},
  doi       = {10.1609/AAAI.V30I1.10180},
  url       = {https://mlanthology.org/aaai/2016/xiao2016aaai-factorization/}
}