On the Pitfalls of Visual Learning in Referential Games

Abstract

This paper focuses on the effect of game design and visual representations of real-world entities on emergent languages in referential games. Strikingly, we find that the agents in such games can learn to successfully communicate even when provided with visual features from a randomly initialized neural network. Through a series of experiments, we highlight the agents' inability to effectively utilize high-level features. Using Gradient weighted-Class Activation Mapping, we verify that the agents often 'look' at regions not related to entities. Culminating with a positive result, we show how environmental pressure from agent population can nudge the learners into effectively capturing high-level visual features.

Cite

Text

Verma. "On the Pitfalls of Visual Learning in Referential Games." NeurIPS 2022 Workshops: LaReL, 2022.

Markdown

[Verma. "On the Pitfalls of Visual Learning in Referential Games." NeurIPS 2022 Workshops: LaReL, 2022.](https://mlanthology.org/neuripsw/2022/verma2022neuripsw-pitfalls/)

BibTeX

@inproceedings{verma2022neuripsw-pitfalls,
  title     = {{On the Pitfalls of Visual Learning in Referential Games}},
  author    = {Verma, Shresth},
  booktitle = {NeurIPS 2022 Workshops: LaReL},
  year      = {2022},
  url       = {https://mlanthology.org/neuripsw/2022/verma2022neuripsw-pitfalls/}
}