Line Attractor Dynamics in Recurrent Networks for Sentiment Classification

Abstract

Recurrent neural networks (RNNs) are a powerful tool for modeling sequential data. Despite their widespread use, understanding how RNNs solve complex problems remains elusive. Here, we characterize how popular RNN architectures perform document-level sentiment classification. Despite their theoretical capacity to implement complex, high-dimensional computations, we find that trained networks converge to highly interpretable, low-dimensional representations. We identify a simple mechanism, integration along an approximate line attractor, and find that this mechanism is present across RNN architectures (including LSTMs, GRUs, and vanilla RNNs). Overall, these results demonstrate that surprisingly universal and human-interpretable computations can arise across a range of recurrent networks.
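To make the line-attractor claim concrete, the sketch below illustrates the standard fixed-point analysis used for this kind of reverse engineering (in the spirit of Sussillo & Barak's slow-point method): starting from many hidden states, numerically minimize the speed q(h) = ½‖F(h, 0) − h‖² of a trained RNN cell under zero input. This is not the authors' released code; the PyTorch `GRUCell`, the hyperparameters, and the function `find_fixed_points` are illustrative assumptions. For a network that has learned an approximate line attractor, the recovered fixed points trace out a roughly one-dimensional manifold in hidden-state space.

```python
# Hypothetical sketch (not the paper's code): locate approximate fixed
# points h* of a recurrent cell F, i.e. states where F(h*, x=0) ~ h*.
import torch

def find_fixed_points(cell, init_states, n_steps=2000, lr=0.01, tol=1e-6):
    """Minimize the speed q(h) = 0.5 * ||F(h, 0) - h||^2 from many inits.

    cell: a torch.nn.GRUCell (or RNNCell) mapping (input, hidden) -> hidden.
    init_states: (n_points, hidden_size) tensor of starting hidden states,
        e.g. sampled from hidden trajectories on real documents.
    """
    h = init_states.clone().requires_grad_(True)
    x = torch.zeros(h.shape[0], cell.input_size)  # probe with zero input
    opt = torch.optim.Adam([h], lr=lr)
    for _ in range(n_steps):
        opt.zero_grad()
        q = 0.5 * ((cell(x, h) - h) ** 2).sum(dim=1)  # per-point speed
        q.sum().backward()
        opt.step()
    # Keep only the points that converged to (near-)zero speed.
    with torch.no_grad():
        q = 0.5 * ((cell(x, h) - h) ** 2).sum(dim=1)
    return h.detach()[q < tol]

# Usage: a trained sentiment model's cell would be substituted for this
# randomly initialized GRU. Projecting the recovered fixed points onto
# their top principal component typically reveals the approximately
# one-dimensional line attractor described in the abstract.
cell = torch.nn.GRUCell(input_size=64, hidden_size=128)
inits = 0.5 * torch.randn(256, 128)
fixed_points = find_fixed_points(cell, inits)
```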

Cite

Text

Maheswaranathan et al. "Line Attractor Dynamics in Recurrent Networks for Sentiment Classification." ICML 2019 Workshops: Deep_Phenomena, 2019.

Markdown

[Maheswaranathan et al. "Line Attractor Dynamics in Recurrent Networks for Sentiment Classification." ICML 2019 Workshops: Deep_Phenomena, 2019.](https://mlanthology.org/icmlw/2019/maheswaranathan2019icmlw-line/)

BibTeX

@inproceedings{maheswaranathan2019icmlw-line,
  title     = {{Line Attractor Dynamics in Recurrent Networks for Sentiment Classification}},
  author    = {Maheswaranathan, Niru and Williams, Alex H. and Golub, Matthew D. and Ganguli, Surya and Sussillo, David},
  booktitle = {ICML 2019 Workshops: Deep_Phenomena},
  year      = {2019},
  url       = {https://mlanthology.org/icmlw/2019/maheswaranathan2019icmlw-line/}
}