The Illustrated Transformer

Abstract

The Transformer was proposed in the paper Attention is All You Need. A TensorFlow implementation of it is available as a part of the Tensor2Tensor package. Harvard’s NLP group created a guide annotating the paper with PyTorch implementation. In this post, we will attempt to oversimplify things a bit and introduce the concepts one by one to hopefully make it easier to understand to people without in-depth knowledge of the subject matter.

Cite

Text

Alammar. "The Illustrated Transformer." Blog post, 2018.

Markdown

[Alammar. "The Illustrated Transformer." Blog post, 2018.](https://mlanthology.org/misc/2018/alammar2018misc-illustrated/)

BibTeX

@misc{alammar2018misc-illustrated,
  title     = {{The Illustrated Transformer}},
  author    = {Alammar, Jay},
  howpublished = {Blog post},
  year      = {2018},
  url       = {https://mlanthology.org/misc/2018/alammar2018misc-illustrated/}
}