Improvements to the Sequence Memoizer

Abstract

The sequence memoizer is a model for sequence data with state-of-the-art performance on language modeling and compression. We propose a number of improvements to the model and inference algorithm, including an enlarged range of hyperparameters, a memory-efficient representation, and inference algorithms operating on the new representation. Our derivations are based on precise definitions of the various processes that will also allow us to provide an elementary proof of the mysterious" coagulation and fragmentation properties used in the original paper on the sequence memoizer by Wood et al. (2009). We present some experimental results supporting our improvements."

Cite

Text

Gasthaus and Teh. "Improvements to the Sequence Memoizer." Neural Information Processing Systems, 2010.

Markdown

[Gasthaus and Teh. "Improvements to the Sequence Memoizer." Neural Information Processing Systems, 2010.](https://mlanthology.org/neurips/2010/gasthaus2010neurips-improvements/)

BibTeX

@inproceedings{gasthaus2010neurips-improvements,
  title     = {{Improvements to the Sequence Memoizer}},
  author    = {Gasthaus, Jan and Teh, Yee W.},
  booktitle = {Neural Information Processing Systems},
  year      = {2010},
  pages     = {685-693},
  url       = {https://mlanthology.org/neurips/2010/gasthaus2010neurips-improvements/}
}