Subtractive Mixture Models via Squaring: Representation and Learning
Abstract
Mixture models are traditionally represented and learned by adding several distributions as components. Allowing mixtures to subtract probability mass or density can drastically reduce the number of components needed to model complex distributions. However, learning such subtractive mixtures while ensuring they still encode a non-negative function is challenging. We investigate how to learn and perform inference on deep subtractive mixtures by squaring them. We do this in the framework of probabilistic circuits, which enable us to represent tensorized mixtures and generalize several other subtractive models. We theoretically prove that the class of squared circuits allowing subtractions can be exponentially more expressive than traditional additive mixtures; and, we empirically show this increased expressiveness on a series of real-world distribution estimation tasks.
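The core idea summarized in the abstract is to allow negative mixture weights and recover a valid (non-negative) density by squaring the signed mixture and renormalizing. The following is only a minimal illustrative sketch of that idea for a one-dimensional mixture of Gaussians, not the authors' circuit-based implementation; the component means, scales, and weights are made-up toy values.

```python
# Minimal sketch (not the authors' code): a 1-D "squared" mixture with a
# negative weight. Squaring the signed mixture keeps the density
# non-negative; the normalizer expands into pairwise component integrals.
import numpy as np
from scipy.stats import norm

# Toy setup: two Gaussian components, one subtractive (negative) weight.
means = np.array([-1.0, 1.0])
stds = np.array([1.0, 1.0])
w = np.array([1.0, -0.6])  # subtraction is allowed before squaring

def signed_mixture(x):
    # May be negative for some x because of the negative weight.
    return sum(wk * norm.pdf(x, m, s) for wk, m, s in zip(w, means, stds))

def squared_density(x):
    # Normalizer Z = sum_{i,j} w_i w_j * integral of f_i(x) f_j(x) dx.
    # For Gaussians, that integral equals N(m_i - m_j; 0, s_i^2 + s_j^2).
    cross = norm.pdf(means[:, None] - means[None, :], 0.0,
                     np.sqrt(stds[:, None] ** 2 + stds[None, :] ** 2))
    Z = (w[:, None] * w[None, :] * cross).sum()
    return signed_mixture(x) ** 2 / Z

# Sanity check: the squared, renormalized mixture integrates to ~1.
xs = np.linspace(-6.0, 6.0, 4001)
print((squared_density(xs) * (xs[1] - xs[0])).sum())
```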
Cite
Text
Loconte et al. "Subtractive Mixture Models via Squaring: Representation and Learning." International Conference on Learning Representations, 2024.
Markdown
[Loconte et al. "Subtractive Mixture Models via Squaring: Representation and Learning." International Conference on Learning Representations, 2024.](https://mlanthology.org/iclr/2024/loconte2024iclr-subtractive/)
BibTeX
@inproceedings{loconte2024iclr-subtractive,
  title = {{Subtractive Mixture Models via Squaring: Representation and Learning}},
  author = {Loconte, Lorenzo and Sladek, Aleksanteri Mikulus and Mengel, Stefan and Trapp, Martin and Solin, Arno and Gillis, Nicolas and Vergari, Antonio},
  booktitle = {International Conference on Learning Representations},
  year = {2024},
  url = {https://mlanthology.org/iclr/2024/loconte2024iclr-subtractive/}
}