One-Shot Compression of Large Edge-Exchangeable Graphs Using Bits-Back Coding

Abstract

We present a one-shot method for compressing large labeled graphs called Random Edge Coding. When paired with a parameter-free model based on Pólya’s Urn, the worst-case computational and memory complexities scale quasi-linearly and linearly with the number of observed edges, making it efficient on sparse graphs, and requires only integer arithmetic. Key to our method is bits-back coding, which is used to sample edges and vertices without replacement from the edge-list in a way that preserves the structure of the graph. Optimality is proven under a class of random graph models that are invariant to permutations of the edges and of vertices within an edge. Experiments indicate Random Edge Coding can achieve competitive compression performance on real-world network datasets and scales to graphs with millions of nodes and edges.

Cite

Text

Severo et al. "One-Shot Compression of Large Edge-Exchangeable Graphs Using Bits-Back Coding." International Conference on Machine Learning, 2023.

Markdown

[Severo et al. "One-Shot Compression of Large Edge-Exchangeable Graphs Using Bits-Back Coding." International Conference on Machine Learning, 2023.](https://mlanthology.org/icml/2023/severo2023icml-oneshot/)

BibTeX

@inproceedings{severo2023icml-oneshot,
  title     = {{One-Shot Compression of Large Edge-Exchangeable Graphs Using Bits-Back Coding}},
  author    = {Severo, Daniel and Townsend, James and Khisti, Ashish J and Makhzani, Alireza},
  booktitle = {International Conference on Machine Learning},
  year      = {2023},
  pages     = {30633-30645},
  volume    = {202},
  url       = {https://mlanthology.org/icml/2023/severo2023icml-oneshot/}
}