Compressing Streamable Free-Viewpoint Videos to 0.1 MB per Frame

Tang, Luyang; Yang, Jiayu; Peng, Rui; Zhai, Yongqi; Shen, Shihe; Wang, Ronggang

doi:10.1609/AAAI.V39I7.32780

Compressing Streamable Free-Viewpoint Videos to 0.1 MB per Frame

Luyang Tang, Jiayu Yang, Rui Peng, Yongqi Zhai, Shihe Shen, Ronggang Wang

AAAI 2025 pp. 7257-7265

doi:10.1609/AAAI.V39I7.32780 /aaai/2025/tang2025aaai-compressing/

Abstract

The success of 3D Gaussian Splatting (3DGS) in static scenes has inspired numerous attempts to construct Free-Viewpoint Videos (FVVs) of dynamic scenes from multi-view videos. Despite advancements in current techniques, simultaneously achieving photo-realistic view synthesis results, fast on-the-fly training, real-time rendering, and low storage costs remains a formidable problem. To address these challenges, we propose the first Gaussian-based streamable FVV intelligent compression framework named iFVC. Specifically, we utilize an anchor-based Gaussian representation to model the scene. To achieve on-the-fly training, we propose a Binary Transformation Cache (BTC) to model the dynamic changes between adjacent timesteps, which not only ensures compactness but also supports precise bit rate estimation. Furthermore, we carefully design a high-resolution transformation tri-plane assisted by a saliency grid as our BTC, allowing for accurate dynamic capture. The entire pipeline is regarded as a joint optimization of rate and distortion to achieve optimal compression performance. Experiments on widely used datasets demonstrate the state-of-the-art performance of our framework in both synthesis quality and efficiency, i.e., achieving per-frame training in 13 seconds with a storage cost of 0.1 MB and real-time rendering at 120 FPS.

PDF AAAI Semantic Scholar

Cite

Text

Tang et al. "Compressing Streamable Free-Viewpoint Videos to 0.1 MB per Frame." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I7.32780

Markdown

[Tang et al. "Compressing Streamable Free-Viewpoint Videos to 0.1 MB per Frame." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/tang2025aaai-compressing/) doi:10.1609/AAAI.V39I7.32780

BibTeX

@inproceedings{tang2025aaai-compressing,
  title     = {{Compressing Streamable Free-Viewpoint Videos to 0.1 MB per Frame}},
  author    = {Tang, Luyang and Yang, Jiayu and Peng, Rui and Zhai, Yongqi and Shen, Shihe and Wang, Ronggang},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {7257-7265},
  doi       = {10.1609/AAAI.V39I7.32780},
  url       = {https://mlanthology.org/aaai/2025/tang2025aaai-compressing/}
}