Learned Rate Control for Frame-Level Adaptive Neural Video Compression via Dynamic Neural Network
Abstract
Neural Video Compression (NVC) has achieved remarkable performance in recent years. However, precise rate control remains a challenge due to the inherent limitations of learning-based codecs. To solve this issue, we propose a dynamic video compression framework designed for variable bitrate scenarios. First, to achieve variable bitrate implementation, we propose the Dynamic-Route Autoencoder with variable coding routes, each occupying partial computational complexity of the whole network and navigating to a distinct RD trade-off. Second, to approach the target bitrate, the Rate Control Agent estimates the bitrate of each route and adjusts the coding route of DRA at run time. To encompass a broad spectrum of variable bitrates while preserving overall RD performance, we employ the Joint-Routes Optimization strategy, achieving collaborative training of various routes. Extensive experiments on the HEVC and UVG datasets show that the proposed method achieves an average BD-Rate reduction of 14.8% and BD-PSNR gain of 0.47dB over state-of-the-art methods while maintaining an average bitrate error of 1.66%, achieving Rate-Distortion-Complexity Optimization (RDCO) for various bitrate and bitrate-constrained applications.
Cite
Text
Zhang and Gao. "Learned Rate Control for Frame-Level Adaptive Neural Video Compression via Dynamic Neural Network." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-73013-9_14Markdown
[Zhang and Gao. "Learned Rate Control for Frame-Level Adaptive Neural Video Compression via Dynamic Neural Network." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/zhang2024eccv-learned/) doi:10.1007/978-3-031-73013-9_14BibTeX
@inproceedings{zhang2024eccv-learned,
title = {{Learned Rate Control for Frame-Level Adaptive Neural Video Compression via Dynamic Neural Network}},
author = {Zhang, Chenhao and Gao, Wei},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2024},
doi = {10.1007/978-3-031-73013-9_14},
url = {https://mlanthology.org/eccv/2024/zhang2024eccv-learned/}
}