Real-Time High-Resolution Background Matting

Abstract

We introduce a real-time, high-resolution background replacement technique which operates at 30fps in 4K resolution, and 60fps for HD on a modern GPU. Our technique is based on background matting, where an additional frame of the background is captured and used to inform the alpha matte and the foreground layer. The main challenge is to compute a high-quality alpha matte, preserving strand-level hair details, while processing high-resolution images in real-time. To achieve this goal, we employ two neural networks; the base network computes a low-resolution result which is refined by a second network operating at high-resolution on selective patches. We introduce two large-scale video and image matting datasets: VideoMatte240K and PhotoMatte13K/85. Our approach yields higher quality results compared to the previous state-of-the-art in background matting, while simultaneously yielding a dramatic boost in both speed and resolution.

Cite

Text

Lin et al. "Real-Time High-Resolution Background Matting." Conference on Computer Vision and Pattern Recognition, 2021. doi:10.1109/CVPR46437.2021.00865

Markdown

[Lin et al. "Real-Time High-Resolution Background Matting." Conference on Computer Vision and Pattern Recognition, 2021.](https://mlanthology.org/cvpr/2021/lin2021cvpr-realtime/) doi:10.1109/CVPR46437.2021.00865

BibTeX

@inproceedings{lin2021cvpr-realtime,
  title     = {{Real-Time High-Resolution Background Matting}},
  author    = {Lin, Shanchuan and Ryabtsev, Andrey and Sengupta, Soumyadip and Curless, Brian L. and Seitz, Steven M. and Kemelmacher-Shlizerman, Ira},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2021},
  pages     = {8762-8771},
  doi       = {10.1109/CVPR46437.2021.00865},
  url       = {https://mlanthology.org/cvpr/2021/lin2021cvpr-realtime/}
}