Scaling On-Device GPU Inference for Large Generative Models

Cite

Text

Tang et al. "Scaling On-Device GPU Inference for Large Generative Models." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025.

Markdown

[Tang et al. "Scaling On-Device GPU Inference for Large Generative Models." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025.](https://mlanthology.org/cvprw/2025/tang2025cvprw-scaling/)

BibTeX

@inproceedings{tang2025cvprw-scaling,
  title     = {{Scaling On-Device GPU Inference for Large Generative Models}},
  author    = {Tang, Jiuqiang and Sorokin, Raman and Ignasheva, Ekaterina and Jensen, Grant and Chen, Lin and Lee, Juhyun and Kulik, Andrei and Grundmann, Matthias},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2025},
  pages     = {6355-6364},
  url       = {https://mlanthology.org/cvprw/2025/tang2025cvprw-scaling/}
}