VQ-Transplant: Efficient VQ-Module Integration for Pre-Trained Visual Tokenizers

Abstract

Vector Quantization (VQ) underpins modern discrete visual tokenization. However, training quantization modules for state-of-the-art VQ-based models requires substantial computational resources, which in practice all but prevents the development of novel VQ techniques under resource constraints. To address this limitation, we propose VQ-Transplant, a simple framework that enables plug-and-play integration of new VQ modules into frozen, pre-trained tokenizers by replacing their native VQ modules. Crucially, the transplantation process preserves all encoder-decoder parameters, obviating the need for costly end-to-end retraining when the quantization method changes. To mitigate decoder-quantization mismatch, we introduce a lightweight decoder adaptation strategy (trained for only 5 epochs on ImageNet-1k) that aligns feature priors with the new quantization space. In our empirical evaluation, VQ-Transplant achieves near state-of-the-art reconstruction fidelity for industry-level models such as VAR while reducing training cost by 95%. VQ-Transplant thus democratizes quantization research by enabling resource-efficient integration of novel VQ techniques while matching industry-level reconstruction performance.
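
The module swap described in the abstract can be illustrated with a minimal, dependency-free Python sketch. It is not the paper's implementation: the names `Tokenizer`, `NearestNeighborVQ`, and `transplant`, the toy encoder and decoder, and the codebooks are all hypothetical and chosen only to show the idea that the quantizer is replaced while the encoder and decoder objects are reused unchanged. The decoder adaptation stage is omitted here.

```python
def nearest_code(z, codebook):
    """Index of the codebook vector closest to z under squared L2 distance."""
    return min(
        range(len(codebook)),
        key=lambda k: sum((a - b) ** 2 for a, b in zip(z, codebook[k])),
    )


class NearestNeighborVQ:
    """Plain nearest-neighbor quantizer (a stand-in for any VQ module)."""

    def __init__(self, codebook):
        self.codebook = codebook

    def __call__(self, z):
        idx = nearest_code(z, self.codebook)
        return idx, self.codebook[idx]


class Tokenizer:
    """Encoder -> quantizer -> decoder pipeline."""

    def __init__(self, encoder, quantizer, decoder):
        self.encoder = encoder
        self.quantizer = quantizer
        self.decoder = decoder

    def reconstruct(self, x):
        z = self.encoder(x)            # continuous latent
        idx, z_q = self.quantizer(z)   # discrete token and its code vector
        return idx, self.decoder(z_q)  # reconstruction from the code vector


def transplant(tokenizer, new_quantizer):
    """Swap in a new quantizer; encoder and decoder are reused as-is."""
    return Tokenizer(tokenizer.encoder, new_quantizer, tokenizer.decoder)


# Toy "pre-trained" components (assumptions for illustration only).
encoder = lambda x: [2.0 * v for v in x]
decoder = lambda z: [v / 2.0 for v in z]

old_tok = Tokenizer(encoder, NearestNeighborVQ([[0.0, 0.0], [1.0, 1.0]]), decoder)
new_vq = NearestNeighborVQ([[0.0, 0.0], [1.0, 1.0], [2.0, 2.0]])
new_tok = transplant(old_tok, new_vq)

x = [0.9, 1.1]
old_idx, old_recon = old_tok.reconstruct(x)
new_idx, new_recon = new_tok.reconstruct(x)
```

In this toy setting the transplanted quantizer has a code closer to the encoder output, so the reconstruction improves without touching the encoder or decoder. In a real tokenizer the decoder was trained against the original quantizer's outputs, which is the mismatch the abstract's decoder adaptation step is meant to address.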

Cite

Text

Fang et al. "VQ-Transplant: Efficient VQ-Module Integration for Pre-Trained Visual Tokenizers." International Conference on Learning Representations, 2026.

Markdown

[Fang et al. "VQ-Transplant: Efficient VQ-Module Integration for Pre-Trained Visual Tokenizers." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/fang2026iclr-vqtransplant/)

BibTeX

@inproceedings{fang2026iclr-vqtransplant,
  title     = {{VQ-Transplant: Efficient VQ-Module Integration for Pre-Trained Visual Tokenizers}},
  author    = {Fang, Xianghong and Yuan, Yuan and Kong, Dehan and Rudner, Tim G. J.},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/fang2026iclr-vqtransplant/}
}