AnyUp: Universal Feature Upsampling
Abstract
We introduce AnyUp, a method for feature upsampling that can be applied to any vision feature at any resolution, without encoder-specific training. Existing learning-based upsamplers for features like DINO or CLIP need to be re-trained for every feature extractor and thus do not generalize to different feature types at inference time. In this work, we propose an *inference-time* feature-agnostic upsampling architecture to alleviate this limitation and improve upsampling quality. In our experiments, AnyUp sets a new state of the art for upsampled features, generalizes to different feature types, and preserves feature semantics while being efficient and easy to apply to a wide range of downstream tasks.
Cite
Text
Wimmer et al. "AnyUp: Universal Feature Upsampling." International Conference on Learning Representations, 2026.Markdown
[Wimmer et al. "AnyUp: Universal Feature Upsampling." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/wimmer2026iclr-anyup/)BibTeX
@inproceedings{wimmer2026iclr-anyup,
title = {{AnyUp: Universal Feature Upsampling}},
author = {Wimmer, Thomas and Truong, Prune and Rakotosaona, Marie-Julie and Oechsle, Michael and Tombari, Federico and Schiele, Bernt and Lenssen, Jan Eric},
booktitle = {International Conference on Learning Representations},
year = {2026},
url = {https://mlanthology.org/iclr/2026/wimmer2026iclr-anyup/}
}