Sieberling, Oliver

4 publications

ICLR 2026 MesaNet: Sequence Modeling by Locally Optimal Test-Time Training Johannes von Oswald, Nino Scherrer, Seijin Kobayashi, Luca Versari, Songlin Yang, Maximilian Schlegel, Kaitlin Maile, Yanick Schimpf, Oliver Sieberling, Alexander Meulemans, Guillaume Lajoie, Rif A. Saurous, Charlotte Frenkel, Razvan Pascanu, Blaise Aguera y Arcas, Joao Sacramento
ICML 2025 EvoPress: Accurate Dynamic Model Compression via Evolutionary Search Oliver Sieberling, Denis Kuznedelev, Eldar Kurtic, Dan Alistarh
ICLRW 2025 EvoPress: Accurate Dynamic Model Compression via Evolutionary Search Oliver Sieberling, Denis Kuznedelev, Dan Alistarh
NeurIPS 2025 Quartet: Native FP4 Training Can Be Optimal for Large Language Models Roberto L. Castro, Andrei Panferov, Soroush Tabesh, Oliver Sieberling, Jiale Chen, Mahdi Nikdan, Saleh Ashkboos, Dan Alistarh