Bershatsky, Daniel

3 publications

ICLRW 2025 On the Spatial Structure of Mixture-of-Experts in Transformers Daniel Bershatsky, Ivan Oseledets
ICML 2023 Few-Bit Backward: Quantized Gradients of Activation Functions for Memory Footprint Reduction Georgii Sergeevich Novikov, Daniel Bershatsky, Julia Gusak, Alex Shonenkov, Denis Valerievich Dimitrov, Ivan Oseledets
IJCAI 2022 Survey on Efficient Training of Large Neural Networks Julia Gusak, Daria Cherniuk, Alena Shilova, Alexandr Katrutsa, Daniel Bershatsky, Xunyi Zhao, Lionel Eyraud-Dubois, Oleh Shliazhko, Denis Dimitrov, Ivan V. Oseledets, Olivier Beaumont