Tarasov, Denis

20 publications

ICML 2025 Latent Action Learning Requires Supervision in the Presence of Distractors Alexander Nikulin, Ilya Zisman, Denis Tarasov, Lyubaykin Nikita, Andrei Polubarov, Igor Kiselev, Vladislav Kurenkov
ICLRW 2025 Latent Action Learning Requires Supervision in the Presence of Distractors Alexander Nikulin, Ilya Zisman, Denis Tarasov, Lyubaykin Nikita, Andrei Polubarov, Igor Kiselev, Vladislav Kurenkov
ICLRW 2025 N-Gram Induction Heads for In-Context RL: Improving Stability and Reducing Data Needs Ilya Zisman, Alexander Nikulin, Viacheslav Sinii, Denis Tarasov, Lyubaykin Nikita, Andrei Polubarov, Igor Kiselev, Vladislav Kurenkov
ICLRW 2025 Object-Centric Latent Action Learning Albina Klepach, Alexander Nikulin, Ilya Zisman, Denis Tarasov, Alexander Derevyagin, Andrei Polubarov, Lyubaykin Nikita, Vladislav Kurenkov
ICML 2025 Vintix: Action Model via In-Context Reinforcement Learning Andrei Polubarov, Lyubaykin Nikita, Alexander Derevyagin, Ilya Zisman, Denis Tarasov, Alexander Nikulin, Vladislav Kurenkov
ICLRW 2025 Yes, Q-Learning Helps Offline In-Context RL Denis Tarasov, Alexander Nikulin, Ilya Zisman, Albina Klepach, Andrei Polubarov, Lyubaykin Nikita, Alexander Derevyagin, Igor Kiselev, Vladislav Kurenkov
ICMLW 2024 Distilling LLMs’ Decomposition Abilities into Compact Language Models Denis Tarasov, Kumar Shridhar
ICMLW 2024 Distilling LLMs’ Decomposition Abilities into Compact Language Models Denis Tarasov, Kumar Shridhar
TMLR 2024 Is Value Functions Estimation with Classification Plug-and-Play for Offline Reinforcement Learning? Denis Tarasov, Kirill Brilliantov, Dmitrii Kharlapenko
ICMLW 2024 Is Value Functions Estimation with Classification Plug-and-Play for Offline Reinforcement Learning? Denis Tarasov, Kirill Brilliantov, Dmitrii Kharlapenko
ICML 2023 Anti-Exploration by Random Network Distillation Alexander Nikulin, Vladislav Kurenkov, Denis Tarasov, Sergey Kolesnikov
NeurIPS 2023 CORL: Research-Oriented Deep Offline Reinforcement Learning Library Denis Tarasov, Alexander Nikulin, Dmitry Akimov, Vladislav Kurenkov, Sergey Kolesnikov
NeurIPS 2023 Katakomba: Tools and Benchmarks for Data-Driven NetHack Vladislav Kurenkov, Alexander Nikulin, Denis Tarasov, Sergey Kolesnikov
NeurIPSW 2023 Offline RL for Generative Design of Protein Binders Denis Tarasov, Ulrich Armel Mbou Sob, Miguel Arbesú, Nima H. Siboni, Sebastien Boyer, Andries Petrus Smit, Oliver Bent, Arnu Pretorius, Marcin J. Skwark
ICLRW 2023 Revisiting Behavior Regularized Actor-Critic Denis Tarasov, Vladislav Kurenkov, Alexander Nikulin, Sergey Kolesnikov
NeurIPS 2023 Revisiting the Minimalist Approach to Offline Reinforcement Learning Denis Tarasov, Vladislav Kurenkov, Alexander Nikulin, Sergey Kolesnikov
NeurIPSW 2022 CORL: Research-Oriented Deep Offline Reinforcement Learning Library Denis Tarasov, Alexander Nikulin, Dmitry Akimov, Vladislav Kurenkov, Sergey Kolesnikov
NeurIPSW 2022 Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows Dmitry Akimov, Vladislav Kurenkov, Alexander Nikulin, Denis Tarasov, Sergey Kolesnikov
ICLRW 2022 Prompts and Pre-Trained Language Models for Offline Reinforcement Learning Denis Tarasov, Vladislav Kurenkov, Sergey Kolesnikov
NeurIPSW 2022 Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size Alexander Nikulin, Vladislav Kurenkov, Denis Tarasov, Dmitry Akimov, Sergey Kolesnikov