Ostaszewski, Mateusz

13 publications

IJCAI 2025 A Case for Validation Buffer in Pessimistic Actor-Critic Michal Nauman, Mateusz Ostaszewski, Marek Cygan
ICLR 2025 Learning Continually by Spectral Regularization Alex Lewandowski, Michał Bortkiewicz, Saurabh Kumar, András György, Dale Schuurmans, Mateusz Ostaszewski, Marlos C. Machado
NeurIPS 2025 PhysGym: Benchmarking LLMs in Interactive Physics Discovery with Controlled Priors Yimeng Chen, Piotr Piękos, Mateusz Ostaszewski, Firas Laakom, Jürgen Schmidhuber
ICMLW 2024 A Case for Validation Buffer in Pessimistic Actor-Critic Michal Nauman, Mateusz Ostaszewski, Marek Cygan
NeurIPS 2024 Bigger, Regularized, Optimistic: Scaling for Compute and Sample Efficient Continuous Control Michal Nauman, Mateusz Ostaszewski, Krzysztof Jankowski, Piotr Miłoś, Marek Cygan
ICMLW 2024 Bigger, Regularized, Optimistic: Scaling for Compute and Sample-Efficient Continuous Control Michal Nauman, Mateusz Ostaszewski, Krzysztof Jankowski, Piotr Miłoś, Marek Cygan
ICLR 2024 Curriculum Reinforcement Learning for Quantum Architecture Search Under Hardware Errors Yash J. Patel, Akash Kundu, Mateusz Ostaszewski, Xavier Bonet-Monroig, Vedran Dunjko, Onur Danaci
ICML 2024 Fine-Tuning Reinforcement Learning Models Is Secretly a Forgetting Mitigation Problem Maciej Wolczyk, Bartłomiej Cupiał, Mateusz Ostaszewski, Michał Bortkiewicz, Michał Zając, Razvan Pascanu, Łukasz Kuciński, Piotr Miłoś
ICML 2024 Overestimation, Overfitting, and Plasticity in Actor-Critic: The Bitter Lesson of Reinforcement Learning Michal Nauman, Michał Bortkiewicz, Piotr Miłoś, Tomasz Trzcinski, Mateusz Ostaszewski, Marek Cygan
NeurIPSW 2023 On Consequences of Finetuning on Data with Highly Discriminative Features Wojciech Masarczyk, Tomasz Trzcinski, Mateusz Ostaszewski
CoLLAs 2023 The Effectiveness of World Models for Continual Reinforcement Learning Samuel Kessler, Mateusz Ostaszewski, Michał Paweł Bortkiewicz, Mateusz Żarski, Maciej Wolczyk, Jack Parker-Holder, Stephen J. Roberts, Piotr Miłoś
NeurIPS 2023 The Tunnel Effect: Building Data Representations in Deep Neural Networks Wojciech Masarczyk, Mateusz Ostaszewski, Ehsan Imani, Razvan Pascanu, Piotr Miłoś, Tomasz Trzcinski
NeurIPS 2021 Reinforcement Learning for Optimization of Variational Quantum Circuit Architectures Mateusz Ostaszewski, Lea M. Trenkwalder, Wojciech Masarczyk, Eleanor Scerri, Vedran Dunjko