ML Anthology
Authors
Search
About
Ostaszewski, Mateusz
13 publications
IJCAI
2025
A Case for Validation Buffer in Pessimistic Actor-Critic
Michal Nauman
,
Mateusz Ostaszewski
,
Marek Cygan
ICLR
2025
Learning Continually by Spectral Regularization
Alex Lewandowski
,
Michał Bortkiewicz
,
Saurabh Kumar
,
András György
,
Dale Schuurmans
,
Mateusz Ostaszewski
,
Marlos C. Machado
NeurIPS
2025
PhysGym: Benchmarking LLMs in Interactive Physics Discovery with Controlled Priors
Yimeng Chen
,
Piotr Piękos
,
Mateusz Ostaszewski
,
Firas Laakom
,
Jürgen Schmidhuber
ICMLW
2024
A Case for Validation Buffer in Pessimistic Actor-Critic
Michal Nauman
,
Mateusz Ostaszewski
,
Marek Cygan
NeurIPS
2024
Bigger, Regularized, Optimistic: Scaling for Compute and Sample Efficient Continuous Control
Michal Nauman
,
Mateusz Ostaszewski
,
Krzysztof Jankowski
,
Piotr Miłoś
,
Marek Cygan
ICMLW
2024
Bigger, Regularized, Optimistic: Scaling for Compute and Sample-Efficient Continuous Control
Michal Nauman
,
Mateusz Ostaszewski
,
Krzysztof Jankowski
,
Piotr Miłoś
,
Marek Cygan
ICLR
2024
Curriculum Reinforcement Learning for Quantum Architecture Search Under Hardware Errors
Yash J. Patel
,
Akash Kundu
,
Mateusz Ostaszewski
,
Xavier Bonet-Monroig
,
Vedran Dunjko
,
Onur Danaci
ICML
2024
Fine-Tuning Reinforcement Learning Models Is Secretly a Forgetting Mitigation Problem
Maciej Wolczyk
,
Bartłomiej Cupiał
,
Mateusz Ostaszewski
,
Michał Bortkiewicz
,
Michał Zając
,
Razvan Pascanu
,
Łukasz Kuciński
,
Piotr Miłoś
ICML
2024
Overestimation, Overfitting, and Plasticity in Actor-Critic: The Bitter Lesson of Reinforcement Learning
Michal Nauman
,
Michał Bortkiewicz
,
Piotr Miłoś
,
Tomasz Trzcinski
,
Mateusz Ostaszewski
,
Marek Cygan
NeurIPSW
2023
On Consequences of Finetuning on Data with Highly Discriminative Features
Wojciech Masarczyk
,
Tomasz Trzcinski
,
Mateusz Ostaszewski
CoLLAs
2023
The Effectiveness of World Models for Continual Reinforcement Learning
Samuel Kessler
,
Mateusz Ostaszewski
,
MichałPaweł Bortkiewicz
,
Mateusz Żarski
,
Maciej Wolczyk
,
Jack Parker-Holder
,
Stephen J. Roberts
,
Piotr Miłoś
NeurIPS
2023
The Tunnel Effect: Building Data Representations in Deep Neural Networks
Wojciech Masarczyk
,
Mateusz Ostaszewski
,
Ehsan Imani
,
Razvan Pascanu
,
Piotr Miłoś
,
Tomasz Trzcinski
NeurIPS
2021
Reinforcement Learning for Optimization of Variational Quantum Circuit Architectures
Mateusz Ostaszewski
,
Lea M. Trenkwalder
,
Wojciech Masarczyk
,
Eleanor Scerri
,
Vedran Dunjko