Moalla, Skander

5 publications

NeurIPS 2025. Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions. Simon Matrenok, Skander Moalla, Caglar Gulcehre
NeurIPS 2024. Building on Efficient Foundations: Effective Training of LLMs with Structured Feedforward Layers. Xiuying Wei, Skander Moalla, Razvan Pascanu, Caglar Gulcehre
NeurIPS 2024. No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO. Skander Moalla, Andrea Miele, Daniil Pyatko, Razvan Pascanu, Caglar Gulcehre
ICMLW 2024. No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO. Skander Moalla, Andrea Miele, Razvan Pascanu, Caglar Gulcehre
NeurIPS 2023. SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning. Benjamin Ellis, Jonathan Cook, Skander Moalla, Mikayel Samvelyan, Mingfei Sun, Anuj Mahajan, Jakob Foerster, Shimon Whiteson