ML Anthology
Authors
Search
About
Matrenok, Simon
1 publications
NeurIPS
2025
Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions
Simon Matrenok
,
Skander Moalla
,
Caglar Gulcehre