Ostuni, Vito Claudio

1 publications

ICLR 2026 Rank-GRPO: Training LLM-Based Conversational Recommender Systems with Reinforcement Learning Yaochen Zhu, Harald Steck, Dawen Liang, Yinhan He, Vito Claudio Ostuni, Jundong Li, Nathan Kallus