Shenfeld, Idan

9 publications

NeurIPS 2025. KL-Regularized RLHF with Multiple Reference Models: Exact Solutions and Sample Complexity. Gholamali Aminian, Amir R. Asadi, Idan Shenfeld, Youssef Mroueh
ICLR 2025. Learning How Hard to Think: Input-Adaptive Allocation of LM Computation. Mehul Damani, Idan Shenfeld, Andi Peng, Andreea Bobu, Jacob Andreas
NeurIPSW 2024. Curiosity-Driven Red Teaming for Large Language Models. Zhang-Wei Hong, Idan Shenfeld, Tsun-Hsuan Wang, Yung-Sung Chuang, Aldo Pareja, James R. Glass, Akash Srivastava, Pulkit Agrawal
ICLR 2024. Curiosity-Driven Red-Teaming for Large Language Models. Zhang-Wei Hong, Idan Shenfeld, Tsun-Hsuan Wang, Yung-Sung Chuang, Aldo Pareja, James R. Glass, Akash Srivastava, Pulkit Agrawal
ICLRW 2024. Value Augmented Sampling: Predict Your Rewards to Align Language Models. Seungwook Han, Idan Shenfeld, Akash Srivastava, Yoon Kim, Pulkit Agrawal
ICML 2023. TGRL: An Algorithm for Teacher Guided Reinforcement Learning. Idan Shenfeld, Zhang-Wei Hong, Aviv Tamar, Pulkit Agrawal
ICLRW 2023. TGRL: Teacher Guided Reinforcement Learning Algorithm for POMDPs. Idan Shenfeld, Zhang-Wei Hong, Aviv Tamar, Pulkit Agrawal
ICLRW 2021. Offline Meta Learning of Exploration. Ron Dorfman, Idan Shenfeld, Aviv Tamar
NeurIPS 2021. Offline Meta Reinforcement Learning -- Identifiability Challenges and Effective Data Collection Strategies. Ron Dorfman, Idan Shenfeld, Aviv Tamar