ML Anthology
Shenfeld, Idan
12 publications
ICLR
2026
Best-of-N Through the Smoothing Lens: KL Divergence and Regret Analysis
Gholamali Aminian, Idan Shenfeld, Amir R. Asadi, Ahmad Beirami, Youssef Mroueh
ICLR
2026
Beyond Binary Rewards: Training LMs to Reason About Their Uncertainty
Mehul Damani, Isha Puri, Stewart Slocum, Idan Shenfeld, Leshem Choshen, Yoon Kim, Jacob Andreas
ICLR
2026
RL's Razor: Why Online Reinforcement Learning Forgets Less
Idan Shenfeld, Jyothish Pari, Pulkit Agrawal
NeurIPS
2025
KL-Regularized RLHF with Multiple Reference Models: Exact Solutions and Sample Complexity
Gholamali Aminian, Amir R. Asadi, Idan Shenfeld, Youssef Mroueh
ICLR
2025
Learning How Hard to Think: Input-Adaptive Allocation of LM Computation
Mehul Damani, Idan Shenfeld, Andi Peng, Andreea Bobu, Jacob Andreas
NeurIPSW
2024
Curiosity-Driven Red Teaming for Large Language Models
Zhang-Wei Hong, Idan Shenfeld, Tsun-Hsuan Wang, Yung-Sung Chuang, Aldo Pareja, James R. Glass, Akash Srivastava, Pulkit Agrawal
ICLR
2024
Curiosity-Driven Red-Teaming for Large Language Models
Zhang-Wei Hong, Idan Shenfeld, Tsun-Hsuan Wang, Yung-Sung Chuang, Aldo Pareja, James R. Glass, Akash Srivastava, Pulkit Agrawal
ICLRW
2024
Value Augmented Sampling: Predict Your Rewards to Align Language Models
Seungwook Han, Idan Shenfeld, Akash Srivastava, Yoon Kim, Pulkit Agrawal
ICML
2023
TGRL: An Algorithm for Teacher Guided Reinforcement Learning
Idan Shenfeld, Zhang-Wei Hong, Aviv Tamar, Pulkit Agrawal
ICLRW
2023
TGRL: Teacher Guided Reinforcement Learning Algorithm for POMDPs
Idan Shenfeld, Zhang-Wei Hong, Aviv Tamar, Pulkit Agrawal
ICLRW
2021
Offline Meta Learning of Exploration
Ron Dorfman, Idan Shenfeld, Aviv Tamar
NeurIPS
2021
Offline Meta Reinforcement Learning -- Identifiability Challenges and Effective Data Collection Strategies
Ron Dorfman, Idan Shenfeld, Aviv Tamar