ML Anthology
Authors
Search
About
Shenfeld, Idan
9 publications
NeurIPS
2025
KL-Regularized RLHF with Multiple Reference Models: Exact Solutions and Sample Complexity
Gholamali Aminian
,
Amir R. Asadi
,
Idan Shenfeld
,
Youssef Mroueh
ICLR
2025
Learning How Hard to Think: Input-Adaptive Allocation of LM Computation
Mehul Damani
,
Idan Shenfeld
,
Andi Peng
,
Andreea Bobu
,
Jacob Andreas
NeurIPSW
2024
Curiosity-Driven Red Teaming for Large Language Models
Zhang-Wei Hong
,
Idan Shenfeld
,
Tsun-Hsuan Wang
,
Yung-Sung Chuang
,
Aldo Pareja
,
James R. Glass
,
Akash Srivastava
,
Pulkit Agrawal
ICLR
2024
Curiosity-Driven Red-Teaming for Large Language Models
Zhang-Wei Hong
,
Idan Shenfeld
,
Tsun-Hsuan Wang
,
Yung-Sung Chuang
,
Aldo Pareja
,
James R. Glass
,
Akash Srivastava
,
Pulkit Agrawal
ICLRW
2024
Value Augmented Sampling: Predict Your Rewards to Align Language Models
Seungwook Han
,
Idan Shenfeld
,
Akash Srivastava
,
Yoon Kim
,
Pulkit Agrawal
ICML
2023
TGRL: An Algorithm for Teacher Guided Reinforcement Learning
Idan Shenfeld
,
Zhang-Wei Hong
,
Aviv Tamar
,
Pulkit Agrawal
ICLRW
2023
TGRL: Teacher Guided Reinforcement Learning Algorithm for POMDPs
Idan Shenfeld
,
Zhang-Wei Hong
,
Aviv Tamar
,
Pulkit Agrawal
ICLRW
2021
Offline Meta Learning of Exploration
Ron Dorfman
,
Idan Shenfeld
,
Aviv Tamar
NeurIPS
2021
Offline Meta Reinforcement Learning -- Identifiability Challenges and Effective Data Collection Strategies
Ron Dorfman
,
Idan Shenfeld
,
Aviv Tamar