Weng, Paul
33 publications
ICML
2025
Comparing Comparisons: Informative and Easy Human Feedback with Distinguishability Queries
NeurIPS
2025
Time Reversal Symmetry for Efficient Robotic Manipulations in Deep Reinforcement Learning
ICMLW
2024
Comparing Comparisons: Informative and Easy Human Feedback with Distinguishability Queries
IJCAI
2019
Exploiting the Sign of the Advantage Function to Learn Deterministic Policies in Continuous Domains