ML Anthology
Authors
Search
About
Wang, Zhilin
10 publications
ICLR
2026
LitmusValues: Will AI Tell Lies to Save Sick Children? Litmus-Testing AI Values Prioritization with AIRiskDilemmas
Yu Ying Chiu
,
Zhilin Wang
,
Sharan Maiya
,
Yejin Choi
,
Kyle Fish
,
Sydney Levine
,
Evan J Hubinger
ICLR
2026
ProfBench: Multi-Domain Rubrics Requiring Professional Knowledge to Answer and Judge
Zhilin Wang
,
Jaehun Jung
,
Ximing Lu
,
Shizhe Diao
,
Ellie Evans
,
Jiaqi Zeng
,
Pavlo Molchanov
,
Yejin Choi
,
Jan Kautz
,
Yi Dong
ICLR
2026
RLBFF: Binary Flexible Feedback to Bridge Between Human Feedback & Verifiable Rewards
Zhilin Wang
,
Jiaqi Zeng
,
Olivier Delalleau
,
Ellie Evans
,
Daniel Egert
,
Hoo-Chang Shin
,
Felipe Soares
,
Yi Dong
,
Oleksii Kuchaiev
ICML
2025
Diverging Preferences: When Do Annotators Disagree and Do Models Know?
Michael Jq Zhang
,
Zhilin Wang
,
Jena D. Hwang
,
Yi Dong
,
Olivier Delalleau
,
Yejin Choi
,
Eunsol Choi
,
Xiang Ren
,
Valentina Pyatkin
AAAI
2025
HLMEA: Unsupervised Entity Alignment Based on Hybrid Language Models
Xiongnan Jin
,
Zhilin Wang
,
Jinpeng Chen
,
Liu Yang
,
Byungkook Oh
,
Seung-won Hwang
,
Jianqiang Li
ICLR
2025
HelpSteer2-Preference: Complementing Ratings with Preferences
Zhilin Wang
,
Alexander Bukharin
,
Olivier Delalleau
,
Daniel Egert
,
Gerald Shen
,
Jiaqi Zeng
,
Oleksii Kuchaiev
,
Yi Dong
NeurIPS
2025
HelpSteer3-Preference: Open Human-Annotated Preference Data Across Diverse Tasks and Languages
Zhilin Wang
,
Jiaqi Zeng
,
Olivier Delalleau
,
Hoo-Chang Shin
,
Felipe Soares
,
Alexander Bukharin
,
Ellie Evans
,
Yi Dong
,
Oleksii Kuchaiev
NeurIPSW
2024
Diverging Preferences: When Do Annotators Disagree and Do Models Know?
Michael JQ Zhang
,
Zhilin Wang
,
Jena D. Hwang
,
Yi Dong
,
Olivier Delalleau
,
Yejin Choi
,
Eunsol Choi
,
Xiang Ren
,
Valentina Pyatkin
NeurIPS
2024
HelpSteer 2: Open-Source Dataset for Training Top-Performing Reward Models
Zhilin Wang
,
Yi Dong
,
Olivier Delalleau
,
Jiaqi Zeng
,
Gerald Shen
,
Daniel Egert
,
Jimmy J. Zhang
,
Makesh Narsimhan Sreedhar
,
Oleksii Kuchaiev
AAAI
2020
FFA-Net: Feature Fusion Attention Network for Single Image Dehazing
Xu Qin
,
Zhilin Wang
,
Yuanchao Bai
,
Xiaodong Xie
,
Huizhu Jia