Yu, Bei
38 publications
NeurIPS
2025
DLoFT: Gradient-Decoupled Fine-Tuning for Generalizable Long Chain-of-Thought Reasoning
NeurIPS
2025
LithoSim: A Large, Holistic Lithography Simulation Benchmark for AI-Driven Semiconductor Manufacturing
NeurIPS
2025
On-Policy Optimization with Group Equivalent Preference for Multi-Programming Language Understanding
ICML
2025
TGDPO: Harnessing Token-Level Reward Guidance for Enhancing Direct Preference Optimization