Deng, Wenlong

8 publications

TMLR 2026 Advantage Shaping as Surrogate Reward Maximization: Unifying Pass@K Policy Gradients Christos Thrampoulidis, Sadegh Mahdavi, Wenlong Deng
ICLR 2026 Textual Equilibrium Propagation for Deep Compound AI Systems Minghui Chen, Wenlong Deng, James Zou, Han Yu, Xiaoxiao Li
ICLR 2026 Token Hidden Reward: Steering Exploration-Exploitation in Group Relative Deep Reinforcement Learning Wenlong Deng, Yi Ren, Yushu Li, Boying Gong, Danica J. Sutherland, Xiaoxiao Li, Christos Thrampoulidis
ICLR 2025 Can Textual Gradient Work in Federated Learning? Minghui Chen, Ruinan Jin, Wenlong Deng, Yuanyuan Chen, Zhi Huang, Han Yu, Xiaoxiao Li
ICLR 2025 DARE the Extreme: Revisiting Delta-Parameter Pruning for Fine-Tuned Models Wenlong Deng, Yize Zhao, Vala Vakilian, Minghui Chen, Xiaoxiao Li, Christos Thrampoulidis
ICLR 2025 GMValuator: Similarity-Based Data Valuation for Generative Models Jiaxi Yang, Wenlong Deng, Benlin Liu, Yangsibo Huang, James Zou, Xiaoxiao Li
NeurIPS 2025 On the Effect of Negative Gradient in Group Relative Deep Reinforcement Optimization Wenlong Deng, Yi Ren, Muchen Li, Danica J. Sutherland, Xiaoxiao Li, Christos Thrampoulidis
CVPR 2024 Unlocking the Potential of Prompt-Tuning in Bridging Generalized and Personalized Federated Learning Wenlong Deng, Christos Thrampoulidis, Xiaoxiao Li