Yang, Xu
87 publications
ICLR
2026
OPRIDE: Efficient Offline Preference-Based Reinforcement Learning via In-Dataset Exploration
ICLR
2026
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification
NeurIPS
2025
Adversarial Graph Fusion for Incomplete Multi-View Semi-Supervised Learning with Tensorial Imputation
ICML
2025
Learngene Tells You How to Customize: Task-Aware Parameter Initialization at Flexible Scales
NeurIPS
2025
R&D-Agent-Quant: A Multi-Agent Framework for Data-Centric Factors and Model Joint Optimization
NeurIPS
2025
RAPID Hand: Robust, Affordable, Perception-Integrated, Dexterous Manipulation Platform for Embodied Intelligence
AAAI
2025
Unlearning Concepts in Diffusion Model via Concept Domain Correction and Concept Preserving Gradient
NeurIPS
2024
Initializing Variable-Sized Vision Transformers from Learngene with Learnable Transformation