Yang, Yaodong
106 publications
NeurIPS
2025
DexFlyWheel: A Scalable and Self-Improving Data Generation Framework for Dexterous Manipulation
ICLR
2025
Emerging Safety Attack and Defense in Federated Instruction Tuning of Large Language Models
NeurIPS
2025
Empirical Study on Robustness and Resilience in Cooperative Multi-Agent Reinforcement Learning
ICLR
2025
Magnetic Preference Optimization: Achieving Last-Iterate Convergence for Language Model Alignment
NeurIPS
2025
SafeVLA: Towards Safety Alignment of Vision-Language-Action Model via Constrained Learning
ICML
2024
Efficient Adaptation in Mixed-Motive Environments via Hierarchical Opponent Modeling and Planning
NeurIPSW
2024
Emerging Safety Attack and Defense in Federated Instruction Tuning of Large Language Models
TMLR
2024
MaskMA: Towards Zero-Shot Multi-Agent Decision Making with Mask-Based Collaborative Learning
CoRL
2024
Neural Attention Field: Emerging Point Relevance in 3D Scenes for One-Shot Dexterous Grasping
NeurIPS
2024
SafeSora: Towards Safety Alignment of Text2Video Generation via a Human Preference Dataset
NeurIPS
2023
Team-PSRO for Learning Approximate TMECor in Large Team Games via Cooperative Reinforcement Learning
NeurIPS
2022
MATE: Benchmarking Multi-Agent Reinforcement Learning in Distributed Target Coverage Control
NeurIPS
2022
Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-Based Reinforcement Learning
NeurIPS
2022
Transformer-Based Working Memory for Multiagent Reinforcement Learning with Action Parsing
NeurIPS
2021
Towards Unifying Behavioral and Response Diversity for Open-Ended Learning in Zero-Sum Games