Yu, Yang

152 publications

NeurIPS 2025 Adaptable Safe Policy Learning from Multi-Task Data with Constraint Prioritized Decision Transformer Ruiqi Xue, Ziqian Zhang, Lihe Li, Cong Guan, Lei Yuan, Yang Yu
ICLR 2025 Any-Step Dynamics Model Improves Future Predictions for Online and Offline Reinforcement Learning Haoxin Lin, Yu-Yan Xu, Yihao Sun, Zhilong Zhang, Yi-Chen Li, Chengxing Jia, Junyin Ye, Jiaji Zhang, Yang Yu
ICML 2025 Behavior-Regularized Diffusion Policy Optimization for Offline Reinforcement Learning Chen-Xiao Gao, Chenyang Wu, Mingjun Cao, Chenjun Xiao, Yang Yu, Zongzhang Zhang
ICML 2025 Controlling Large Language Model with Latent Action Chengxing Jia, Ziniu Li, Pengyuan Wang, Yi-Chen Li, Zhenyu Hou, Yuxiao Dong, Yang Yu
TMLR 2025 Efficient Multi-Agent Cooperation Learning Through Teammate Lookahead Feng Chen, Xinwei Chen, Rong-Jun Qin, Cong Guan, Lei Yuan, Zongzhang Zhang, Yang Yu
ICLR 2025 Efficient Multi-Agent Offline Coordination via Diffusion-Based Trajectory Stitching Lei Yuan, Yuqi Bian, Lihe Li, Ziqian Zhang, Cong Guan, Yang Yu
NeurIPS 2025 Focus-Then-Reuse: Fast Adaptation in Visual Perturbation Environments Jiahui Wang, Chao Chen, Jiacheng Xu, Zongzhang Zhang, Yang Yu
AAAI 2025 GRAIN: Multi-Granular and Implicit Information Aggregation Graph Neural Network for Heterophilous Graphs Songwei Zhao, Yuan Jiang, Zijing Zhang, Yang Yu, Hechang Chen
NeurIPS 2025 Geometric Mixture Models for Electrolyte Conductivity Prediction Anyi Li, Jiacheng Cen, Songyou Li, Mingze Li, Yang Yu, Wenbing Huang
AAAI 2025 GuideNER: Annotation Guidelines Are Better than Examples for In-Context Named Entity Recognition Shizhou Huang, Bo Xu, Yang Yu, Changqun Li, Xin Alex Lin
ICML 2025 Improving Reward Model Generalization from Adversarial Process Enhanced Preferences Zhilong Zhang, Tian Xu, Xinghao Du, Xingchen Cao, Yihao Sun, Yang Yu
TMLR 2025 Interactive Large Language Models for Reliable Answering Under Incomplete Context Jing-Cheng Pang, Heng-Bo Fan, Pengyuan Wang, Jia-Hao Xiao, Nan Tang, Si-Hang Yang, Chengxing Jia, Ming-Kun Xie, Xiang Chen, Sheng-Jun Huang, Yang Yu
ICML 2025 LLM Data Selection and Utilization via Dynamic Bi-Level Optimization Yang Yu, Kai Han, Hang Zhou, Yehui Tang, Kaiqi Huang, Yunhe Wang, Dacheng Tao
ICML 2025 LLM-Assisted Semantically Diverse Teammate Generation for Efficient Multi-Agent Coordination Lihe Li, Lei Yuan, Pengsen Liu, Tao Jiang, Yang Yu
ICLR 2025 LLMOPT: Learning to Define and Solve General Optimization Problems from Scratch Caigao Jiang, Xiang Shu, Hong Qian, Xingyu Lu, Jun Zhou, Aimin Zhou, Yang Yu
MLJ 2025 Learning De-Biased Environment Models for Delivery Incentive Policy Optimization on Food Delivery Platforms Yu-Ren Liu, Xiong-Hui Chen, Siyuan Xiao, Xinyu Yang, Xintong Qi, Linjun Zhou, Yang Yu, Fangsheng Huang
ICLR 2025 Learning View-Invariant World Models for Visual Robotic Manipulation Jing-Cheng Pang, Nan Tang, Kaiyuan Li, Yuting Tang, Xin-Qiang Cai, Zhen-Yu Zhang, Gang Niu, Masashi Sugiyama, Yang Yu
ICML 2025 Learning to Reuse Policies in State Evolvable Environments Ziqian Zhang, Bohan Yang, Lihe Li, Yuqi Bian, Ruiqi Xue, Feng Chen, Yi-Chen Li, Lei Yuan, Yang Yu
CVPR 2025 MedUnifier: Unifying Vision-and-Language Pre-Training on Medical Data with Vision Generation Task Using Discrete Visual Representations Ziyang Zhang, Yang Yu, Yucheng Chen, Xulei Yang, Si Yong Yeo
NeurIPS 2025 Multi-Agent Imitation by Learning and Sampling from Factorized Soft Q-Function Yi-Chen Li, Zhongxiang Ling, Tao Jiang, Fuxiang Zhang, Pengyuan Wang, Lei Yuan, Zongzhang Zhang, Yang Yu
ICLR 2025 On the Optimization Landscape of Low Rank Adaptation Methods for Large Language Models Xu-Hui Liu, Yali Du, Jun Wang, Yang Yu
ICLR 2025 Q-Adapter: Customizing Pre-Trained LLMs to New Preferences with Forgetting Mitigation Yi-Chen Li, Fuxiang Zhang, Wenjie Qiu, Lei Yuan, Chengxing Jia, Zongzhang Zhang, Yang Yu, Bo An
ICLR 2025 SOO-Bench: Benchmarks for Evaluating the Stability of Offline Black-Box Optimization Hong Qian, Yiyi Zhu, Xiang Shu, Shuo Liu, Yaolin Wen, Xin An, Huakang Lu, Aimin Zhou, Ke Tang, Yang Yu
ICLR 2025 Semantic Temporal Abstraction via Vision-Language Model Guidance for Efficient Reinforcement Learning Tian-Shuo Liu, Xu-Hui Liu, Ruifeng Chen, Lixuan Jin, Pengyuan Wang, Zhilong Zhang, Yang Yu
NeurIPS 2025 TCM-Ladder: A Benchmark for Multimodal Question Answering on Traditional Chinese Medicine Jiacheng Xie, Yang Yu, Ziyang Zhang, Shuai Zeng, Jiaxuan He, Ayush Vasireddy, Xiaoting Tang, Congyu Guo, Lening Zhao, Congcong Jing, Guanghui An, Dong Xu
NeurIPS 2025 Uncertainty-Sensitive Privileged Learning Fan-Ming Luo, Lei Yuan, Yang Yu
AAAI 2025 VA-AR: Learning Velocity-Aware Action Representations with Mixture of Window Attention Jiangning Wei, Lixiong Qin, Bo Yu, Tianjian Zou, Chuhan Yan, Dandan Xiao, Yang Yu, Lan Yang, Ke Li, Jun Liu
AAAI 2024 ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning Chenxiao Gao, Chenyang Wu, Mingjun Cao, Rui Kong, Zongzhang Zhang, Yang Yu
IJCAI 2024 ADMN: Agent-Driven Modular Network for Dynamic Parameter Sharing in Cooperative Multi-Agent Reinforcement Learning Yang Yu, Qiyue Yin, Junge Zhang, Pei Xu, Kaiqi Huang
AAAI 2024 ANEDL: Adaptive Negative Evidential Deep Learning for Open-Set Semi-Supervised Learning Yang Yu, Danruo Deng, Furui Liu, Qi Dou, Yueming Jin, Guangyong Chen, Pheng-Ann Heng
NeurIPS 2024 Bias and Volatility: A Statistical Framework for Evaluating Large Language Model's Stereotypes and the Associated Generation Inconsistency Yiran Liu, Ke Yang, Zehan Qi, Xiao Liu, Yang Yu, ChengXiang Zhai
ICML 2024 Causality Based Front-Door Defense Against Backdoor Attack on Language Models Yiran Liu, Xiaoang Xu, Zhiyi Hou, Yang Yu
IJCAI 2024 Continual Multi-Objective Reinforcement Learning via Reward Model Rehearsal Lihe Li, Ruotong Chen, Ziqian Zhang, Zhichao Wu, Yi-Chen Li, Cong Guan, Yang Yu, Lei Yuan
ICML 2024 Debiased Offline Representation Learning for Fast Online Adaptation in Non-Stationary Dynamics Xinyu Zhang, Wenjie Qiu, Yi-Chen Li, Lei Yuan, Chengxing Jia, Zongzhang Zhang, Yang Yu
ICML 2024 Deep Demonstration Tracing: Learning Generalizable Imitator Policy for Runtime Imitation from a Single Demonstration Xiong-Hui Chen, Junyin Ye, Hang Zhao, Yi-Chen Li, Xu-Hui Liu, Haoran Shi, Yu-Yan Xu, Zhihao Ye, Si-Hang Yang, Yang Yu, Anqi Huang, Kai Xu, Zongzhang Zhang
ECML-PKDD 2024 Dynamics Adaptive Safe Reinforcement Learning with a Misspecified Simulator Ruiqi Xue, Ziqian Zhang, Lihe Li, Feng Chen, Yi-Chen Li, Yang Yu, Lei Yuan
ICLRW 2024 Efficient Human-AI Coordination via Preparatory Language-Based Convention Cong Guan, Lichao Zhang, Chunpeng Fan, Yi-Chen Li, Feng Chen, Lihe Li, Yunjia Tian, Lei Yuan, Yang Yu
NeurIPS 2024 Efficient Recurrent Off-Policy RL Requires a Context-Encoder-Specific Learning Rate Fan-Ming Luo, Zuolin Tu, Zefang Huang, Yang Yu
ICML 2024 Energy-Guided Diffusion Sampling for Offline-to-Online Reinforcement Learning Xu-Hui Liu, Tian-Shuo Liu, Shengyi Jiang, Ruifeng Chen, Zhilong Zhang, Xinwei Chen, Yang Yu
AAAI 2024 Episodic Return Decomposition by Difference of Implicitly Assigned Sub-Trajectory Reward Haoxin Lin, Hongqiu Wu, Jiaji Zhang, Yihao Sun, Junyin Ye, Yang Yu
ICLR 2024 Flow to Better: Offline Preference-Based Reinforcement Learning via Preferred Trajectory Generation Zhilong Zhang, Yihao Sun, Junyin Ye, Tian-Shuo Liu, Jiaji Zhang, Yang Yu
AAAI 2024 Focus-Then-Decide: Segmentation-Assisted Reinforcement Learning Chao Chen, Jiacheng Xu, Weijian Liao, Hao Ding, Zongzhang Zhang, Yang Yu, Rui Zhao
AAAI 2024 Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations Renzhe Zhou, Chenxiao Gao, Zongzhang Zhang, Yang Yu
NeurIPS 2024 KALM: Knowledgeable Agents by Offline Reinforcement Learning from Large Language Model Rollouts Jing-Cheng Pang, Si-Hang Yang, Kaiyuan Li, Xiong-Hui Chen, Nan Tang, Yang Yu
ICLR 2024 Language Model Self-Improvement by Reinforcement Learning Contemplation Jing-Cheng Pang, Pengyuan Wang, Kaiyuan Li, Xiong-Hui Chen, Jiacheng Xu, Zongzhang Zhang, Yang Yu
ICML 2024 Limited Preference Aided Imitation Learning from Imperfect Demonstrations Xingchen Cao, Fan-Ming Luo, Junyin Ye, Tian Xu, Zhilong Zhang, Yang Yu
NeurIPS 2024 Multi-Agent Domain Calibration with a Handful of Offline Data Tao Jiang, Lei Yuan, Lihe Li, Cong Guan, Zongzhang Zhang, Yang Yu
ICML 2024 Offline Transition Modeling via Contrastive Energy Learning Ruifeng Chen, Chengxing Jia, Zefang Huang, Tian-Shuo Liu, Xu-Hui Liu, Yang Yu
TMLR 2024 One by One, Continual Coordinating with Humans via Hyper-Teammate Identification Cong Guan, Feng Chen, Ke Xue, Chunpeng Fan, Lichao Zhang, Ziqian Zhang, Pengyao Zhao, Zongzhang Zhang, Chao Qian, Lei Yuan, Yang Yu
NeurIPS 2024 Policy Learning from Tutorial Books via Understanding, Rehearsing and Introspecting Xiong-Hui Chen, Ziyan Wang, Yali Du, Shengyi Jiang, Meng Fang, Yang Yu, Jun Wang
ICLR 2024 Policy Rehearsing: Training Generalizable Policies for Reinforcement Learning Chengxing Jia, Chenxiao Gao, Hao Yin, Fuxiang Zhang, Xiong-Hui Chen, Tian Xu, Lei Yuan, Zongzhang Zhang, Zhi-Hua Zhou, Yang Yu
ICML 2024 Policy-Conditioned Environment Models Are More Generalizable Ruifeng Chen, Xiong-Hui Chen, Yihao Sun, Siyuan Xiao, Minhui Li, Yang Yu
IJCAI 2024 Pre-Training General User Representation with Multi-Type APP Behaviors Yuren Zhang, Min Hou, Kai Zhang, Yuqing Yuan, Chao Song, Zhihao Ye, Enhong Chen, Yang Yu
NeurIPS 2024 Provably and Practically Efficient Adversarial Imitation Learning with General Function Approximation Tian Xu, Zhilong Zhang, Ruishuo Chen, Yihao Sun, Yang Yu
ICML 2024 ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models Ziniu Li, Tian Xu, Yushun Zhang, Zhihang Lin, Yang Yu, Ruoyu Sun, Zhi-Quan Luo
AAAI 2024 Rethinking the Development of Large Language Models from the Causal Perspective: A Legal Text Prediction Case Study Haotian Chen, Lingwei Zhang, Yiran Liu, Yang Yu
ICLR 2024 Reward-Consistent Dynamics Models Are Strongly Generalizable for Offline Reinforcement Learning Fan-Ming Luo, Tian Xu, Xingchen Cao, Yang Yu
NeurIPS 2024 Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning Lanqing Li, Hai Zhang, Xinyu Zhang, Shatong Zhu, Yang Yu, Junqiao Zhao, Pheng-Ann Heng
CVPR 2024 Unmixing Before Fusion: A Generalized Paradigm for Multi-Source-Based Hyperspectral Image Synthesis Yang Yu, Erting Pan, Xinya Wang, Yuheng Wu, Xiaoguang Mei, Jiayi Ma
IJCAI 2023 A Unified View of Deep Learning for Reaction and Retrosynthesis Prediction: Current Status and Future Challenges Ziqiao Meng, Peilin Zhao, Yang Yu, Irwin King
NeurIPS 2023 AdaptSSR: Pre-Training User Model with Augmentation-Adaptive Self-Supervised Ranking Yang Yu, Qi Liu, Kai Zhang, Yuren Zhang, Chao Song, Min Hou, Yuqing Yuan, Zhihao Ye, Zaixi Zhang, Sanshi Lei Yu
NeurIPS 2023 Adversarial Counterfactual Environment Model Learning Xiong-Hui Chen, Yang Yu, Zhengmao Zhu, ZhiHua Yu, Chen Zhenjun, Chenghe Wang, Yinan Wu, Rong-Jun Qin, Hongqiu Wu, Ruijin Ding, Huang Fangsheng
AAAI 2023 Anti-Drifting Feature Selection via Deep Reinforcement Learning (Student Abstract) Aoran Wang, Hongyang Yang, Feng Mao, Zongzhang Zhang, Yang Yu, Xiaoyang Liu
NeurIPS 2023 CMMA: Benchmarking Multi-Affection Detection in Chinese Multi-Modal Conversations Yazhou Zhang, Yang Yu, Qing Guo, Benyou Wang, Dongming Zhao, Sagar Uprety, Dawei Song, Qiuchi Li, Jing Qin
AAAI 2023 Deep Anomaly Detection and Search via Reinforcement Learning (Student Abstract) Chao Chen, Dawei Wang, Feng Mao, Zongzhang Zhang, Yang Yu
ICLR 2023 Discovering Generalizable Multi-Agent Coordination Skills from Multi-Task Offline Data Fuxiang Zhang, Chengxing Jia, Yi-Chen Li, Lei Yuan, Yang Yu, Zongzhang Zhang
IJCAI 2023 Doubly Stochastic Graph-Based Non-Autoregressive Reaction Prediction Ziqiao Meng, Peilin Zhao, Yang Yu, Irwin King
NeurIPSW 2023 Exploring the Building Blocks of Cell Organization as High-Order Network Motifs with Graph Isomorphism Network Yang Yu, Shuang Wang, Dong Xu, Juexin Wang
UAI 2023 Fast Teammate Adaptation in the Presence of Sudden Policy Change Ziqian Zhang, Lei Yuan, Lihe Li, Ke Xue, Chengxing Jia, Cong Guan, Chao Qian, Yang Yu
NeurIPS 2023 Imitation Learning from Imperfection: Theoretical Justifications and Algorithms Ziniu Li, Tian Xu, Zeyu Qin, Yang Yu, Zhi-Quan Luo
AAAI 2023 Learning Generalizable Batch Active Learning Strategies via Deep Q-Networks (Student Abstract) Yi-Chen Li, Wen-Jie Shen, Boyu Zhang, Feng Mao, Zongzhang Zhang, Yang Yu
NeurIPS 2023 Learning World Models with Identifiable Factorization Yuren Liu, Biwei Huang, Zhengmao Zhu, Honglong Tian, Mingming Gong, Yang Yu, Kun Zhang
AAAI 2023 Model-Based Offline Weighted Policy Optimization (Student Abstract) Renzhe Zhou, Zongzhang Zhang, Yang Yu
ICML 2023 Model-Bellman Inconsistency for Model-Based Offline Reinforcement Learning Yihao Sun, Jiaji Zhang, Chengxing Jia, Haoxin Lin, Junyin Ye, Yang Yu
NeurIPS 2023 Natural Language Instruction-Following with Task-Related Language Development and Translation Jing-Cheng Pang, Xin-Yu Yang, Si-Hang Yang, Xiong-Hui Chen, Yang Yu
ICML 2023 Policy Regularization with Dataset Constraint for Offline Reinforcement Learning Yuhang Ran, Yi-Chen Li, Fuxiang Zhang, Zongzhang Zhang, Yang Yu
AAAI 2023 Policy-Independent Behavioral Metric-Based Representation for Deep Reinforcement Learning Weijian Liao, Zongzhang Zhang, Yang Yu
UAI 2023 Provably Efficient Adversarial Imitation Learning with Unknown Transitions Tian Xu, Ziniu Li, Yang Yu, Zhi-Quan Luo
AAAI 2023 Robust Multi-Agent Coordination via Evolutionary Generation of Auxiliary Adversarial Attackers Lei Yuan, Ziqian Zhang, Ke Xue, Hao Yin, Feng Chen, Cong Guan, Lihe Li, Chao Qian, Yang Yu
ICML 2023 Uncertainty Estimation by Fisher Information-Based Evidential Deep Learning Danruo Deng, Guangyong Chen, Yang Yu, Furui Liu, Pheng-Ann Heng
AAAI 2023 Untargeted Attack Against Federated Recommendation Systems via Poisonous Item Embeddings and the Defense Yang Yu, Qi Liu, Likang Wu, Runlong Yu, Sanshi Lei Yu, Zaixi Zhang
ICLR 2022 Active Hierarchical Exploration with Stable Subgoal Representation Learning Siyuan Li, Jin Zhang, Jianhao Wang, Yang Yu, Chongjie Zhang
AAAI 2022 Adapt to Environment Sudden Changes by Learning a Context Sensitive Policy Fan-Ming Luo, Shengyi Jiang, Yang Yu, Zongzhang Zhang, Yi-Feng Zhang
NeurIPS 2022 Bayesian Optimistic Optimization: Optimistic Exploration for Model-Based Reinforcement Learning Chenyang Wu, Tianci Li, Zongzhang Zhang, Yang Yu
ICLR 2022 Context-Aware Sparse Deep Coordination Graphs Tonghan Wang, Liang Zeng, Weijun Dong, Qianlan Yang, Yang Yu, Chongjie Zhang
JMLR 2022 Distributed Bootstrap for Simultaneous Inference Under High Dimensionality Yang Yu, Shih-Kang Chao, Guang Cheng
NeurIPS 2022 Efficient Multi-Agent Communication via Self-Supervised Information Aggregation Cong Guan, Feng Chen, Lei Yuan, Chenghe Wang, Hao Yin, Zongzhang Zhang, Yang Yu
IJCAI 2022 Efficient Multi-Agent Communication via Shapley Message Value Di Xue, Lei Yuan, Zongzhang Zhang, Yang Yu
MLJ 2022 Improve Generated Adversarial Imitation Learning with Reward Variance Regularization Yi-Feng Zhang, Fan-Ming Luo, Yang Yu
AAAI 2022 Invariant Action Effect Model for Reinforcement Learning Zheng-Mao Zhu, Shengyi Jiang, Yu-Ren Liu, Yang Yu, Kun Zhang
ICLR 2022 Learning Efficient Online 3D Bin Packing on Packing Configuration Trees Hang Zhao, Yang Yu, Kai Xu
IJCAI 2022 Multi-Agent Concentrative Coordination with Decentralized Task Representation Lei Yuan, Chenghe Wang, Jianhao Wang, Fuxiang Zhang, Feng Chen, Cong Guan, Zongzhang Zhang, Chongjie Zhang, Yang Yu
NeurIPS 2022 Multi-Agent Dynamic Algorithm Configuration Ke Xue, Jiacheng Xu, Lei Yuan, Miqing Li, Chao Qian, Zongzhang Zhang, Yang Yu
AAAI 2022 Multi-Agent Incentive Communication via Decentralized Teammate Modeling Lei Yuan, Jianhao Wang, Fuxiang Zhang, Chenghe Wang, Zongzhang Zhang, Yang Yu, Chongjie Zhang
NeurIPSW 2022 Multi-Agent Policy Transfer via Task Relationship Modeling Rong-Jun Qin, Feng Chen, Tonghan Wang, Lei Yuan, Xiaoran Wu, Yipeng Kang, Zongzhang Zhang, Chongjie Zhang, Yang Yu
NeurIPS 2022 NeoRL: A near Real-World Benchmark for Offline Reinforcement Learning Rong-Jun Qin, Xingyuan Zhang, Songyi Gao, Xiong-Hui Chen, Zewen Li, Weinan Zhang, Yang Yu
JAIR 2022 On Efficient Reinforcement Learning for Full-Length Game of StarCraft II Ruo-Ze Liu, Zhen-Jia Pang, Zhou-Yu Meng, Wenhai Wang, Yang Yu, Tong Lu
ICML 2022 The Teaching Dimension of Regularized Kernel Learners Hong Qian, Xu-Hui Liu, Chen-Xi Su, Aimin Zhou, Yang Yu
NeurIPS 2021 Adaptive Online Packing-Guided Search for POMDPs Chenyang Wu, Guoyu Yang, Zongzhang Zhang, Yang Yu, Dong Li, Wulong Liu, Jianye Hao
AAAI 2021 Circles Are like Ellipses, or Ellipses Are like Circles? Measuring the Degree of Asymmetry of Static and Contextual Word Embeddings and the Implications to Representation Learning Wei Zhang, Murray Campbell, Yang Yu, Sadhana Kumaravel
NeurIPS 2021 Cross-Modal Domain Adaptation for Cost-Efficient Visual Reinforcement Learning Xiong-Hui Chen, Shengyi Jiang, Feng Xu, Zongzhang Zhang, Yang Yu
AAAI 2021 Enhancing Context-Based Meta-Reinforcement Learning Algorithms via an Efficient Task Encoder (Student Abstract) Feng Xu, Shengyi Jiang, Hao Yin, Zongzhang Zhang, Yang Yu, Ming Li, Dong Li, Wulong Liu
IJCAI 2021 Fast Pareto Optimization for Subset Selection with Dynamic Cost Constraints Chao Bian, Chao Qian, Frank Neumann, Yang Yu
AAAI 2021 Incorporating Bidirection-Interactive Information and Semantic Features for Relational Facts Extraction (Student Abstract) Yang Yu, Guohua Wang, Haopeng Ren, Yi Cai
AAAI 2021 LB-DESPOT: Efficient Online POMDP Planning Considering Lower Bound in Action Selection (Student Abstract) Chenyang Wu, Rui Kong, Guoyu Yang, Xianghan Kong, Zongzhang Zhang, Yang Yu, Dong Li, Wulong Liu
NeurIPS 2021 Offline Model-Based Adaptable Policy Learning Xiong-Hui Chen, Yang Yu, Qingyang Li, Fan-Ming Luo, Zhiwei Qin, Wenjie Shang, Jieping Ye
MLJ 2021 Partially Observable Environment Estimation with Uplift Inference for Reinforcement Learning Based Recommendation Wenjie Shang, Qingyang Li, Zhiwei Qin, Yang Yu, Yiping Meng, Jieping Ye
ICLR 2021 QPLEX: Duplex Dueling Multi-Agent Q-Learning Jianhao Wang, Zhizhou Ren, Terry Liu, Yang Yu, Chongjie Zhang
NeurIPS 2021 Regret Minimization Experience Replay in Off-Policy Reinforcement Learning Xu-Hui Liu, Zhenghai Xue, Jingcheng Pang, Shengyi Jiang, Feng Xu, Yang Yu
AAAI 2020 An Efficient Evolutionary Algorithm for Subset Selection with General Cost Constraints Chao Bian, Chao Feng, Chao Qian, Yang Yu
NeurIPS 2020 Error Bounds of Imitating Policies and Environments Tian Xu, Ziniu Li, Yang Yu
NeurIPS 2020 Offline Imitation Learning with a Misspecified Simulator Shengyi Jiang, Jingcheng Pang, Yang Yu
NeurIPS 2020 RetroXpert: Decompose Retrosynthesis Prediction like a Chemist Chaochao Yan, Qianggang Ding, Peilin Zhao, Shuangjia Zheng, Jinyu Yang, Yang Yu, Junzhou Huang
ICML 2020 Simultaneous Inference for Massive Data: Distributed Bootstrap Yang Yu, Shih-Kang Chao, Guang Cheng
NeurIPS 2019 Bridging Machine Learning and Logical Reasoning by Abductive Learning Wang-Zhou Dai, Qiuling Xu, Yang Yu, Zhi-Hua Zhou
IJCAI 2019 Cascaded Algorithm-Selection and Hyper-Parameter Optimization with Extreme-Region Upper Confidence Bound Bandit Yi-Qi Hu, Yang Yu, Jun-Da Liao
AAAI 2019 Multi-Fidelity Automatic Hyper-Parameter Tuning via Transfer Series Expansion Yi-Qi Hu, Yang Yu, Wei-Wei Tu, Qiang Yang, Yuqiang Chen, Wenyuan Dai
AAAI 2019 On Reinforcement Learning for Full-Length Game of StarCraft Zhen-Jia Pang, Ruo-Ze Liu, Zhou-Yu Meng, Yi Zhang, Yang Yu, Tong Lu
IJCAI 2019 Reinforcement Learning Experience Reuse with Policy Residual Representation Wen-Ji Zhou, Yang Yu, Yingfeng Chen, Kai Guan, Tangjie Lv, Changjie Fan, Zhi-Hua Zhou
AAAI 2019 Virtual-Taobao: Virtualizing Real-World Online Retail Environment for Reinforcement Learning Jing-Cheng Shi, Yang Yu, Qing Da, Shi-Yong Chen, Anxiang Zeng
IJCAI 2018 Approximation Guarantees of Stochastic Greedy Algorithms for Subset Selection Chao Qian, Yang Yu, Ke Tang
IJCAI 2018 Experienced Optimization with Reusable Directional Model for Hyper-Parameter Search Yi-Qi Hu, Yang Yu, Zhi-Hua Zhou
IJCAI 2018 Learning Environmental Calibration Actions for Policy Self-Evolution Chao Zhang, Yang Yu, Zhi-Hua Zhou
IJCAI 2018 Mixture of GANs for Clustering Yang Yu, Wen-Ji Zhou
NeurIPS 2018 Multi-Layered Gradient Boosting Decision Trees Ji Feng, Yang Yu, Zhi-Hua Zhou
AAAI 2018 Noisy Derivative-Free Optimization with Value Suppression Hong Wang, Hong Qian, Yang Yu
IJCAI 2018 Towards Sample Efficient Reinforcement Learning Yang Yu
IJCAI 2017 AGRA: An Analysis-Generation-Ranking Framework for Automatic Abbreviation from Paper Titles Jianbing Zhang, Yixin Sun, Shujian Huang, Cam-Tu Nguyen, Xiaoliang Wang, Xinyu Dai, Jiajun Chen, Yang Yu
IJCAI 2017 Binary Linear Compression for Multi-Label Classification Wen-Ji Zhou, Yang Yu, Min-Ling Zhang
IJCAI 2017 Life-Stage Modeling by Customer-Manifold Embedding Jing-Wen Yang, Yang Yu, Xiao-Peng Zhang
IJCAI 2017 On Subset Selection with General Cost Constraints Chao Qian, Jing-Cheng Shi, Yang Yu, Ke Tang
IJCAI 2017 Open Category Classification by Adversarial Sample Generation Yang Yu, Wei-Yang Qu, Nan Li, Zimin Guo
IJCAI 2017 Optimizing Ratio of Monotone Set Functions Chao Qian, Jing-Cheng Shi, Yang Yu, Ke Tang, Zhi-Hua Zhou
AAAI 2017 Sequential Classification-Based Optimization for Direct Policy Search Yi-Qi Hu, Hong Qian, Yang Yu
AAAI 2017 Solving High-Dimensional Multi-Objective Optimization Problems with Low Effective Dimensions Hong Qian, Yang Yu
NeurIPS 2017 Subset Selection Under Noise Chao Qian, Jing-Cheng Shi, Yang Yu, Ke Tang, Zhi-Hua Zhou
AAAI 2016 Decentralized Robust Subspace Clustering Bo Liu, Xiao-Tong Yuan, Yang Yu, Qingshan Liu, Dimitris N. Metaxas
IJCAI 2016 Derivative-Free Optimization of High-Dimensional Non-Convex Functions by Sequential Random Embeddings Hong Qian, Yi-Qi Hu, Yang Yu
AAAI 2016 Derivative-Free Optimization via Classification Yang Yu, Hong Qian, Yi-Qi Hu
AAAI 2016 MicroScholar: Mining Scholarly Information from Chinese Microblogs Yang Yu, Xiaojun Wan
IJCAI 2016 Parallel Pareto Optimization for Subset Selection Chao Qian, Jing-Cheng Shi, Yang Yu, Ke Tang, Zhi-Hua Zhou
AAAI 2016 Scaling Simultaneous Optimistic Optimization for High-Dimensional Non-Convex Functions with Low Effective Dimensions Hong Qian, Yang Yu
IJCAI 2015 On Constrained Boolean Pareto Optimization Chao Qian, Yang Yu, Zhi-Hua Zhou
AAAI 2015 Pareto Ensemble Pruning Chao Qian, Yang Yu, Zhi-Hua Zhou
NeurIPS 2015 Subset Selection by Pareto Optimization Chao Qian, Yang Yu, Zhi-Hua Zhou
AAAI 2014 Learning with Augmented Class by Exploiting Unlabeled Data Qing Da, Yang Yu, Zhi-Hua Zhou
IJCAI 2013 On the Approximation Ability of Evolutionary Optimization with Application to Minimum Set Cover: Extended Abstract Yang Yu, Xin Yao, Zhi-Hua Zhou
ECML-PKDD 2012 Diversity Regularized Ensemble Pruning Nan Li, Yang Yu, Zhi-Hua Zhou
IJCAI 2011 Diversity Regularized Machine Yang Yu, Yufeng Li, Zhi-Hua Zhou
CVPR 2010 Automatic Image Annotation Using Group Sparsity Shaoting Zhang, Junzhou Huang, Yuchi Huang, Yang Yu, Hongsheng Li, Dimitris N. Metaxas
JAIR 2008 Spectrum of Variable-Random Trees Fei Tony Liu, Kai Ming Ting, Yang Yu, Zhi-Hua Zhou
AAAI 2006 A New Approach to Estimating the Expected First Hitting Time of Evolutionary Algorithms Yang Yu, Zhi-Hua Zhou