An, Bo
159 publications
NeurIPS
2025
Deciphering the Extremes: A Novel Approach for Pathological Long-Tailed Recognition in Scientific Discovery
NeurIPS
2025
Empirical Study on Robustness and Resilience in Cooperative Multi-Agent Reinforcement Learning
NeurIPS
2025
Establishing Linear Surrogate Regret Bounds for Convex Smooth Losses via Convolutional Fenchel–Young Losses
NeurIPS
2025
Improving Reward Models with Proximal Policy Exploration for Preference-Based Reinforcement Learning
NeurIPS
2025
Let's Revise Step-by-Step: A Unified Local Search Framework for Code Generation with LLMs
NeurIPS
2025
MF-LLM: Simulating Population Decision Dynamics via a Mean-Field Large Language Model Framework
ICML
2025
Policy Regularization on Globally Accessible States in Cross-Dynamics Reinforcement Learning
ICML
2025
Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning
NeurIPS
2024
MaNo: Exploiting Matrix Norm for Unsupervised Accuracy Estimation Under Distribution Shifts
NeurIPSW
2024
Optimizing Reward Models with Proximal Policy Exploration in Preference-Based Reinforcement Learning
ICLRW
2024
Towards General Computer Control: A Multimodal Agent for Red Dead Redemption II as a Case Study
ICML
2023
Controlling Type Confounding in Ad Hoc Teamwork with Instance-Wise Teammate Feedback Rectification
TMLR
2023
PRUDEX-Compass: Towards Systematic Evaluation of Reinforcement Learning in Financial Markets
NeurIPS
2023
TradeMaster: A Holistic Quantitative Trading Platform Empowered by Reinforcement Learning
IJCAI
2022
Correlation-Based Algorithm for Team-Maxmin Equilibrium in Multiplayer Extensive-Form Games
NeurIPS
2022
Deep Attentive Belief Propagation: Integrating Reasoning and Learning for Solving Constraint Optimization Problems
NeurIPS
2022
Generalizing Consistent Multi-Class Classification with Rejection to Be Compatible with Arbitrary Losses
NeurIPS
2022
Out-of-Distribution Detection with an Adaptive Likelihood Ratio on Informative Hierarchical VAE
IJCAI
2021
CFR-MIX: Solving Imperfect Information Extensive-Form Games with Combinatorial Action Space
AAAI
2021
Commission Fee Is Not Enough: A Hierarchical Reinforced Framework for Portfolio Management
AAAI
2021
Computing Ex Ante Coordinated Team-Maxmin Equilibria in Zero-Sum Multiplayer Extensive-Form Games
IJCAI
2021
Solving Large-Scale Extensive-Form Network Security Games via Neural Fictitious Self-Play
IJCAI
2020
Speeding up Incomplete GDL-Based Algorithms for Multi-Agent Optimization with Dense Local Utilities
AAAI
2016
Optimizing Personalized Email Filtering Thresholds to Mitigate Sequential Spear Phishing Attacks