Wachi, Akifumi

13 publications

NeurIPS 2025 A Provable Approach for End-to-End Safe Reinforcement Learning Akifumi Wachi, Kohei Miyaguchi, Takumi Tanabe, Rei Sato, Youhei Akimoto
NeurIPS 2025 Offline Guarded Safe Reinforcement Learning for Medical Treatment Optimization Strategies Runze Yan, Xun Shen, Akifumi Wachi, Sebastien Gros, Anni Zhao, Xiao Hu
IJCAI 2024 A Survey of Constraint Formulations in Safe Reinforcement Learning Akifumi Wachi, Xun Shen, Yanan Sui
NeurIPS 2024 Flipping-Based Policy for Chance-Constrained Markov Decision Processes Xun Shen, Shuo Jiang, Akifumi Wachi, Kazumune Hashimoto, Sebastien Gros
AAAI 2024 Long-Term Safe Reinforcement Learning with Binary Feedback Akifumi Wachi, Wataru Hashimoto, Kazumune Hashimoto
NeurIPS 2024 Stepwise Alignment for Constrained Language Model Policy Optimization Akifumi Wachi, Thien Q. Tran, Rei Sato, Takumi Tanabe, Youhei Akimoto
NeurIPS 2023 Safe Exploration in Reinforcement Learning: A Generalized Formulation and Algorithms Akifumi Wachi, Wataru Hashimoto, Xun Shen, Kazumune Hashimoto
NeurIPSW 2023 Verbosity Bias in Preference Labeling by Large Language Models Keita Saito, Akifumi Wachi, Koki Wataoka, Youhei Akimoto
NeurIPSW 2022 SCERL: A Benchmark for Intersecting Language and Safe Reinforcement Learning Lan Hoang, Shivam Ratnakar, Nicolas Galichet, Akifumi Wachi, Keerthiram Murugesan, Songtao Lu, Mattia Atzeni, Michael Katz, Subhajit Chaudhury
NeurIPS 2021 Safe Policy Optimization with Local Generalized Linear Function Approximations Akifumi Wachi, Yunyue Wei, Yanan Sui
ICML 2020 Safe Reinforcement Learning in Constrained Markov Decision Processes Akifumi Wachi, Yanan Sui
IJCAI 2019 Failure-Scenario Maker for Rule-Based Agent Using Multi-Agent Adversarial Reinforcement Learning and Its Application to Autonomous Driving Akifumi Wachi
AAAI 2018 Safe Exploration and Optimization of Constrained MDPs Using Gaussian Processes Akifumi Wachi, Yanan Sui, Yisong Yue, Masahiro Ono