ML Anthology
Authors
Search
About
Wachi, Akifumi
13 publications
NeurIPS
2025
A Provable Approach for End-to-End Safe Reinforcement Learning
Akifumi Wachi
,
Kohei Miyaguchi
,
Takumi Tanabe
,
Rei Sato
,
Youhei Akimoto
NeurIPS
2025
Offline Guarded Safe Reinforcement Learning for Medical Treatment Optimization Strategies
Runze Yan
,
Xun Shen
,
Akifumi Wachi
,
Sebastien Gros
,
Anni Zhao
,
Xiao Hu
IJCAI
2024
A Survey of Constraint Formulations in Safe Reinforcement Learning
Akifumi Wachi
,
Xun Shen
,
Yanan Sui
NeurIPS
2024
Flipping-Based Policy for Chance-Constrained Markov Decision Processes
Xun Shen
,
Shuo Jiang
,
Akifumi Wachi
,
Kazumune Hashimoto
,
Sebastien Gros
AAAI
2024
Long-Term Safe Reinforcement Learning with Binary Feedback
Akifumi Wachi
,
Wataru Hashimoto
,
Kazumune Hashimoto
NeurIPS
2024
Stepwise Alignment for Constrained Language Model Policy Optimization
Akifumi Wachi
,
Thien Q. Tran
,
Rei Sato
,
Takumi Tanabe
,
Youhei Akimoto
NeurIPS
2023
Safe Exploration in Reinforcement Learning: A Generalized Formulation and Algorithms
Akifumi Wachi
,
Wataru Hashimoto
,
Xun Shen
,
Kazumune Hashimoto
NeurIPSW
2023
Verbosity Bias in Preference Labeling by Large Language Models
Keita Saito
,
Akifumi Wachi
,
Koki Wataoka
,
Youhei Akimoto
NeurIPSW
2022
SCERL: A Benchmark for Intersecting Language and Safe Reinforcement Learning
Lan Hoang
,
Shivam Ratnakar
,
Nicolas Galichet
,
Akifumi Wachi
,
Keerthiram Murugesan
,
Songtao Lu
,
Mattia Atzeni
,
Michael Katz
,
Subhajit Chaudhury
NeurIPS
2021
Safe Policy Optimization with Local Generalized Linear Function Approximations
Akifumi Wachi
,
Yunyue Wei
,
Yanan Sui
ICML
2020
Safe Reinforcement Learning in Constrained Markov Decision Processes
Akifumi Wachi
,
Yanan Sui
IJCAI
2019
Failure-Scenario Maker for Rule-Based Agent Using Multi-Agent Adversarial Reinforcement Learning and Its Application to Autonomous Driving
Akifumi Wachi
AAAI
2018
Safe Exploration and Optimization of Constrained MDPs Using Gaussian Processes
Akifumi Wachi
,
Yanan Sui
,
Yisong Yue
,
Masahiro Ono