Achiam, Joshua

4 publications

ICMLW 2024 Rule Based Rewards for Fine-Grained LLM Safety Tong Mu, Alec Helyar, Johannes Heidecke, Joshua Achiam, Andrea Vallone, Ian D Kivlichan, Molly Lin, Alex Beutel, John Schulman, Lilian Weng
NeurIPS 2024 Rule Based Rewards for Language Model Safety Tong Mu, Alec Helyar, Johannes Heidecke, Joshua Achiam, Andrea Vallone, Ian Kivlichan, Molly Lin, Alex Beutel, John Schulman, Lilian Weng
ICML 2020 Responsive Safety in Reinforcement Learning by PID Lagrangian Methods Adam Stooke, Joshua Achiam, Pieter Abbeel
ICML 2017 Constrained Policy Optimization Joshua Achiam, David Held, Aviv Tamar, Pieter Abbeel