ML Anthology
Authors
Search
About
Lang, Leon
5 publications
TMLR
2025
Modeling Human Beliefs About AI Behavior for Scalable Oversight
Leon Lang
,
Patrick Forré
ICML
2025
The Perils of Optimizing Learned Reward Functions: Low Training Error Does Not Guarantee Low Regret
Lukas Fluri
,
Leon Lang
,
Alessandro Abate
,
Patrick Forré
,
David Krueger
,
Joar Max Viktor Skalse
NeurIPS
2024
When Your AIs Deceive You: Challenges of Partial Observability in Reinforcement Learning from Human Feedback
Leon Lang
,
Davis Foote
,
Stuart Russell
,
Anca Dragan
,
Erik Jenner
,
Scott Emmons
ICLR
2022
A Program to Build E(N)-Equivariant Steerable CNNs
Gabriele Cesa
,
Leon Lang
,
Maurice Weiler
ICLR
2021
A Wigner-Eckart Theorem for Group Equivariant Convolution Kernels
Leon Lang
,
Maurice Weiler