ML Anthology
Authors
Search
About
Kummerfeld, Jonathan K.
2 publications
ICML
2024
A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity
Andrew Lee
,
Xiaoyan Bai
,
Itamar Pres
,
Martin Wattenberg
,
Jonathan K. Kummerfeld
,
Rada Mihalcea
NeurIPS
2019
No-Press Diplomacy: Modeling Multi-Agent Gameplay
Philip Paquette
,
Yuchen Lu
,
Seton Steven Bocco
,
Max Smith
,
Satya O.-G.
,
Jonathan K. Kummerfeld
,
Joelle Pineau
,
Satinder Singh
,
Aaron C. Courville