ML Anthology
Zhao, Rosie
10 publications
ICLR 2025
Deconstructing What Makes a Good Optimizer for Autoregressive Language Models
Rosie Zhao, Depen Morwani, David Brandfonbrener, Nikhil Vyas, Sham M. Kakade
ICLR 2025
SOAP: Improving and Stabilizing Shampoo Using Adam for Language Modeling
Nikhil Vyas, Depen Morwani, Rosie Zhao, Itai Shapira, David Brandfonbrener, Lucas Janson, Sham M. Kakade
ICML 2024
Beyond Implicit Bias: The Insignificance of SGD Noise in Online Learning
Nikhil Vyas, Depen Morwani, Rosie Zhao, Gal Kaplun, Sham M. Kakade, Boaz Barak
NeurIPSW 2024
Deconstructing What Makes a Good Optimizer for Language Models
Rosie Zhao, Depen Morwani, David Brandfonbrener, Nikhil Vyas, Sham M. Kakade
NeurIPSW 2024
Distributional Scaling Laws for Emergent Capabilities
Rosie Zhao, Naomi Saphra, Sham M. Kakade
ICLR 2024
Feature Emergence via Margin Maximization: Case Studies in Algebraic Tasks
Depen Morwani, Benjamin L. Edelman, Costin-Andrei Oncescu, Rosie Zhao, Sham M. Kakade
JMLR 2024
Policy Gradient Methods in the Presence of Symmetries and State Abstractions
Prakash Panangaden, Sahand Rezaei-Shoshtari, Rosie Zhao, David Meger, Doina Precup
NeurIPSW 2024
SOAP: Improving and Stabilizing Shampoo Using Adam
Nikhil Vyas, Depen Morwani, Rosie Zhao, Itai Shapira, David Brandfonbrener, Lucas Janson, Sham M. Kakade
CoLLAs 2023
Loss of Plasticity in Continual Deep Reinforcement Learning
Zaheer Abbas, Rosie Zhao, Joseph Modayil, Adam White, Marlos C. Machado
NeurIPS 2022
Continuous MDP Homomorphisms and Homomorphic Policy Gradient
Sahand Rezaei-Shoshtari, Rosie Zhao, Prakash Panangaden, David Meger, Doina Precup