ML Anthology
Zhang, Cyril
31 publications
ICML
2025
On the Query Complexity of Verifier-Assisted Language Generation
Edoardo Botta, Yuchen Li, Aashay Mehta, Jordan T. Ash, Cyril Zhang, Andrej Risteski
ICLRW
2025
On the Query Complexity of Verifier-Assisted Language Generation
Edoardo Botta, Yuchen Li, Aashay Mehta, Jordan T. Ash, Cyril Zhang, Andrej Risteski
ICLR
2025
Self-Improvement in Language Models: The Sharpening Mechanism
Audrey Huang, Adam Block, Dylan J Foster, Dhruv Rohatgi, Cyril Zhang, Max Simchowitz, Jordan T. Ash, Akshay Krishnamurthy
ICLR
2024
Butterfly Effects of SGD Noise: Error Amplification in Behavior Cloning and Autoregression
Adam Block, Dylan J Foster, Akshay Krishnamurthy, Max Simchowitz, Cyril Zhang
NeurIPS
2024
Can Large Language Models Explore In-Context?
Akshay Krishnamurthy, Keegan Harris, Dylan J. Foster, Cyril Zhang, Aleksandrs Slivkins
ICMLW
2024
Can Large Language Models Explore In-Context?
Akshay Krishnamurthy, Keegan Harris, Dylan J Foster, Cyril Zhang, Aleksandrs Slivkins
NeurIPSW
2024
Self-Improvement in Language Models: The Sharpening Mechanism
Audrey Huang, Adam Block, Dylan J Foster, Dhruv Rohatgi, Cyril Zhang, Max Simchowitz, Jordan T. Ash, Akshay Krishnamurthy
NeurIPS
2023
Exposing Attention Glitches with Flip-Flop Language Modeling
Bingbin Liu, Jordan Ash, Surbhi Goel, Akshay Krishnamurthy, Cyril Zhang
ICMLW
2023
Exposing Attention Glitches with Flip-Flop Language Modeling
Bingbin Liu, Jordan T. Ash, Surbhi Goel, Akshay Krishnamurthy, Cyril Zhang
COLT
2023
Learning Hidden Markov Models Using Conditional Samples
Gaurav Mahajan, Sham Kakade, Akshay Krishnamurthy, Cyril Zhang
NeurIPS
2023
Pareto Frontiers in Deep Feature Learning: Data, Compute, Width, and Luck
Benjamin Edelman, Surbhi Goel, Sham Kakade, Eran Malach, Cyril Zhang
ICLR
2023
Transformers Learn Shortcuts to Automata
Bingbin Liu, Jordan T. Ash, Surbhi Goel, Akshay Krishnamurthy, Cyril Zhang
ICLR
2022
Anti-Concentrated Confidence Bonuses for Scalable Exploration
Jordan T. Ash, Cyril Zhang, Surbhi Goel, Akshay Krishnamurthy, Sham M. Kakade
NeurIPS
2022
Hidden Progress in Deep Learning: SGD Learns Parities near the Computational Limit
Boaz Barak, Benjamin Edelman, Surbhi Goel, Sham Kakade, Eran Malach, Cyril Zhang
ICML
2022
Inductive Biases and Variable Creation in Self-Attention Mechanisms
Benjamin L Edelman, Surbhi Goel, Sham Kakade, Cyril Zhang
NeurIPS
2022
Recurrent Convolutional Neural Networks Learn Succinct Learning Algorithms
Surbhi Goel, Sham Kakade, Adam Kalai, Cyril Zhang
ICML
2022
Sparsity in Partially Controllable Linear Systems
Yonathan Efroni, Sham Kakade, Akshay Krishnamurthy, Cyril Zhang
ICML
2022
Understanding Contrastive Learning Requires Incorporating Inductive Biases
Nikunj Saunshi, Jordan Ash, Surbhi Goel, Dipendra Misra, Cyril Zhang, Sanjeev Arora, Sham Kakade, Akshay Krishnamurthy
ICML
2021
Acceleration via Fractal Learning Rate Schedules
Naman Agarwal, Surbhi Goel, Cyril Zhang
NeurIPSW
2021
Catastrophic Failures of Neural Active Learning on Heteroskedastic Distributions
Savya Khosla, Alex Lamb, Jordan T. Ash, Cyril Zhang, Kenji Kawaguchi
ICML
2020
Calibration, Entropy Rates, and Memory in Language Models
Mark Braverman, Xinyi Chen, Sham Kakade, Karthik Narasimhan, Cyril Zhang, Yi Zhang
ICLR
2020
Extreme Tensoring for Low-Memory Preconditioning
Xinyi Chen, Naman Agarwal, Elad Hazan, Cyril Zhang, Yi Zhang
COLT
2020
No-Regret Prediction in Marginally Stable Systems
Udaya Ghai, Holden Lee, Karan Singh, Cyril Zhang, Yi Zhang
ICLR
2020
Revisiting the Generalization of Adaptive Gradient Methods
Naman Agarwal, Rohan Anil, Elad Hazan, Tomer Koren, Cyril Zhang
ALT
2020
Robust Guarantees for Learning an Autoregressive Filter
Holden Lee, Cyril Zhang
NeurIPS
2020
Stochastic Optimization with Laggard Data Pipelines
Naman Agarwal, Rohan Anil, Tomer Koren, Kunal Talwar, Cyril Zhang
ICML
2019
Efficient Full-Matrix Adaptive Regularization
Naman Agarwal, Brian Bullins, Xinyi Chen, Elad Hazan, Karan Singh, Cyril Zhang, Yi Zhang
ICLR
2018
Not-so-Random Features
Brian Bullins, Cyril Zhang, Yi Zhang
NeurIPS
2018
Spectral Filtering for General Linear Dynamical Systems
Elad Hazan, Holden Lee, Karan Singh, Cyril Zhang, Yi Zhang
ICML
2017
Efficient Regret Minimization in Non-Convex Games
Elad Hazan, Karan Singh, Cyril Zhang
NeurIPS
2017
Learning Linear Dynamical Systems via Spectral Filtering
Elad Hazan, Karan Singh, Cyril Zhang