Zhang, Cyril

31 publications

ICML 2025 On the Query Complexity of Verifier-Assisted Language Generation Edoardo Botta, Yuchen Li, Aashay Mehta, Jordan T. Ash, Cyril Zhang, Andrej Risteski
ICLRW 2025 On the Query Complexity of Verifier-Assisted Language Generation Edoardo Botta, Yuchen Li, Aashay Mehta, Jordan T. Ash, Cyril Zhang, Andrej Risteski
ICLR 2025 Self-Improvement in Language Models: The Sharpening Mechanism Audrey Huang, Adam Block, Dylan J Foster, Dhruv Rohatgi, Cyril Zhang, Max Simchowitz, Jordan T. Ash, Akshay Krishnamurthy
ICLR 2024 Butterfly Effects of SGD Noise: Error Amplification in Behavior Cloning and Autoregression Adam Block, Dylan J Foster, Akshay Krishnamurthy, Max Simchowitz, Cyril Zhang
NeurIPS 2024 Can Large Language Models Explore In-Context? Akshay Krishnamurthy, Keegan Harris, Dylan J. Foster, Cyril Zhang, Aleksandrs Slivkins
ICMLW 2024 Can Large Language Models Explore In-Context? Akshay Krishnamurthy, Keegan Harris, Dylan J Foster, Cyril Zhang, Aleksandrs Slivkins
NeurIPSW 2024 Self-Improvement in Language Models: The Sharpening Mechanism Audrey Huang, Adam Block, Dylan J Foster, Dhruv Rohatgi, Cyril Zhang, Max Simchowitz, Jordan T. Ash, Akshay Krishnamurthy
NeurIPS 2023 Exposing Attention Glitches with Flip-Flop Language Modeling Bingbin Liu, Jordan Ash, Surbhi Goel, Akshay Krishnamurthy, Cyril Zhang
ICMLW 2023 Exposing Attention Glitches with Flip-Flop Language Modeling Bingbin Liu, Jordan T. Ash, Surbhi Goel, Akshay Krishnamurthy, Cyril Zhang
COLT 2023 Learning Hidden Markov Models Using Conditional Samples Gaurav Mahajan, Sham Kakade, Akshay Krishnamurthy, Cyril Zhang
NeurIPS 2023 Pareto Frontiers in Deep Feature Learning: Data, Compute, Width, and Luck Benjamin Edelman, Surbhi Goel, Sham Kakade, Eran Malach, Cyril Zhang
ICLR 2023 Transformers Learn Shortcuts to Automata Bingbin Liu, Jordan T. Ash, Surbhi Goel, Akshay Krishnamurthy, Cyril Zhang
ICLR 2022 Anti-Concentrated Confidence Bonuses for Scalable Exploration Jordan T. Ash, Cyril Zhang, Surbhi Goel, Akshay Krishnamurthy, Sham M. Kakade
NeurIPS 2022 Hidden Progress in Deep Learning: SGD Learns Parities near the Computational Limit Boaz Barak, Benjamin Edelman, Surbhi Goel, Sham Kakade, Eran Malach, Cyril Zhang
ICML 2022 Inductive Biases and Variable Creation in Self-Attention Mechanisms Benjamin L Edelman, Surbhi Goel, Sham Kakade, Cyril Zhang
NeurIPS 2022 Recurrent Convolutional Neural Networks Learn Succinct Learning Algorithms Surbhi Goel, Sham Kakade, Adam Kalai, Cyril Zhang
ICML 2022 Sparsity in Partially Controllable Linear Systems Yonathan Efroni, Sham Kakade, Akshay Krishnamurthy, Cyril Zhang
ICML 2022 Understanding Contrastive Learning Requires Incorporating Inductive Biases Nikunj Saunshi, Jordan Ash, Surbhi Goel, Dipendra Misra, Cyril Zhang, Sanjeev Arora, Sham Kakade, Akshay Krishnamurthy
ICML 2021 Acceleration via Fractal Learning Rate Schedules Naman Agarwal, Surbhi Goel, Cyril Zhang
NeurIPSW 2021 Catastrophic Failures of Neural Active Learning on Heteroskedastic Distributions Savya Khosla, Alex Lamb, Jordan T. Ash, Cyril Zhang, Kenji Kawaguchi
ICML 2020 Calibration, Entropy Rates, and Memory in Language Models Mark Braverman, Xinyi Chen, Sham Kakade, Karthik Narasimhan, Cyril Zhang, Yi Zhang
ICLR 2020 Extreme Tensoring for Low-Memory Preconditioning Xinyi Chen, Naman Agarwal, Elad Hazan, Cyril Zhang, Yi Zhang
COLT 2020 No-Regret Prediction in Marginally Stable Systems Udaya Ghai, Holden Lee, Karan Singh, Cyril Zhang, Yi Zhang
ICLR 2020 Revisiting the Generalization of Adaptive Gradient Methods Naman Agarwal, Rohan Anil, Elad Hazan, Tomer Koren, Cyril Zhang
ALT 2020 Robust Guarantees for Learning an Autoregressive Filter Holden Lee, Cyril Zhang
NeurIPS 2020 Stochastic Optimization with Laggard Data Pipelines Naman Agarwal, Rohan Anil, Tomer Koren, Kunal Talwar, Cyril Zhang
ICML 2019 Efficient Full-Matrix Adaptive Regularization Naman Agarwal, Brian Bullins, Xinyi Chen, Elad Hazan, Karan Singh, Cyril Zhang, Yi Zhang
ICLR 2018 Not-so-Random Features Brian Bullins, Cyril Zhang, Yi Zhang
NeurIPS 2018 Spectral Filtering for General Linear Dynamical Systems Elad Hazan, Holden Lee, Karan Singh, Cyril Zhang, Yi Zhang
ICML 2017 Efficient Regret Minimization in Non-Convex Games Elad Hazan, Karan Singh, Cyril Zhang
NeurIPS 2017 Learning Linear Dynamical Systems via Spectral Filtering Elad Hazan, Karan Singh, Cyril Zhang