Merrill, William

12 publications

TMLR 2026 | The Transformer Cookbook | Andy Yang, Christopher Watson, Anton Xue, Satwik Bhattamishra, Jose Llarena, William Merrill, Emile Dos Santos Ferreira, Anej Svete, David Chiang
NeurIPS 2025 | A Little Depth Goes a Long Way: The Expressive Power of Log-Depth Transformers | William Merrill, Ashish Sabharwal
NeurIPS 2025 | Critical Batch Size Revisited: A Simple Empirical Approach to Large-Batch Language Model Training | William Merrill, Shane Arora, Dirk Groeneveld, Hannaneh Hajishirzi
NeurIPS 2025 | Exact Expressive Power of Transformers with Padding | William Merrill, Ashish Sabharwal
NeurIPSW 2024 | A Little Depth Goes a Long Way: The Expressive Power of Log-Depth Transformers | William Merrill, Ashish Sabharwal
ICML 2024 | How Language Model Hallucinations Can Snowball | Muru Zhang, Ofir Press, William Merrill, Alisa Liu, Noah A. Smith
ICLR 2024 | The Expressive Power of Transformers with Chain of Thought | William Merrill, Ashish Sabharwal
ICML 2024 | The Illusion of State in State-Space Models | William Merrill, Jackson Petty, Ashish Sabharwal
NeurIPS 2023 | A Logic for Expressing Log-Precision Transformers | William Merrill, Ashish Sabharwal
ICLRW 2023 | A Tale of Two Circuits: Grokking as Competition of Sparse and Dense Subnetworks | William Merrill, Nikolaos Tsilivis, Aman Shukla
NeurIPSW 2023 | The Expressive Power of Transformers with Chain of Thought | William Merrill, Ashish Sabharwal
ICLR 2020 | On the Linguistic Capacity of Real-Time Counter Automata | William Merrill