Merrill, William

12 publications

TMLR 2026 The Transformer Cookbook Andy Yang, Christopher Watson, Anton Xue, Satwik Bhattamishra, Jose Llarena, William Merrill, Emile Dos Santos Ferreira, Anej Svete, David Chiang

NeurIPS 2025 A Little Depth Goes a Long Way: The Expressive Power of Log-Depth Transformers William Merrill, Ashish Sabharwal

NeurIPS 2025 Critical Batch Size Revisited: A Simple Empirical Approach to Large-Batch Language Model Training William Merrill, Shane Arora, Dirk Groeneveld, Hannaneh Hajishirzi

NeurIPS 2025 Exact Expressive Power of Transformers with Padding William Merrill, Ashish Sabharwal

NeurIPSW 2024 A Little Depth Goes a Long Way: The Expressive Power of Log-Depth Transformers William Merrill, Ashish Sabharwal

ICML 2024 How Language Model Hallucinations Can Snowball Muru Zhang, Ofir Press, William Merrill, Alisa Liu, Noah A. Smith

ICLR 2024 The Expressive Power of Transformers with Chain of Thought William Merrill, Ashish Sabharwal

ICML 2024 The Illusion of State in State-Space Models William Merrill, Jackson Petty, Ashish Sabharwal

NeurIPS 2023 A Logic for Expressing Log-Precision Transformers William Merrill, Ashish Sabharwal

ICLRW 2023 A Tale of Two Circuits: Grokking as Competition of Sparse and Dense Subnetworks William Merrill, Nikolaos Tsilivis, Aman Shukla

NeurIPSW 2023 The Expressive Power of Transformers with Chain of Thought William Merrill, Ashish Sabharwal

ICLR 2020 On the Linguistic Capacity of Real-Time Counter Automata William Merrill