ML Anthology
Merrill, William
12 publications
TMLR
2026
The Transformer Cookbook
Andy Yang, Christopher Watson, Anton Xue, Satwik Bhattamishra, Jose Llarena, William Merrill, Emile Dos Santos Ferreira, Anej Svete, David Chiang
NeurIPS
2025
A Little Depth Goes a Long Way: The Expressive Power of Log-Depth Transformers
William Merrill, Ashish Sabharwal
NeurIPS
2025
Critical Batch Size Revisited: A Simple Empirical Approach to Large-Batch Language Model Training
William Merrill, Shane Arora, Dirk Groeneveld, Hannaneh Hajishirzi
NeurIPS
2025
Exact Expressive Power of Transformers with Padding
William Merrill, Ashish Sabharwal
NeurIPSW
2024
A Little Depth Goes a Long Way: The Expressive Power of Log-Depth Transformers
William Merrill, Ashish Sabharwal
ICML
2024
How Language Model Hallucinations Can Snowball
Muru Zhang, Ofir Press, William Merrill, Alisa Liu, Noah A. Smith
ICLR
2024
The Expressive Power of Transformers with Chain of Thought
William Merrill, Ashish Sabharwal
ICML
2024
The Illusion of State in State-Space Models
William Merrill, Jackson Petty, Ashish Sabharwal
NeurIPS
2023
A Logic for Expressing Log-Precision Transformers
William Merrill, Ashish Sabharwal
ICLRW
2023
A Tale of Two Circuits: Grokking as Competition of Sparse and Dense Subnetworks
William Merrill, Nikolaos Tsilivis, Aman Shukla
NeurIPSW
2023
The Expressive Power of Transformers with Chain of Thought
William Merrill, Ashish Sabharwal
ICLR
2020
On the Linguistic Capacity of Real-Time Counter Automata
William Merrill