Foote, Alex

2 publications

ICMLW 2024 | Tackling Polysemanticity with Neuron Embeddings | Alex Foote
ICLRW 2023 | N2G: A Scalable Approach for Quantifying Interpretable Neuron Representation in LLMs | Alex Foote, Neel Nanda, Esben Kran, Ioannis Konstas, Fazl Barez