Thurnherr, Hannes

1 publication

ICMLW 2024. TracrBench: Generating Interpretability Testbeds with Large Language Models. Hannes Thurnherr, Jérémy Scheurer