Carlos Trujillo, Juan

1 publications

AISTATS 2024 How Does GPT-2 Predict Acronyms? Extracting and Understanding a Circuit via Mechanistic Interpretability Jorge García-Carrasco, Alejandro Maté, Juan Carlos Trujillo