Lee, Daniel J

1 publications

NeurIPSW 2024 Investigating Sensitive Directions in GPT-2: An Improved Baseline and Comparative Analysis of SAEs Daniel J Lee, Stefan Heimersheim