Härle, Ruben

2 publications

NeurIPS 2025 Measuring and Guiding Monosemanticity Ruben Härle, Felix Friedrich, Manuel Brack, Björn Deiseroth, Stephan Waeldchen, Patrick Schramowski, Kristian Kersting
NeurIPSW 2024 SCAR: Sparse Conditioned Autoencoders for Concept Detection and Steering in LLMs Ruben Härle, Felix Friedrich, Manuel Brack, Björn Deiseroth, Patrick Schramowski, Kristian Kersting