Härle, Ruben

3 publications

ICLR 2026 ActivationReasoning: Logical Reasoning in Latent Activation Spaces Lukas Helff, Ruben Härle, Wolfgang Stammer, Felix Friedrich, Manuel Brack, Antonia Wüst, Hikaru Shindo, Patrick Schramowski, Kristian Kersting
NeurIPS 2025 Measuring and Guiding Monosemanticity Ruben Härle, Felix Friedrich, Manuel Brack, Björn Deiseroth, Stephan Waeldchen, Patrick Schramowski, Kristian Kersting
NeurIPSW 2024 SCAR: Sparse Conditioned Autoencoders for Concept Detection and Steering in LLMs Ruben Härle, Felix Friedrich, Manuel Brack, Björn Deiseroth, Patrick Schramowski, Kristian Kersting