ML Anthology
Härle, Ruben
3 publications
ICLR 2026
ActivationReasoning: Logical Reasoning in Latent Activation Spaces
Lukas Helff, Ruben Härle, Wolfgang Stammer, Felix Friedrich, Manuel Brack, Antonia Wüst, Hikaru Shindo, Patrick Schramowski, Kristian Kersting
NeurIPS 2025
Measuring and Guiding Monosemanticity
Ruben Härle, Felix Friedrich, Manuel Brack, Björn Deiseroth, Stephan Waeldchen, Patrick Schramowski, Kristian Kersting
NeurIPSW 2024
SCAR: Sparse Conditioned Autoencoders for Concept Detection and Steering in LLMs
Ruben Härle, Felix Friedrich, Manuel Brack, Björn Deiseroth, Patrick Schramowski, Kristian Kersting