Causal-Steer: Disentangled Continuous Style Control Without Parallel Corpora
Abstract
Controlling stylistic attributes of Large Language Models (LLMs), such as formality or conceptual complexity, is crucial for effective human-AI interaction. However, current methods often suffer from discreteness, reliance on expensive parallel corpora, and instability, limiting their practical utility. This paper introduces a novel framework for robust activation steering that eliminates the need for parallel corpora, enabling continuous, fine-grained, and linear control over LLM outputs. Our key insight is to reframe Low-Rank Adaptation (LoRA) as a causal intervention tool. By contrasting activations on identical inputs with and without a LoRA perturbation trained via a contrastive objective, we separate the influence of content. To enhance reliability, we introduce a robust aggregation pipeline that uses Principal Component Analysis (PCA) for denoising and the geometric median for centrality estimation, yielding a stable and disentangled style vector. At inference, this vector allows for precise bidirectional control via activation steering with negligible computational overhead. We demonstrate state-of-the-art performance on controlling conceptual complexity, text detoxification, and formality control. Our method not only provides superior control but also generalizes across different models and tasks, and enables simultaneous multi-attribute control.
Cite
Text
Wang et al. "Causal-Steer: Disentangled Continuous Style Control Without Parallel Corpora." International Conference on Learning Representations, 2026.Markdown
[Wang et al. "Causal-Steer: Disentangled Continuous Style Control Without Parallel Corpora." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/wang2026iclr-causalsteer/)BibTeX
@inproceedings{wang2026iclr-causalsteer,
title = {{Causal-Steer: Disentangled Continuous Style Control Without Parallel Corpora}},
author = {Wang, Qingsong and Yao, Chang and Chen, Jingyuan},
booktitle = {International Conference on Learning Representations},
year = {2026},
url = {https://mlanthology.org/iclr/2026/wang2026iclr-causalsteer/}
}