Causal-Steer: Disentangled Continuous Style Control Without Parallel Corpora

Wang, Qingsong; Yao, Chang; Chen, Jingyuan

Causal-Steer: Disentangled Continuous Style Control Without Parallel Corpora

ICLR 2026

/iclr/2026/wang2026iclr-causalsteer/

Abstract

Controlling stylistic attributes of Large Language Models (LLMs), such as formality or conceptual complexity, is crucial for effective human-AI interaction. However, current methods often suffer from discreteness, reliance on expensive parallel corpora, and instability, limiting their practical utility. This paper introduces a novel framework for robust activation steering that eliminates the need for parallel corpora, enabling continuous, fine-grained, and linear control over LLM outputs. Our key insight is to reframe Low-Rank Adaptation (LoRA) as a causal intervention tool. By contrasting activations on identical inputs with and without a LoRA perturbation trained via a contrastive objective, we separate the influence of content. To enhance reliability, we introduce a robust aggregation pipeline that uses Principal Component Analysis (PCA) for denoising and the geometric median for centrality estimation, yielding a stable and disentangled style vector. At inference, this vector allows for precise bidirectional control via activation steering with negligible computational overhead. We demonstrate state-of-the-art performance on controlling conceptual complexity, text detoxification, and formality control. Our method not only provides superior control but also generalizes across different models and tasks, and enables simultaneous multi-attribute control.

PDF ICLR OpenReview Semantic Scholar

Cite

Text

Wang et al. "Causal-Steer: Disentangled Continuous Style Control Without Parallel Corpora." International Conference on Learning Representations, 2026.

Markdown

[Wang et al. "Causal-Steer: Disentangled Continuous Style Control Without Parallel Corpora." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/wang2026iclr-causalsteer/)

BibTeX

@inproceedings{wang2026iclr-causalsteer,
  title     = {{Causal-Steer: Disentangled Continuous Style Control Without Parallel Corpora}},
  author    = {Wang, Qingsong and Yao, Chang and Chen, Jingyuan},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/wang2026iclr-causalsteer/}
}