XCOT: Cross-Lingual Instruction Tuning for Cross-Lingual Chain-of-Thought Reasoning

Chai, Linzheng; Yang, Jian; Sun, Tao; Guo, Hongcheng; Liu, Jiaheng; Wang, Bing; Liang, Xinnian; Bai, Jiaqi; Li, Tongliang; Peng, Qiyao; Li, Zhoujun

doi:10.1609/AAAI.V39I22.34524

XCOT: Cross-Lingual Instruction Tuning for Cross-Lingual Chain-of-Thought Reasoning

Linzheng Chai, Jian Yang, Tao Sun, Hongcheng Guo, Jiaheng Liu, Bing Wang, Xinnian Liang, Jiaqi Bai, Tongliang Li, Qiyao Peng, Zhoujun Li

AAAI 2025 pp. 23550-23558

doi:10.1609/AAAI.V39I22.34524 /aaai/2025/chai2025aaai-xcot/

Abstract

Chain-of-thought (CoT) has emerged as a powerful technique to elicit reasoning in large language models and improve a variety of downstream tasks. CoT mainly demonstrates excellent performance in English, but its usage in low-resource languages is constrained due to poor language generalization. To bridge the gap among different languages, we propose a cross-lingual instruction fine-tuning framework (xCoT) to transfer knowledge from high-resource languages to low-resource languages. Specifically, the multilingual instruction training data (xCoT-Instruct) is created to encourage the semantic alignment of multiple languages. We introduce cross-lingual in-context few-shot learning (xICL) to accelerate multilingual agreement in instruction tuning, where some fragments of source languages in examples are randomly substituted by their counterpart translations of target languages. During multilingual instruction tuning, we adopt the randomly online CoT strategy to enhance the multilingual reasoning ability of the large language model by first translating the query to another language and then answering in English. To further facilitate the language transfer, we leverage the high-resource CoT to supervise the training of low-resource languages with cross-lingual distillation. Experimental results demonstrate the superior performance of xCoT in reducing the gap among different languages, highlighting its potential to reduce the cross-lingual gap.

PDF AAAI Semantic Scholar

Cite

Text

Chai et al. "XCOT: Cross-Lingual Instruction Tuning for Cross-Lingual Chain-of-Thought Reasoning." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I22.34524

Markdown

[Chai et al. "XCOT: Cross-Lingual Instruction Tuning for Cross-Lingual Chain-of-Thought Reasoning." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/chai2025aaai-xcot/) doi:10.1609/AAAI.V39I22.34524

BibTeX

@inproceedings{chai2025aaai-xcot,
  title     = {{XCOT: Cross-Lingual Instruction Tuning for Cross-Lingual Chain-of-Thought Reasoning}},
  author    = {Chai, Linzheng and Yang, Jian and Sun, Tao and Guo, Hongcheng and Liu, Jiaheng and Wang, Bing and Liang, Xinnian and Bai, Jiaqi and Li, Tongliang and Peng, Qiyao and Li, Zhoujun},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {23550-23558},
  doi       = {10.1609/AAAI.V39I22.34524},
  url       = {https://mlanthology.org/aaai/2025/chai2025aaai-xcot/}
}