Rule-Guided Language Model Alignment for Text Generation Management in Industrial Use Cases

Abstract

Recent advances in Large Language Models (LLMs) have shown significant success in various natural language tasks. However, when implementing LLMs in industry applications, they often need to follow domain-specific rules. Since these rules can be complex and numerous, it is often difficult to precisely identify which rule should be applied to the response. In this paper, we propose a simple yet effective method to address this issue, by taking the following two steps: (1) generate a dataset of rule-applied responses using simplified rule selection, and (2) train an LLM on this dataset. Since the rule selection is not designed to be perfect, the responses in the dataset do not always follow all the necessary rules. However, by training an LLM on this dataset, we expect the LLM to generalize over the rules and correctly identify the task-to-rule dependency. We demonstrated our method in the automotive repair domain, to make a repair recommendation LLM to follow safety rules. Our experimental results show that our approach improves LLM performance compared to solely applying the rules using the simplified rule selection. This suggests that our method could enhance the utility of LLMs in industry applications.

Cite

Text

Akatsuka et al. "Rule-Guided Language Model Alignment for Text Generation Management in Industrial Use Cases." NeurIPS 2024 Workshops: SafeGenAi, 2024.

Markdown

[Akatsuka et al. "Rule-Guided Language Model Alignment for Text Generation Management in Industrial Use Cases." NeurIPS 2024 Workshops: SafeGenAi, 2024.](https://mlanthology.org/neuripsw/2024/akatsuka2024neuripsw-ruleguided/)

BibTeX

@inproceedings{akatsuka2024neuripsw-ruleguided,
  title     = {{Rule-Guided Language Model Alignment for Text Generation Management in Industrial Use Cases}},
  author    = {Akatsuka, Shunichi and Kumar, Aman and Lee, Xian Yeow and Vidyaratne, Lasitha and Ghosh, Dipanjan Dipak and Farahat, Ahmed K.},
  booktitle = {NeurIPS 2024 Workshops: SafeGenAi},
  year      = {2024},
  url       = {https://mlanthology.org/neuripsw/2024/akatsuka2024neuripsw-ruleguided/}
}