RLOP: A Framework for Reinforcement Learning, Optimization and Planning Algorithms

Zhang, Song

doi:10.24963/ijcai.2024/1047

RLOP: A Framework for Reinforcement Learning, Optimization and Planning Algorithms

Song Zhang

IJCAI 2024 pp. 8851-8854

doi:10.24963/ijcai.2024/1047 /ijcai/2024/zhang2024ijcai-rlop/

Abstract

Fake news and misinformation poses a significant threat to society, making efficient mitigation essential. However, manual fact-checking is costly and lacks scalability. Large Language Models (LLMs) offer promise in automating counter-response generation to mitigate misinformation, but a critical challenge lies in their tendency to hallucinate non-factual information. Existing models mainly rely on LLM self-feedback to reduce hallucination, but this approach is computationally expensive. In this paper, we propose MisMitiFact, Misinformation Mitigation grounded in Facts, an efficient framework for generating fact-grounded counter-responses at scale. MisMitiFact generates simple critique feedback to refine LLM outputs, ensuring responses are grounded in evidence. We develop lightweight, fine-grained critique models trained on data sourced from readily available fact-checking sites to identify and correct errors in key elements such as numerals, entities, and topics in LLM generations. Experiments show that MisMitiFact generates counter-responses of comparable quality to LLMs' self-feedback while using significantly smaller critique models. Importantly, it achieves ~5x increase in feedback generation throughput, making it highly suitable for cost-effective, large-scale misinformation mitigation. Code and additional results are available at https://github.com/xxfwin/MisMitiFact.

PDF IJCAI Semantic Scholar

Cite

Text

Zhang. "RLOP: A Framework for Reinforcement Learning, Optimization and Planning Algorithms." International Joint Conference on Artificial Intelligence, 2024. doi:10.24963/ijcai.2024/1047

Markdown

[Zhang. "RLOP: A Framework for Reinforcement Learning, Optimization and Planning Algorithms." International Joint Conference on Artificial Intelligence, 2024.](https://mlanthology.org/ijcai/2024/zhang2024ijcai-rlop/) doi:10.24963/ijcai.2024/1047

BibTeX

@inproceedings{zhang2024ijcai-rlop,
  title     = {{RLOP: A Framework for Reinforcement Learning, Optimization and Planning Algorithms}},
  author    = {Zhang, Song},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {8851-8854},
  doi       = {10.24963/ijcai.2024/1047},
  url       = {https://mlanthology.org/ijcai/2024/zhang2024ijcai-rlop/}
}