RLOP: A Framework for Reinforcement Learning, Optimization and Planning Algorithms
Abstract
Fake news and misinformation poses a significant threat to society, making efficient mitigation essential. However, manual fact-checking is costly and lacks scalability. Large Language Models (LLMs) offer promise in automating counter-response generation to mitigate misinformation, but a critical challenge lies in their tendency to hallucinate non-factual information. Existing models mainly rely on LLM self-feedback to reduce hallucination, but this approach is computationally expensive. In this paper, we propose MisMitiFact, Misinformation Mitigation grounded in Facts, an efficient framework for generating fact-grounded counter-responses at scale. MisMitiFact generates simple critique feedback to refine LLM outputs, ensuring responses are grounded in evidence. We develop lightweight, fine-grained critique models trained on data sourced from readily available fact-checking sites to identify and correct errors in key elements such as numerals, entities, and topics in LLM generations. Experiments show that MisMitiFact generates counter-responses of comparable quality to LLMs' self-feedback while using significantly smaller critique models. Importantly, it achieves ~5x increase in feedback generation throughput, making it highly suitable for cost-effective, large-scale misinformation mitigation. Code and additional results are available at https://github.com/xxfwin/MisMitiFact.
Cite
Text
Zhang. "RLOP: A Framework for Reinforcement Learning, Optimization and Planning Algorithms." International Joint Conference on Artificial Intelligence, 2024. doi:10.24963/ijcai.2024/1047Markdown
[Zhang. "RLOP: A Framework for Reinforcement Learning, Optimization and Planning Algorithms." International Joint Conference on Artificial Intelligence, 2024.](https://mlanthology.org/ijcai/2024/zhang2024ijcai-rlop/) doi:10.24963/ijcai.2024/1047BibTeX
@inproceedings{zhang2024ijcai-rlop,
title = {{RLOP: A Framework for Reinforcement Learning, Optimization and Planning Algorithms}},
author = {Zhang, Song},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2024},
pages = {8851-8854},
doi = {10.24963/ijcai.2024/1047},
url = {https://mlanthology.org/ijcai/2024/zhang2024ijcai-rlop/}
}