Tuning-Free Accountable Intervention for LLM Deployment - A Metacognitive Approach

Tan, Zhen; Peng, Jie; Wang, Song; Hu, Lijie; Chen, Tianlong; Liu, Huan

doi:10.1609/AAAI.V39I24.34710

AAAI 2025 pp. 25237-25245

Abstract

Large Language Models (LLMs) have brought significant advances across various NLP tasks through few-shot or zero-shot prompting, bypassing the need for parameter tuning. However, their massive parameter counts make them "black boxes", which heightens concerns about "hallucination", especially in high-stakes applications (e.g., healthcare) where mistaken decisions can have severe consequences. In contrast, human decision-making relies on complex cognitive processes, such as the ability to sense mistakes and adaptively correct them through conceptual understanding. Drawing inspiration from human cognition, we propose CLEAR, a metacognitive approach that equips LLMs with capabilities for self-aware error identification and correction. Our framework constructs concept-specific sparse subnetworks that indicate the model's decision process, providing a novel interface for model intervention after deployment. The benefits include: (i) at inference time, our metacognitive LLMs can identify potential mispredictions on their own with minimal human involvement, (ii) the model can self-correct its errors efficiently without additional tuning, and (iii) the correction procedure is both self-explanatory and user-friendly, enhancing model interpretability and accessibility. With these metacognitive features, our approach opens a new path toward trustworthy LLMs.

Cite

Text

Tan et al. "Tuning-Free Accountable Intervention for LLM Deployment - A Metacognitive Approach." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I24.34710

Markdown

[Tan et al. "Tuning-Free Accountable Intervention for LLM Deployment - A Metacognitive Approach." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/tan2025aaai-tuning/) doi:10.1609/AAAI.V39I24.34710

BibTeX

@inproceedings{tan2025aaai-tuning,
  title     = {{Tuning-Free Accountable Intervention for LLM Deployment - A Metacognitive Approach}},
  author    = {Tan, Zhen and Peng, Jie and Wang, Song and Hu, Lijie and Chen, Tianlong and Liu, Huan},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {25237--25245},
  doi       = {10.1609/AAAI.V39I24.34710},
  url       = {https://mlanthology.org/aaai/2025/tan2025aaai-tuning/}
}