Semantic Voting: A Self-Evaluation-Free Approach for Efficient LLM Self-Improvement on Unverifiable Open-Ended Tasks

Jiang, Chunyang; Zhang, Yonggang; Cai, Yiyang; Chan, Chi-Min; Liu, Yulong; Chen, Mingming; Xue, Wei; Guo, Yike

Semantic Voting: A Self-Evaluation-Free Approach for Efficient LLM Self-Improvement on Unverifiable Open-Ended Tasks

Chunyang Jiang, Yonggang Zhang, Yiyang Cai, Chi-Min Chan, Yulong Liu, Mingming Chen, Wei Xue, Yike Guo

ICLR 2026

/iclr/2026/jiang2026iclr-semantic/

Abstract

The rising cost of acquiring supervised data has driven significant interest in self-improvement for large language models (LLMs). Straightforward unsupervised signals like majority voting have proven effective in generating pseudo-labels for verifiable tasks, while their applicability to unverifiable tasks (e.g., translation) is limited by the open-ended character of responses. As a result, self-evaluation mechanisms (e.g., self-judging and entropy minimization) are predominantly used to derive pseudo-labels. However, self-evaluation relying on LLMs typically incurs high computational overhead and introduces overconfidence issues due to intrinsic biases. To address these challenges, we propose a novel self-evaluation-free approach for unverifiable tasks, designed for lightweight yet effective self-improvement. Inspired by majority voting commonly employed in verifiable tasks, we propose semantic voting as a novel mechanism that relaxes the principle of hard matching (i.e., exact matching) toward soft matching (i.e., semantic similarity). Soft matching is achieved by leveraging a lightweight sentence embedding model to quantify semantic similarity, thereby mitigating excessive computational burden and intrinsic bias-associated limitations of self-evaluation. Comprehensive experiments demonstrate that our method achieves substantial gains in computational efficiency and overall better performance than self-evaluation methods across diverse model architectures and tasks.

PDF ICLR OpenReview Semantic Scholar

Cite

Text

Jiang et al. "Semantic Voting: A Self-Evaluation-Free Approach for Efficient LLM Self-Improvement on Unverifiable Open-Ended Tasks." International Conference on Learning Representations, 2026.

Markdown

[Jiang et al. "Semantic Voting: A Self-Evaluation-Free Approach for Efficient LLM Self-Improvement on Unverifiable Open-Ended Tasks." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/jiang2026iclr-semantic/)

BibTeX

@inproceedings{jiang2026iclr-semantic,
  title     = {{Semantic Voting: A Self-Evaluation-Free Approach for Efficient LLM Self-Improvement on Unverifiable Open-Ended Tasks}},
  author    = {Jiang, Chunyang and Zhang, Yonggang and Cai, Yiyang and Chan, Chi-Min and Liu, Yulong and Chen, Mingming and Xue, Wei and Guo, Yike},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/jiang2026iclr-semantic/}
}