ML Anthology
Wu, Junhong
5 publications
ICLR
2026
Emergent Hierarchical Reasoning in LLMs Through Reinforcement Learning
Haozhe Wang, Qixin Xu, Che Liu, Junhong Wu, Fangzhen Lin, Wenhu Chen
ICLR
2026
Enough Is as Good as a Feast: A Comprehensive Analysis of How Reinforcement Learning Mitigates Task Conflicts in LLMs
Zixuan Ren, Jinliang Lu, Junhong Wu, Yang Zhao, Dai Dai, Hua Wu, Haifeng Wang, Chengqing Zong
ICLR
2026
LLMs Are Single-Threaded Reasoners: Demystifying the Working Mechanism of Soft Thinking
Junhong Wu, Jinliang Lu, Zixuan Ren, Gangqiang Hu, Zhi Wu, Dai Dai, Hua Wu
ICLR
2025
Language Imbalance Driven Rewarding for Multilingual Self-Improving
Wen Yang, Junhong Wu, Chen Wang, Chengqing Zong, Jiajun Zhang
AAAI
2024
Double Buffers CEM-TD3: More Efficient Evolution and Richer Exploration
Sheng Zhu, Chun Shen, Shuai Lü, Junhong Wu, Daolong An