Song, Shiji
51 publications
NeurIPS
2025
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning
NeurIPS
2025
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?
CVPR
2025
Everything to the Synthetic: Diffusion-Driven Test-Time Adaptation via Synthetic-Domain Alignment
NeurIPS
2024
DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution
NeurIPS
2023
Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning
CVPR
2022
Exploring the Equivalence of Siamese Self-Supervised Learning via a Unified Gradient Framework
NeurIPS
2021
Not All Images Are Worth 16x16 Words: Dynamic Transformers for Efficient Image Recognition
NeurIPS
2020
Glance and Focus: A Dynamic Approach to Reducing Spatial Redundancy in Image Classification