Song, Mingli
126 publications
ICLR
2026
Incentivizing LLM Reasoning via Reinforcement Learning with Functional Monte Carlo Tree Search
ICML
2025
Assessing Safety Risks and Quantization-Aware Safety Patching for Quantized Large Language Models
NeurIPS
2025
Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning
AAAI
2025
Disentangled Table-Graph Representation for Interpretable Transmission Line Fault Location
NeurIPS
2024
A2PO: Towards Effective Offline Reinforcement Learning from an Advantage-Aware Perspective
NeurIPS
2024
Can Graph Neural Networks Expose Training Data Properties? an Efficient Risk Assessment Approach
ICCV
2023
Evaluation and Improvement of Interpretability for Self-Explainable Part-Prototype Networks
ECCV
2022
Hierarchical Semi-Supervised Contrastive Learning for Contamination-Resistant Anomaly Detection
AAAI
2022
Model Doctor: A Simple Gradient Aggregation Strategy for Diagnosing and Treating CNN Classifiers
IJCAI
2019
Amalgamating Filtered Knowledge: Learning Task-Customized Student from Multi-Task Teachers