Huang, Jiaxing
37 publications
NeurIPS
2025
Mulberry: Empowering MLLM with O1-like Reasoning and Reflection via Collective Monte Carlo Tree Search
NeurIPS
2025
Panacea: Mitigating Harmful Fine-Tuning for Large Language Models via Post-Fine-Tuning Perturbation
NeurIPS
2025
R1-ShareVL: Incentivizing Reasoning Capabilities of Multimodal Large Language Models via Share-GRPO
NeurIPS
2025
SIGMA: Refining Large Language Model Reasoning via Sibling-Guided Monte Carlo Augmentation
CVPR
2023
3D Semantic Segmentation in the Wild: Learning Generalized Models for Adverse-Condition Point Clouds