Zhang, Yiming
34 publications
NeurIPS
2025
InfiGFusion: Graph-on-Logits Distillation via Efficient Gromov-Wasserstein for Model Fusion
NeurIPS
2025
Towards Large-Scale In-Context Reinforcement Learning by Meta-Training in Randomized Worlds
NeurIPSW
2022
Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance