Wang, Haoxu

1 publications

NeurIPS 2025 SageAttention3: Microscaling FP4 Attention for Inference and an Exploration of 8-Bit Training Jintao Zhang, Jia Wei, Haoxu Wang, Pengle Zhang, Xiaoming Xu, Haofeng Huang, Kai Jiang, Jianfei Chen, Jun Zhu