Chen, Jianfei
47 publications
ICLR
2025
On the Optimization and Generalization of Two-Layer Transformers with Sign Gradient Descent
ICML
2025
SageAttention2: Efficient Attention with Thorough Outlier Smoothing and Per-Thread INT4 Quantization
NeurIPS
2025
SageAttention3: Microscaling FP4 Attention for Inference and an Exploration of 8-Bit Training
ICML
2025
SpargeAttention: Accurate and Training-Free Sparse Attention Accelerating Any Model Inference
ICML
2025
Sparse Video-Gen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity
NeurIPS
2025
Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation
NeurIPS
2022
DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps