Zhu, Jun
299 publications
NeurIPS
2025
A Regularized Newton Method for Nonconvex Optimization with Global and Local Complexity Guarantees
ICCV
2025
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Decoupled Video Diffusion
ICLR
2025
On the Optimization and Generalization of Two-Layer Transformers with Sign Gradient Descent
ICML
2025
SageAttention2: Efficient Attention with Thorough Outlier Smoothing and Per-Thread INT4 Quantization
NeurIPS
2025
SageAttention3: Microscaling FP4 Attention for Inference and an Exploration of 8-Bit Training
ICML
2025
SpargeAttention: Accurate and Training-Free Sparse Attention Accelerating Any Model Inference
ICML
2024
Efficient Black-Box Adversarial Attacks via Bayesian Optimization Guided by a Function Prior
NeurIPS
2024
MultiTrust: A Comprehensive Benchmark Towards Trustworthy Multimodal Large Language Models
NeurIPS
2024
On Mesa-Optimization in Autoregressively Trained Transformers: Emergence and Capability
NeurIPS
2024
PINNacle: A Comprehensive Benchmark of Physics-Informed Neural Networks for Solving PDEs
NeurIPS
2024
Vidu4D: Single Generated Video to High-Fidelity 4D Reconstruction with Dynamic Gaussian Surfels
NeurIPS
2023
Hierarchical Decomposition of Prompt-Based Continual Learning: Rethinking Obscured Sub-Optimality
CVPRW
2023
Learning CLIP Guided Visual-Text Fusion Transformer for Video-Based Pedestrian Attribute Recognition
NeurIPS
2023
Overcoming Recency Bias of Normalization Statistics in Continual Learning: Balance and Adaptation
NeurIPS
2023
ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation
ICLR
2022
Analytic-DPM: An Analytic Estimate of the Optimal Reverse Variance in Diffusion Probabilistic Models
CVPR
2022
AutoLoss-GMS: Searching Generalized Margin-Based SoftMax Loss Function for Person Re-Identification
ECCV
2022
Boosting Transferability of Targeted Adversarial Examples via Hierarchical Generative Networks
NeurIPS
2022
DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps
NeurIPS
2022
EGSDE: Unpaired Image-to-Image Translation via Energy-Guided Stochastic Differential Equations
ICML
2022
GSmooth: Certified Robustness Against Semantic Transformations via Generalized Randomized Smoothing
ICML
2022
Maximum Likelihood Training for Score-Based Diffusion ODEs by High Order Denoising Score Matching
CVPR
2022
OoD-Bench: Quantifying and Understanding Two Dimensions of Out-of-Distribution Generalization
NeurIPSW
2022
Physics-Guided Discovery of Highly Nonlinear Parametric Partial Differential Equations
CVPR
2021
Improving Transferability of Adversarial Patches on Face Recognition with Generative Models
NeurIPS
2021
Rethinking and Reweighting the Univariate Losses for Multi-Label Ranking: Consistency and Generalization
ICML
2021
Variational (Gradient) Estimate of the Score Function in Energy-Based Latent Variable Models
ECCV
2020
Defense Against Adversarial Attacks via Controlling Gradient Leaking on Embedded Manifolds
ICLR
2020
Lazy-CFR: Fast and Near-Optimal Regret Minimization for Extensive Games with Imperfect Information
NeurIPS
2020
Multi-Label Classification: Do Hamming Loss and Subset Accuracy Really Conflict with Each Other?
ECCV
2020
Training Interpretable Convolutional Neural Networks by Differentiating Class-Specific Filters
NeurIPSW
2020
Variational (Gradient) Estimate of the Score Function in Energy-Based Latent Variable Models
IJCAI
2015
Modelling High-Dimensional Sequences with LSTM-RTRBM: Application to Polyphonic Music Generation