Wang, Zhangyang
319 publications
TMLR
2025
AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving
CVPR
2025
Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields
ICLR
2025
HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models
CPAL
2025
How Iterative Magnitude Pruning Discovers Local Receptive Fields in Fully Connected Neural Networks
NeurIPS
2025
Martian World Model: Controllable Video Synthesis with Physically Accurate 3D Reconstructions
NeurIPS
2025
REPA Works Until It Doesn’t: Early-Stopped, Holistic Alignment Supercharges Diffusion Training
ICML
2025
Rethinking Addressing in Language Models via Contextualized Equivariant Positional Encoding
AAAI
2025
Sparse Transfer Learning Accelerates and Enhances Certified Robustness: A Comprehensive Study
NeurIPS
2024
Read-ME: Refactorizing LLMs as Router-Decoupled Mixture of Experts with System Co-Design
NeurIPS
2024
AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-Wise Pruning of Large Language Models
CVPRW
2024
DGBD: Depth Guided Branched Diffusion for Comprehensive Controllability in Multi-View Generation
ICML
2024
Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression
ICLRW
2024
Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression
ECCV
2024
DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting
NeurIPSW
2024
Enhancing Generalization in Sparse Mixture of Experts Models: The Case for Increased Expert Activation in Compositional Tasks
NeurIPS
2024
Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding
ICML
2024
Get More with LESS: Synthesizing Recurrence with KV Cache Compression for Efficient LLM Inference
NeurIPSW
2024
On the Planning Abilities of OpenAI’s O1 Models: Feasibility, Optimality, and Generalizability
ICML
2024
Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity
ICLRW
2024
Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity
NeurIPSW
2024
Sparse Transfer Learning Accelerates and Enhances Certified Robustness: A Comprehensive Study
NeurIPS
2024
Training Dynamics of Transformers to Recognize Word Co-Occurrence via Gradient Flow Analysis
ECCV
2024
VersatileGaussian: Real-Time Neural Rendering for Versatile Tasks Using Gaussian Splatting
ICCV
2023
Enhancing NeRF Akin to Enhancing LLMs: Generalizable NeRF Transformer with Mixture-of-View-Experts
AAAI
2023
Federated Robustness Propagation: Sharing Adversarial Robustness in Heterogeneous Federated Learning
TMLR
2023
How Robust Is Your Fairness? Evaluating and Sustaining Fairness Under Unseen Distribution Shifts
ICML
2023
Instant Soup: Cheap Pruning Ensembles in a Single Pass Can Draw Lottery Tickets from Large Models
TMLR
2023
You Only Transfer What You Share: Intersection-Induced Graph Transfer Learning for Link Prediction
AutoML
2023
“No Free Lunch” in Neural Architectures? A Joint Analysis of Expressivity, Convergence, and Generalization
AISTATS
2022
VFDS: Variational Foresight Dynamic Selection in Bayesian Neural Networks for Efficient Human Activity Recognition
WACV
2022
Auto-X3D: Ultra-Efficient Video Understanding via Finer-Grained Neural Architecture Search
ICLR
2022
Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How
CVPR
2022
EI-CLIP: Entity-Aware Interventional Contrastive Learning for E-Commerce Cross-Modal Retrieval
AAAI
2022
Federated Dynamic Sparse Training: Computing Less, Communicating Less, yet Learning Better
ICLR
2022
Learning Pruning-Friendly Networks via Frank-Wolfe: One-Shot, Any-Sparsity, and No Retraining
NeurIPS
2022
M³ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-Task Learning with Model-Accelerator Co-Design
NeurIPS
2022
Randomized Channel Shuffling: Minimal-Overhead Backdoor Attack Detection Without Clean Datasets
WACV
2022
Sandwich Batch Normalization: A Drop-in Replacement for Feature Distribution Heterogeneity
NeurIPS
2022
Trap and Replace: Defending Backdoor Attacks by Trapping Them into an Easy-to-Replace Subnetwork
CVPR
2022
VideoINR: Learning Video Implicit Neural Representation for Continuous Space-Time Super-Resolution
NeurIPS
2021
Delayed Propagation Transformer: A Universal Computation Engine Towards Practical Control in Cyber-Physical Systems
ICLR
2021
Neural Architecture Search on ImageNet in Four GPU Hours: A Theoretically Inspired Perspective
NeurIPS
2021
You Are Caught Stealing My Winning Lottery Ticket! Making a Lottery Ticket Claim Its Ownership
WACV
2020
Calibrated Domain-Invariant Learning for Highly Generalizable Large Scale Re-Identification
CVPRW
2020
Focus Longer to See Better: Recursively Refined Attention for Fine-Grained Image Classification
NeurIPS
2020
FracTrain: Fractionally Squeezing Bit Savings Both Temporally and Spatially for Efficient DNN Training
NeurIPS
2020
Once-for-All Adversarial Training: In-Situ Tradeoff Between Robustness and Accuracy for Free
AISTATS
2020
Uncertainty Quantification for Deep Context-Aware Mobile Activity Recognition and Unknown Context Discovery
AISTATS
2019
Adaptive Activity Monitoring with Uncertainty Quantification in Switching Gaussian Process Models
ICCVW
2019
Cross-Modal Person Search: A Coarse-to-Fine Framework Using Bi-Directional Text-Image Matching
NeurIPS
2018
Theoretical Linear Convergence of Unfolded ISTA and Its Practical Weights and Thresholds