Zhang, Tong
309 publications
JMLR
2025
Optimal Sample Selection Through Uncertainty Estimation and Its Application in Deep Learning
NeurIPS
2025
Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL
ICCV
2025
TimeBooth: Disentangled Facial Invariant Representation for Diverse and Personalized Face Aging
NeurIPS
2024
Image Textualization: An Automatic Framework for Generating Rich and Detailed Image Descriptions
CVPR
2024
InNeRF360: Text-Guided 3D-Consistent Object Inpainting on 360-Degree Neural Radiance Fields
NeurIPS
2024
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning
NeurIPS
2024
Online Iterative Reinforcement Learning from Human Feedback with General Preference Model
CoRL
2024
Reinforcement Learning with Foundation Priors: Let Embodied Agent Efficiently Learn on Its Own
AAAI
2024
TagFog: Textual Anchor Guidance and Fake Outlier Generation for Visual Out-of-Distribution Detection
NeurIPS
2023
A Theoretical Analysis of Optimistic Proximal Policy Optimization in Linear Markov Decision Processes
AISTATS
2023
Catalyst Acceleration of Error Compensated Methods Leads to Better Communication Complexity
NeurIPS
2023
Double Randomized Underdamped Langevin with Dimension-Independent Convergence Guarantee
ICCV
2023
NEMTO: Neural Environment Matting for Novel View and Relighting Synthesis of Transparent Objects
NeurIPSW
2022
A Neural Tangent Kernel Perspective on Function-Space Regularization in Neural Networks
NeurIPSW
2022
Benefits of Overparameterized Convolutional Residual Networks: Function Approximation Under Smoothness Constraint
NeurIPS
2022
Model-Based RL with Optimistic Posterior Sampling: Structural Conditions and Sample Complexity
ICML
2022
Pessimistic Minimax Value Iteration: Provably Efficient Equilibrium Learning from Offline Datasets
ICLRW
2022
Pessimistic Minimax Value Iteration: Provably Efficient Equilibrium Learning from Offline Datasets
NeurIPS
2021
A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning
COLT
2021
Modeling from Features: A Mean-Field Framework for Over-Parameterized Deep Neural Networks
NeurIPS
2020
Bridging the Gap Between Sample-Based and One-Shot Neural Architecture Search with BONAS
NeurIPS
2020
Stochastic Recursive Gradient Descent Ascent for Stochastic Nonconvex-Strongly-Concave Minimax Problems
ICML
2019
DoubleSqueeze: Parallel Stochastic Gradient Descent with Double-Pass Error-Compensated Compression
JMLR
2019
Utilizing Second Order Information in Minibatch Stochastic Variance Reduced Proximal Iterations
IJCAI
2018
A Novel Neural Network Model Based on Cerebral Hemispheric Asymmetry for EEG Emotion Recognition
ICML
2018
An Algorithmic Framework of Variable Metric Over-Relaxed Hybrid Proximal Extra-Gradient Method
ICML
2018
Error Compensated Quantized SGD and Its Applications to Large-Scale Distributed Optimization
NeurIPS
2018
SPIDER: Near-Optimal Non-Convex Optimization via Stochastic Path-Integrated Differential Estimator
NeurIPS
2018
Stochastic Primal-Dual Method for Empirical Risk Minimization with O(1) Per-Iteration Complexity
ECCV
2018
Unsupervised Image-to-Image Translation with Stacked Cycle-Consistent Adversarial Networks
JMLR
2017
A General Distributed Dual Coordinate Optimization Framework for Regularized Loss Minimization
NeurIPS
2017
Diffusion Approximations for Online Principal Component Estimation and Global Convergence
NeurIPS
2017
Efficient Optimization for Linear Dynamical Systems with Applications to Clustering and Sparse Coding
NeurIPS
2016
Learning Additive Exponential Family Graphical Models via $\ell_{2,1}$-Norm Regularized M-Estimation
NeurIPS
2015
Semi-Supervised Convolutional Neural Networks for Text Categorization via Region Embedding
NeurIPS
2007
A General Boosting Method and Its Application to Learning Ranking Functions for Web Search
NeurIPS
2004
Class-Size Independent Generalization Analysis of Some Discriminative Multi-Category Classification
ICML
2004
Solving Large Scale Linear Prediction Problems Using Stochastic Gradient Descent Algorithms