Lin, Zhouchen
162 publications
NeurIPS
2025
Improving Model Representation and Reducing KV Cache via Skip Connections with First Value Heads
NeurIPS
2025
Iterative Missing Data Imputation with Model Form Adaptation and Non-Missing Feature Supervision
ICML
2025
Low-Dimension-to-High-Dimension Generalization and Its Implications for Length Generalization
NeurIPS
2025
On the $O(\frac{\sqrt{d}}{K^{1/4}})$ Convergence Rate of AdamW Measured by $\ell_1$ Norm
ICLR
2024
Hebbian Learning Based Orthogonal Projection for Continual Learning of Spiking Neural Networks
NeurIPS
2023
Balance, Imbalance, and Rebalance: Understanding Robust Overfitting from a Minimax Game Perspective
ICLR
2023
Neural Collapse Inspired Feature-Classifier Alignment for Few-Shot Class-Incremental Learning
ACML
2023
Patch-Level Neighborhood Interpolation: A General and Effective Graph-Based Regularization Strategy
NeurIPSW
2023
Transformer-Based Large Language Models Are Not General Learners: A Universal Circuit Perspective
ICLR
2022
Chaos Is a Ladder: A New Theoretical Understanding of Contrastive Learning via Augmentation Overlap
NeurIPS
2021
Training Feedback Spiking Neural Networks by Implicit Differentiation on the Equilibrium State
NeurIPS
2018
SPIDER: Near-Optimal Non-Convex Optimization via Stochastic Path-Integrated Differential Estimator
AAAI
2016
Fast Proximal Linearized Alternating Direction Method of Multiplier with Parallel Splitting
CVPR
2015
A New Retraction for Accelerating the Riemannian Three-Factor Low-Rank Matrix Completion Algorithm
ECML-PKDD
2013
A Counterexample for the Validity of Using Nuclear Norm as a Convex Surrogate of Rank
NeurIPS
2011
Linearized Alternating Direction Method with Adaptive Penalty for Low-Rank Representation
NeurIPS
2009
Optimizing Multi-Class Spatio-Spectral Filters via Bayes Error Estimation for EEG Classification