Xu, Shiyun

10 publications

ICLR 2026 Convex Dominance in Deep Learning I: A Scaling Law of Loss and Learning Rate Zhiqi Bu, Shiyun Xu, Jialin Mao
ICLR 2026 FlowNIB: An Information Bottleneck Analysis of Bidirectional vs. Unidirectional Language Models Md Kowsher, Nusrat Jahan Prottasha, Shiyun Xu, Shetu Mohanto, Niloofar Yousefi, Ozlem Garibay, Chen Chen
ICLR 2025 Gradient Descent with Generalized Newton’s Method Zhiqi Bu, Shiyun Xu
TMLR 2024 Accelerated Deep Active Learning with Graph-Based Sub-Sampling Dan Kushnir, Shiyun Xu
ECML-PKDD 2023 Sparse Neural Additive Model: Interpretable Deep Learning with Feature Selection via Group Sparsity Shiyun Xu, Zhiqi Bu, Pratik Chaudhari, Ian J. Barnett
NeurIPS 2022 Scalable and Efficient Training of Large Convolutional Neural Networks with Differential Privacy Zhiqi Bu, Jialin Mao, Shiyun Xu
ICLRW 2022 Sparse Neural Additive Model: Interpretable Deep Learning with Feature Selection via Group Sparsity Shiyun Xu, Zhiqi Bu, Pratik Chaudhari, Ian J. Barnett
AISTATS 2021 A Dynamical View on Optimization Algorithms of Overparameterized Neural Networks Zhiqi Bu, Shiyun Xu, Kan Chen
AISTATS 2021 DebiNet: Debiasing Linear Models with Nonlinear Overparameterized Neural Networks Shiyun Xu, Zhiqi Bu
ECML-PKDD 2021 Asymptotic Statistical Analysis of Sparse Group LASSO via Approximate Message Passing Kan Chen, Zhiqi Bu, Shiyun Xu