Ji, Ziwei

22 publications

NeurIPS 2025 Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation Sangmin Bae, Yujin Kim, Reza Bayat, Sungnyun Kim, Jiyoun Ha, Tal Schuster, Adam Fisch, Hrayr Harutyunyan, Ziwei Ji, Aaron Courville, Se-Young Yun
ICLR 2025 Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-Wise LoRA Sangmin Bae, Adam Fisch, Hrayr Harutyunyan, Ziwei Ji, Seungyeon Kim, Tal Schuster
NeurIPS 2024 ANAH-V2: Scaling Analytical Hallucination Annotation of Large Language Models Yuzhe Gu, Ziwei Ji, Wenwei Zhang, Chengqi Lyu, Dahua Lin, Kai Chen
ICMLW 2024 Efficient Document Ranking with Learnable Late Interactions Himanshu Jain, Ziwei Ji, Sashank J. Reddi, Ankit Singh Rawat, Felix Yu, Aditya Krishna Menon, Sadeep Jayasumana
ICMLW 2024 Efficient Document Ranking with Learnable Late Interactions Himanshu Jain, Ziwei Ji, Ankit Singh Rawat, Andreas Veit, Sadeep Jayasumana, Sashank J. Reddi, Aditya Krishna Menon, Felix Yu
ICLR 2024 Think Before You Speak: Training Language Models with Pause Tokens Sachin Goyal, Ziwei Ji, Ankit Singh Rawat, Aditya Krishna Menon, Sanjiv Kumar, Vaishnavh Nagarajan
ICML 2023 Diverse and Faithful Knowledge-Grounded Dialogue Generation via Sequential Posterior Inference Yan Xu, Deqian Kong, Dehong Xu, Ziwei Ji, Bo Pang, Pascale Fung, Ying Nian Wu
NeurIPSW 2023 Think Before You Speak: Training Language Models with Pause Tokens Sachin Goyal, Ziwei Ji, Ankit Singh Rawat, Aditya Krishna Menon, Sanjiv Kumar, Vaishnavh Nagarajan
ICLR 2022 Actor-Critic Is Implicitly Biased Towards High Entropy Optimal Policies Yuzheng Hu, Ziwei Ji, Matus Telgarsky
ICML 2022 Agnostic Learnability of Halfspaces via Logistic Loss Ziwei Ji, Kwangjun Ahn, Pranjal Awasthi, Satyen Kale, Stefani Karp
NeurIPS 2022 Reproducibility in Optimization: Theoretical Framework and Limits Kwangjun Ahn, Prateek Jain, Ziwei Ji, Satyen Kale, Praneeth Netrapalli, Gil I. Shamir
ALT 2021 Characterizing the Implicit Bias via a Primal-Dual Analysis Ziwei Ji, Matus Telgarsky
AAAI 2021 CrossNER: Evaluating Cross-Domain Named Entity Recognition Zihan Liu, Yan Xu, Tiezheng Yu, Wenliang Dai, Ziwei Ji, Samuel Cahyawijaya, Andrea Madotto, Pascale Fung
NeurIPS 2021 Early-Stopped Neural Networks Are Consistent Ziwei Ji, Justin Li, Matus J. Telgarsky
ICML 2021 Fast Margin Maximization via Dual Acceleration Ziwei Ji, Nathan Srebro, Matus Telgarsky
ICLR 2021 Generalization Bounds via Distillation Daniel Hsu, Ziwei Ji, Matus Telgarsky, Lan Wang
NeurIPS 2020 Directional Convergence and Alignment in Deep Learning Ziwei Ji, Matus J. Telgarsky
COLT 2020 Gradient Descent Follows the Regularization Path for General Losses Ziwei Ji, Miroslav Dudík, Robert E. Schapire, Matus Telgarsky
ICLR 2020 Neural Tangent Kernels, Transportation Mappings, and Universal Approximation Ziwei Ji, Matus Telgarsky, Ruicheng Xian
ICLR 2020 Polylogarithmic Width Suffices for Gradient Descent to Achieve Arbitrarily Small Test Error with Shallow ReLU Networks Ziwei Ji, Matus Telgarsky
ICLR 2019 Gradient Descent Aligns the Layers of Deep Linear Networks Ziwei Ji, Matus Telgarsky
COLT 2019 The Implicit Bias of Gradient Descent on Nonseparable Data Ziwei Ji, Matus Telgarsky