ML Anthology
Authors
Search
About
Wei, Alexander
7 publications
ICML
2024
Covert Malicious Finetuning: Challenges in Safeguarding LLM Adaptation
Danny Halawi
,
Alexander Wei
,
Eric Wallace
,
Tony Tong Wang
,
Nika Haghtalab
,
Jacob Steinhardt
NeurIPS
2023
Jailbroken: How Does LLM Safety Training Fail?
Alexander Wei
,
Nika Haghtalab
,
Jacob Steinhardt
ICML
2022
More than a Toy: Random Matrix Models Predict How Real-World Neural Representations Generalize
Alexander Wei
,
Wei Hu
,
Jacob Steinhardt
ICML
2022
Predicting Out-of-Distribution Error with the Projection Norm
Yaodong Yu
,
Zitong Yang
,
Alexander Wei
,
Yi Ma
,
Jacob Steinhardt
NeurIPS
2022
TCT: Convexifying Federated Learning Using Bootstrapped Neural Tangent Kernels
Yaodong Yu
,
Alexander Wei
,
Sai Praneeth Karimireddy
,
Yi Ma
,
Michael I. Jordan
NeurIPS
2021
Learning Equilibria in Matching Markets from Bandit Feedback
Meena Jagadeesan
,
Alexander Wei
,
Yixin Wang
,
Michael I. Jordan
,
Jacob Steinhardt
NeurIPS
2020
Optimal Robustness-Consistency Trade-Offs for Learning-Augmented Online Algorithms
Alexander Wei
,
Fred Zhang