Hayase, Jonathan

18 publications

NeurIPS 2025 Broken Tokens? Your Language Model Can Secretly Handle Non-Canonical Tokenizations Brian Siyuan Zheng, Alisa Liu, Orevaoghene Ahia, Jonathan Hayase, Yejin Choi, Noah A. Smith
CVPR 2025 PLeaS - Merging Models with Permutations and Least Squares Anshul Nasery, Jonathan Hayase, Pang Wei Koh, Sewoong Oh
ICLR 2025 Scalable Extraction of Training Data from Aligned, Production Language Models Milad Nasr, Javier Rando, Nicholas Carlini, Jonathan Hayase, Matthew Jagielski, A. Feder Cooper, Daphne Ippolito, Christopher A. Choquette-Choo, Florian Tramèr, Katherine Lee
NeurIPS 2025 Scalable Fingerprinting of Large Language Models Anshul Nasery, Jonathan Hayase, Creston Brooks, Peiyao Sheng, Himanshu Tyagi, Pramod Viswanath, Sewoong Oh
ICLRW 2025 Scalable Fingerprinting of Large Language Models Anshul Nasery, Jonathan Hayase, Creston Brooks, Peiyao Sheng, Himanshu Tyagi, Pramod Viswanath, Sewoong Oh
NeurIPS 2024 Data Mixture Inference Attack: BPE Tokenizers Reveal Training Data Compositions Jonathan Hayase, Alisa Liu, Yejin Choi, Sewoong Oh, Noah A. Smith
ICMLW 2024 Data Mixture Inference: What Do BPE Tokenizers Reveal About Their Training Data? Jonathan Hayase, Alisa Liu, Yejin Choi, Sewoong Oh, Noah A. Smith
COLT 2024 Insufficient Statistics Perturbation: Stable Estimators for Private Least Squares (Extended Abstract) Gavin Brown, Jonathan Hayase, Samuel Hopkins, Weihao Kong, Xiyang Liu, Sewoong Oh, Juan C Perdomo, Adam Smith
NeurIPS 2024 Query-Based Adversarial Prompt Generation Jonathan Hayase, Ema Borevkovic, Nicholas Carlini, Florian Tramèr, Milad Nasr
ICML 2024 Stealing Part of a Production Language Model Nicholas Carlini, Daniel Paleka, Krishnamurthy Dj Dvijotham, Thomas Steinke, Jonathan Hayase, A. Feder Cooper, Katherine Lee, Matthew Jagielski, Milad Nasr, Arthur Conmy, Eric Wallace, David Rolnick, Florian Tramèr
NeurIPS 2023 DataComp: In Search of the Next Generation of Multimodal Datasets Samir Yitzhak Gadre, Gabriel Ilharco, Alex Fang, Jonathan Hayase, Georgios Smyrnis, Thao Nguyen, Ryan Marten, Mitchell Wortsman, Dhruba Ghosh, Jieyu Zhang, Eyal Orgad, Rahim Entezari, Giannis Daras, Sarah Pratt, Vivek Ramanujan, Yonatan Bitton, Kalyani Marathe, Stephen Mussmann, Richard Vencu, Mehdi Cherti, Ranjay Krishna, Pang Wei W Koh, Olga Saukh, Alexander J Ratner, Shuran Song, Hannaneh Hajishirzi, Ali Farhadi, Romain Beaumont, Sewoong Oh, Alex Dimakis, Jenia Jitsev, Yair Carmon, Vaishaal Shankar, Ludwig Schmidt
ICLR 2023 Few-Shot Backdoor Attacks via Neural Tangent Kernels Jonathan Hayase, Sewoong Oh
ICLR 2023 Git Re-Basin: Merging Models Modulo Permutation Symmetries Samuel Ainsworth, Jonathan Hayase, Siddhartha Srinivasa
NeurIPS 2023 Label Poisoning Is All You Need Rishi Jha, Jonathan Hayase, Sewoong Oh
TMLR 2023 Towards a Defense Against Federated Backdoor Attacks Under Continuous Training Shuaiqi Wang, Jonathan Hayase, Giulia Fanti, Sewoong Oh
NeurIPSW 2022 Few-Shot Backdoor Attacks via Neural Tangent Kernels Jonathan Hayase, Sewoong Oh
NeurIPS 2022 Zonotope Domains for Lagrangian Neural Network Verification Matt Jordan, Jonathan Hayase, Alex Dimakis, Sewoong Oh
ICML 2021 SPECTRE: Defending Against Backdoor Attacks Using Robust Statistics Jonathan Hayase, Weihao Kong, Raghav Somani, Sewoong Oh