Hayase, Jonathan

18 publications

NeurIPS 2025 Broken Tokens? Your Language Model Can Secretly Handle Non-Canonical Tokenizations Brian Siyuan Zheng, Alisa Liu, Orevaoghene Ahia, Jonathan Hayase, Yejin Choi, Noah A. Smith

CVPR 2025 PLeaS - Merging Models with Permutations and Least Squares Anshul Nasery, Jonathan Hayase, Pang Wei Koh, Sewoong Oh

ICLR 2025 Scalable Extraction of Training Data from Aligned, Production Language Models Milad Nasr, Javier Rando, Nicholas Carlini, Jonathan Hayase, Matthew Jagielski, A. Feder Cooper, Daphne Ippolito, Christopher A. Choquette-Choo, Florian Tramèr, Katherine Lee

NeurIPS 2025 Scalable Fingerprinting of Large Language Models Anshul Nasery, Jonathan Hayase, Creston Brooks, Peiyao Sheng, Himanshu Tyagi, Pramod Viswanath, Sewoong Oh

ICLRW 2025 Scalable Fingerprinting of Large Language Models Anshul Nasery, Jonathan Hayase, Creston Brooks, Peiyao Sheng, Himanshu Tyagi, Pramod Viswanath, Sewoong Oh

NeurIPS 2024 Data Mixture Inference Attack: BPE Tokenizers Reveal Training Data Compositions Jonathan Hayase, Alisa Liu, Yejin Choi, Sewoong Oh, Noah A. Smith

ICMLW 2024 Data Mixture Inference: What Do BPE Tokenizers Reveal About Their Training Data? Jonathan Hayase, Alisa Liu, Yejin Choi, Sewoong Oh, Noah A. Smith

COLT 2024 Insufficient Statistics Perturbation: Stable Estimators for Private Least Squares (Extended Abstract) Gavin Brown, Jonathan Hayase, Samuel Hopkins, Weihao Kong, Xiyang Liu, Sewoong Oh, Juan C Perdomo, Adam Smith

NeurIPS 2024 Query-Based Adversarial Prompt Generation Jonathan Hayase, Ema Borevkovic, Nicholas Carlini, Florian Tramèr, Milad Nasr

ICML 2024 Stealing Part of a Production Language Model Nicholas Carlini, Daniel Paleka, Krishnamurthy Dj Dvijotham, Thomas Steinke, Jonathan Hayase, A. Feder Cooper, Katherine Lee, Matthew Jagielski, Milad Nasr, Arthur Conmy, Eric Wallace, David Rolnick, Florian Tramèr

NeurIPS 2023 DataComp: In Search of the Next Generation of Multimodal Datasets Samir Yitzhak Gadre, Gabriel Ilharco, Alex Fang, Jonathan Hayase, Georgios Smyrnis, Thao Nguyen, Ryan Marten, Mitchell Wortsman, Dhruba Ghosh, Jieyu Zhang, Eyal Orgad, Rahim Entezari, Giannis Daras, Sarah Pratt, Vivek Ramanujan, Yonatan Bitton, Kalyani Marathe, Stephen Mussmann, Richard Vencu, Mehdi Cherti, Ranjay Krishna, Pang Wei W Koh, Olga Saukh, Alexander J Ratner, Shuran Song, Hannaneh Hajishirzi, Ali Farhadi, Romain Beaumont, Sewoong Oh, Alex Dimakis, Jenia Jitsev, Yair Carmon, Vaishaal Shankar, Ludwig Schmidt

ICLR 2023 Few-Shot Backdoor Attacks via Neural Tangent Kernels Jonathan Hayase, Sewoong Oh

ICLR 2023 Git Re-Basin: Merging Models Modulo Permutation Symmetries Samuel Ainsworth, Jonathan Hayase, Siddhartha Srinivasa

NeurIPS 2023 Label Poisoning Is All You Need Rishi Jha, Jonathan Hayase, Sewoong Oh

TMLR 2023 Towards a Defense Against Federated Backdoor Attacks Under Continuous Training Shuaiqi Wang, Jonathan Hayase, Giulia Fanti, Sewoong Oh

NeurIPSW 2022 Few-Shot Backdoor Attacks via Neural Tangent Kernels Jonathan Hayase, Sewoong Oh

NeurIPS 2022 Zonotope Domains for Lagrangian Neural Network Verification Matt Jordan, Jonathan Hayase, Alex Dimakis, Sewoong Oh

ICML 2021 SPECTRE: Defending Against Backdoor Attacks Using Robust Statistics Jonathan Hayase, Weihao Kong, Raghav Somani, Sewoong Oh