Shavit, Nir N
9 publications
ICLR
2026
Scalable Energy-Based Models via Adversarial Training: Unifying Discrimination and Generation
NeurIPSW
2024
Jailbreak Defense in a Narrow Domain: Failures of Existing Methods and Improving Transcript-Based Classifiers