Herzig, Jonathan

4 publications

ICLR 2026. ManagerBench: Evaluating the Safety-Pragmatism Trade-Off in Autonomous LLMs. Adi Simhi, Jonathan Herzig, Martin Tutek, Itay Itzhak, Idan Szpektor, Yonatan Belinkov
ICML 2024. Representation Surgery: Theory and Practice of Affine Steering. Shashwat Singh, Shauli Ravfogel, Jonathan Herzig, Roee Aharoni, Ryan Cotterell, Ponnurangam Kumaraguru
NeurIPS 2024. TACT: Advancing Complex Aggregative Reasoning with Information Extraction Tools. Avi Caciularu, Alon Jacovi, Eyal Ben-David, Sasha Goldshtein, Tal Schuster, Jonathan Herzig, Gal Elidan, Amir Globerson
NeurIPS 2023. What You See Is What You Read? Improving Text-Image Alignment Evaluation. Michal Yarom, Yonatan Bitton, Soravit Changpinyo, Roee Aharoni, Jonathan Herzig, Oran Lang, Eran Ofek, Idan Szpektor