Khorrami, Farshad
15 publications
ICML
2025
EnIGMA: Interactive Tools Substantially Assist LM Agents in Finding Security Vulnerabilities
NeurIPS
2025
OSVI-WM: One-Shot Visual Imitation for Unseen Tasks Using World-Model-Guided Trajectory Generation
NeurIPS
2024
NYU CTF Bench: A Scalable Open-Source Benchmark Dataset for Evaluating LLMs in Offensive Security