Lee, Hwiwon

1 publication

NeurIPS 2025. SEC-Bench: Automated Benchmarking of LLM Agents on Real-World Software Security Tasks. Hwiwon Lee, Ziqi Zhang, Hanxiao Lu, Lingming Zhang.