Bethany, Mazal

2 publications

AAAI 2024. Image Safeguarding: Reasoning with Conditional Vision Language Model and Obfuscating Unsafe Content Counterfactually. Mazal Bethany, Brandon Wherry, Nishant Vishwamitra, Peyman Najafirad
NeurIPSW 2024. Jailbreaking Large Language Models with Symbolic Mathematics. Emet Bethany, Mazal Bethany, Juan A. Nolazco-Flores, Sumit Kumar Jha, Peyman Najafirad