Rawat, Ambrish
11 publications
NeurIPSW
2024
Adversarial Prompt Evaluation: Systematic Benchmarking of Guardrails Against Prompt Input Attacks on LLMs
NeurIPSW
2024
Attack Atlas: A Practitioner's Perspective on Challenges and Pitfalls in Red Teaming GenAI
AAAI
2022
Bandit Limited Discrepancy Search and Application to Machine Learning Pipeline Optimization