Mou, Yutao

1 publication

NeurIPS 2024. SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types. Yutao Mou, Shikun Zhang, Wei Ye.