Gidel, Gauthier
72 publications
ICLR
2025
Learning Diverse Attacks on Large Language Models for Robust Red-Teaming and Safety Tuning
NeurIPSW
2024
Learning Diverse Attacks on Large Language Models for Robust Red-Teaming and Safety Tuning
ICML
2024
Sarah Frank-Wolfe: Methods for Constrained Optimization with Best Rates and Practical Features
NeurIPS
2024
Soft Prompt Threats: Attacking Safety Alignment and Unlearning in Open-Source LLMs Through the Embedding Space
NeurIPS
2023
Feature Likelihood Divergence: Evaluating the Generalization of Generative Models Using Samples
NeurIPS
2023
Optimal Extragradient-Based Algorithms for Stochastic Variational Inequalities with Separable Structure
AISTATS
2022
On the Convergence of Stochastic Extragradient for Bilinear Games Using Restarted Iteration Averaging
NeurIPS
2022
Last-Iterate Convergence of Optimistic Gradient Method for Monotone Variational Inequalities
NeurIPSW
2022
Nesterov Meets Optimism: Rate-Optimal Optimistic-Gradient-Based Method for Stochastic Bilinearly-Coupled Minimax Optimization