Sun, Meng
7 publications
ICML
2025
Position: Trustworthy AI Agents Require the Integration of Large Language Models and Formal Methods
NeurIPS
2024
Adversarial Representation Engineering: A General Model Editing Framework for Large Language Models
7 publications