Pan, Weiwei
32 publications
TMLR
2025
Is What You Ask for What You Get? Investigating Concept Associations in Text-to-Image Models
ICMLW
2024
A Sim2Real Approach for Identifying Task-Relevant Properties in Interpretable Machine Learning
ICMLW
2024
AMBER: An Entropy Maximizing Environment Design Algorithm for Inverse Reinforcement Learning
NeurIPSW
2024
Accuracy Isn’t Everything: Understanding the Desiderata of AI Tools in Legal-Financial Settings
ICMLW
2024
Bias Transmission in Large Language Models: Evidence from Gender-Occupation Bias in GPT-4
NeurIPSW
2024
Is What You Ask for What You Get? Investigating Concept Associations in Text-to-Image Models
ICMLW
2024
Using Large Language Models for Humanitarian Frontline Negotiation: Opportunities and Considerations
ICMLW
2023
Discovering User Types: Mapping User Traits by Task-Specific Behaviors in Reinforcement Learning
ICMLW
2023
Why Do Universal Adversarial Attacks Work on Large Language Models?: Geometry Might Be the Answer