Doshi-Velez, Finale
81 publications
ICLRW
2025
Understanding the Relationship Between Prompts and Response Uncertainty in Large Language Models
ICMLW
2024
A Sim2Real Approach for Identifying Task-Relevant Properties in Interpretable Machine Learning
ICMLW
2024
AMBER: An Entropy Maximizing Environment Design Algorithm for Inverse Reinforcement Learning
NeurIPSW
2024
Accuracy Isn’t Everything: Understanding the Desiderata of AI Tools in Legal-Financial Settings
ICMLW
2023
Discovering User Types: Mapping User Traits by Task-Specific Behaviors in Reinforcement Learning
ICMLW
2023
Why Do Universal Adversarial Attacks Work on Large Language Models?: Geometry Might Be the Answer
NeurIPSW
2022
An Empirical Analysis of the Advantages of Finite V.s. Infinite Width Bayesian Neural Networks
NeurIPS
2022
Leveraging Factored Action Spaces for Efficient Offline Reinforcement Learning in Healthcare
ICMLW
2022
Leveraging Factored Action Spaces for Efficient Offline Reinforcement Learning in Healthcare
NeurIPS
2021
Learning MDPs from Features: Predict-Then-Optimize for Sequential Decision Making by Reinforcement Learning
IJCAI
2019
Diversity-Inducing Policy Gradient: Using Maximum Mean Discrepancy to Find a Set of Diverse Policies
ICML
2018
Decomposition of Uncertainty in Bayesian Deep Learning for Efficient and Risk-Sensitive Learning
IJCAI
2017
Right for the Right Reasons: Training Differentiable Models by Constraining Their Explanations
AAAI
2010
Nonparametric Bayesian Approaches for Reinforcement Learning in Partially Observable Domains