Carranza, Andres

3 publications

ICLRW 2024 Towards an Improved Understanding and Utilization of Maximum Manifold Capacity Representations Rylan Schaeffer, Berivan Isik, Dhruv Bhandarkar Pai, Andres Carranza, Victor Lecomte, Alyssa Unell, Mikail Khona, Thomas Edward Yerxa, Yann LeCun, SueYeon Chung, Andrey Gromov, Ravid Shwartz-Ziv, Sanmi Koyejo
ICMLW 2023 Deceptive Alignment Monitoring Andres Carranza, Dhruv Bhandarkar Pai, Rylan Schaeffer, Arnuv Tandon, Sanmi Koyejo
ICMLW 2023 FACADE: A Framework for Adversarial Circuit Anomaly Detection and Evaluation Dhruv Bhandarkar Pai, Andres Carranza, Rylan Schaeffer, Arnuv Tandon, Sanmi Koyejo