ICMLW 2024

1500 papers

"You Just Can’t Go Around Killing People'' Explaining Agent Behavior to a Human Terminator Uri Menkes, Ofra Amir, Assaf Hallak
PDF OpenReview
(Almost) Smooth Sailing: Towards Numerical Stability of Neural Networks Through Differentiable Regularization of the Condition Number Rossen Nenov, Daniel Haider, Peter Balazs
PDF OpenReview
(Deep) Generative Geodesics Beomsu Kim, Michael Anthony Puthawala, Jong Chul Ye, Emanuele Sansone
PDF OpenReview
$\alpha$-Fair Contextual Bandits Siddhant Chaudhary, Abhishek Sinha
PDF OpenReview
$\bf{\Phi}_\textrm{Flow}$: Differentiable Simulations for Machine Learning Philipp Holl, Nils Thuerey
PDF OpenReview
$\mathbb{X}$-Sample Contrastive Loss: Improving Contrastive Learning with Sample Similarity Graphs Vlad Sobal, Mark Ibrahim, Randall Balestriero, Vivien Cabannes, Diane Bouchacourt, Pietro Astolfi, Kyunghyun Cho, Yann LeCun
PDF OpenReview
$\nabla \tau$: Gradient-Based and Task-Agnostic Machine Unlearning Daniel Trippa, Cesare Campagnano, Maria Sofia Bucarelli, Gabriele Tolomei, Fabrizio Silvestri
PDF OpenReview
2Bits of Protein: Efficient Protein Language Models at the Scale of 2-Bits Oliver M. Turnbull, Mohamed Baioumy, Charlotte Deane
PDF OpenReview
3D Reconstruction of Dark Matter Fields with Diffusion Models: Towards Application to Galaxy Surveys Core Francisco Park, Nayantara Mudur, Carolina Cuesta-Lazaro, Yueying Ni, Victoria Ono, Douglas Finkbeiner
PDF OpenReview
3D Shape Completion with Test-Time Training Michael Schopf-Kuester, Zorah Lähner, Michael Moeller
PDF OpenReview
A Bayesian Approach to Adversarially Robust Life Testing Dorina Weichert, Alexander Kister, Sebastian Houben, Gunar Ernis, Tim Wirtz
PDF OpenReview
A Best-of-Both-Worlds Algorithm for Bandits with Delayed Feedback with Robustness to Excessive Delays Saeed Masoudian, Julian Zimmert, Yevgeny Seldin
PDF OpenReview
A Case for Validation Buffer in Pessimistic Actor-Critic Michal Nauman, Mateusz Ostaszewski, Marek Cygan
PDF OpenReview
A Case-Based Reasoning Approach to Dynamic Few-Shot Prompting for Code Generation Dustin Dannenhauer, Zohreh Dannenhauer, Despina Christou, Kostas Hatalis
PDF OpenReview
A Classifier-Based Approach to Multi-Class Anomaly Detection Applied to Astronomical Time-Series Daniel Muthukrishna, Rithwik Gupta
PDF OpenReview
A Coding-Theoretic Analysis of Hyperspherical Prototypical Learning Geometry Martin Lindström, Borja Rodríguez Gálvez, Ragnar Thobaben, Mikael Skoglund
PDF OpenReview
A Critical Look at Tokenwise Reward-Guided Text Generation Ahmad Rashid, Ruotian Wu, Julia Grosse, Agustinus Kristiadi, Pascal Poupart
PDF OpenReview
A Deeper Look at Depth Pruning of LLMs Shoaib Ahmed Siddiqui, Xin Dong, Greg Heinrich, Thomas Breuel, Jan Kautz, David Krueger, Pavlo Molchanov
PDF OpenReview
A Differentiable Approach to Multi-Scale Brain Modeling Chaoming Wang, Muyang Lyu, Tianqiu Zhang, Sichao He, Si Wu
PDF OpenReview
A Differentiable Topological Notion of Local Maxima for Keypoint Detection Giovanni Barbarani, Francesco Vaccarino, Gabriele Trivigno, Marco Guerra, Gabriele Berton, Carlo Masone
PDF OpenReview
A Fast Learning-Based Surrogate of Electrical Machines Using a Reduced Basis Alejandro Ribes, Nawfal Benchekroun, Théo Delagnes
PDF OpenReview
A Framework for Differentiable Supervised Graph Prediction Paul Krzakala, Junjie Yang, Rémi Flamary, Florence d'Alché-Buc, Charlotte Laclau, Matthieu Labeau
PDF OpenReview
A Generative Foundation Model for Antibody Sequence Understanding Justin Barton, Aretas Gaspariunas, David A Yadin, Jorge Dias, Francesca L Nice, Danielle H Minns, Olivia Snudden, Chelsea Povall, Sara Valle Tomas, Harry Dobson, James H R Farmery, Jinwoo Leem, Jacob D Galson
PDF OpenReview
A Geometric Framework for Understanding Memorization in Generative Models Brendan Leigh Ross, Hamidreza Kamkari, Zhaoyan Liu, Tongzi Wu, George Stein, Gabriel Loaiza-Ganem, Jesse C. Cresswell
PDF OpenReview
A Geometric Framework for Understanding Memorization in Generative Models Brendan Leigh Ross, Hamidreza Kamkari, Zhaoyan Liu, Tongzi Wu, George Stein, Gabriel Loaiza-Ganem, Jesse C. Cresswell
PDF OpenReview
A Geometric View of Data Complexity: Efficient Local Intrinsic Dimension Estimation with Diffusion Models Hamidreza Kamkari, Brendan Leigh Ross, Rasa Hosseinzadeh, Jesse C. Cresswell, Gabriel Loaiza-Ganem
PDF OpenReview
A Hessian-Aware Stochastic Differential Equation for Modelling SGD Xiang Li, Zebang Shen, Liang Zhang, Niao He
PDF OpenReview
A Human-like Reasoning Framework for Multi-Phases Planning Task with Large Language Models Chengxing Xie, Difan Zou
PDF OpenReview
A Multi-View Mixture-of-Experts Based on Language and Graphs for Molecular Properties Prediction Victor Yukio Shirasuna, Eduardo Soares, Emilio Vital Brazil, Karen Fiorella Aquino Gutierrez, Renato Cerqueira, Seiji Takeda, Akihiro Kishimoto
PDF OpenReview
A Neural Material Point Method for Particle-Based Simulations Omer Rochman Sharabi, Sacha Lewin, Gilles Louppe
PDF OpenReview
A Peek into Token Bias: Large Language Models Are Not yet Genuine Reasoners Bowen Jiang, Yangxinyu Xie, Zhuoqun Hao, Xiaomeng Wang, Tanwi Mallick, Weijie J Su, Camillo Jose Taylor, Dan Roth
PDF OpenReview
A Phase Transition Between Positional and Semantic Learning in a Solvable Model of Dot-Product Attention Hugo Cui, Freya Behrens, Florent Krzakala, Lenka Zdeborova
PDF OpenReview
A Policy Optimization Approach to the Solution of Unregularized Mean Field Games Sihan Zeng, Sujay Bhatt, Alec Koppel, Sumitra Ganesh
PDF OpenReview
A Pontryagin Perspective on Reinforcement Learning Onno Eberhard, Claire Vernade, Michael Muehlebach
PDF OpenReview
A Practical Diffusion Path for Sampling Omar Chehab, Anna Korba
PDF OpenReview
A Random Matrix Analysis of Learning with Noisy Labels Aymane El Firdoussi, Mohamed El Amine Seddik
PDF OpenReview
A Recipe for Charge Density Prediction Xiang Fu, Andrew Scott Rosen, Kyle Bystrom, Rui Wang, Albert Musaelian, Boris Kozinsky, Tess Smidt, Tommi Jaakkola
PDF OpenReview
A Safe Exploration Approach to Constrained Markov Decision Processes Tingting Ni, Maryam Kamgarpour
PDF OpenReview
A Sim2Real Approach for Identifying Task-Relevant Properties in Interpretable Machine Learning Eura Nofshin, Esther Brown, Brian Lim, Weiwei Pan, Finale Doshi-Velez
PDF OpenReview
A Simple and Adaptive Learning Rate for FTRL in Online Learning with Minimax Regret of $\Theta(T^{2/3})$ and Its Application to Best-of-Both-Worlds Taira Tsuchiya, Shinji Ito
PDF OpenReview
A Simple and Expressive Graph Neural Network Based Method for Structural Link Representation Veronica Lachi, Francesco Ferrini, Antonio Longa, Bruno Lepri, Andrea Passerini
PDF OpenReview
A Statistical Framework for Weak-to-Strong Generalization Seamus Somerstep, Felipe Maia Polo, Moulinath Banerjee, Yaacov Ritov, Mikhail Yurochkin, Yuekai Sun
PDF OpenReview
A Systematic Comparison of fMRI-to-Video Reconstruction Techniques Camilo Luciano Fosco, Ben Lahner, Alex J Andonian, Bowen Pan, Aude Oliva
PDF OpenReview
A Theoretical Formulation of Many-Body Message Passing Neural Networks Jiatong Han
PDF OpenReview
A Theoretical Framework for Partially Observed Reward-States in RLHF Chinmaya Kausik, Mirco Mutti, Aldo Pacchiano, Ambuj Tewari
PDF OpenReview
A Theoretical Framework for Partially-Observed Reward States in RLHF Chinmaya Kausik, Mirco Mutti, Aldo Pacchiano, Ambuj Tewari
PDF OpenReview
A Theoretical Understanding of Self-Correction Through In-Context Alignment Yifei Wang, Yuyang Wu, Zeming Wei, Stefanie Jegelka, Yisen Wang
PDF OpenReview
A Theoretical Understanding of Self-Correction Through In-Context Alignment Yifei Wang, Yuyang Wu, Zeming Wei, Stefanie Jegelka, Yisen Wang
PDF OpenReview
A Tractable Inference Perspective of Offline RL Xuejie Liu, Anji Liu, Guy Van den Broeck, Yitao Liang
PDF OpenReview
A Unified Approach to Feature Learning in Bayesian Neural Networks Noa Rubin, Zohar Ringel, Inbar Seroussi, Moritz Helias
PDF OpenReview
A Unified Confidence Sequence for Generalized Linear Models, with Applications to Bandits Junghyun Lee, Se-Young Yun, Kwang-Sung Jun
PDF OpenReview
A Universal Class of Sharpness-Aware Minimization Algorithms Behrooz Tahmasebi, Ashkan Soleymani, Dara Bahri, Stefanie Jegelka, Patrick Jaillet
PDF OpenReview
A Variational Formulation of Reinforcement Learning in Infinite-Horizon Markov Decision Processes Tim G. J. Rudner
PDF OpenReview
AbFlex: Predicting the Conformational Flexibility of Antibody CDRs Fabian C Spoendlin, Wing Ki Wong, Guy Georges, Alexander Bujotzek, Charlotte Deane
PDF OpenReview
ABodyBuilder3: Improved and Scalable Antibody Structure Predictions Henry Kenlay, Frederic A Dreyer, Daniel Cutting, Daniel Allen Nissley, Charlotte Deane
PDF OpenReview
Abstract Understanding of Core-Knowledge Concepts: Humans vs. LLMs Alessandro B. Palmarini, Melanie Mitchell
PDF OpenReview
Accelerated Online Reinforcement Learning Using Auxiliary Start State Distributions Aman Mehra, Alexandre Capone, Jeff Schneider
PDF OpenReview
Accelerating Best-of-N via Speculative Rejection Ruiqi Zhang, Momin Haider, Ming Yin, Jiahao Qiu, Mengdi Wang, Peter Bartlett, Andrea Zanette
PDF OpenReview
Accelerating Best-of-N via Speculative Rejection Ruiqi Zhang, Momin Haider, Ming Yin, Jiahao Qiu, Mengdi Wang, Peter Bartlett, Andrea Zanette
PDF OpenReview
Accelerating Best-of-N via Speculative Rejection Ruiqi Zhang, Momin Haider, Ming Yin, Jiahao Qiu, Mengdi Wang, Peter Bartlett, Andrea Zanette
PDF OpenReview
Accelerating Electron Dynamics Simulations Through Machine Learned Time Propagators Karan Shah, Attila Cangi
PDF OpenReview
Accelerating NCE Convergence with Adaptive Normalizing Constant Computation Anish Sevekari, Rishal Aggarwal, Maria Chikina, David Koes
PDF OpenReview
Accelerating Simulation of Two-Phase Flows with Neural PDE Surrogates Yoeri Poels, Koen Minartz, Harshit Bansal, Vlado Menkovski
PDF OpenReview
Accelerating Statistical Inferences in Astrophysics with Neural Networks and Hamiltonian Monte Carlo Diego Gonzalez-Hernandez, Molly Wolfson, Joseph F. Hennawi
PDF OpenReview
Accelerating Statistical Inferences in Astrophysics with Neural Networks and Hamiltonian Monte Carlo Diego Gonzalez-Hernandez, Molly Wolfson, Joseph Hennawi
PDF OpenReview
Accelerating the Inference of String Generation-Based Chemical Reaction Models for Industrial Applications Mikhail Andronov, Natalia Andronova, Michael Wand, Djork-Arné Clevert, Jürgen Schmidhuber
PDF OpenReview
Accounting for Selection Effects in Supernova Cosmology with Simulation-Based Inference and Hierarchical Bayesian Modelling Benjamin M. Boyd, Matthew Grayling, Kaisey S. Mandel
PDF OpenReview
Accuracy on the Wrong Line: On the Pitfalls of Noisy Data for OOD Generalisation Amartya Sanyal, Yaxi Hu, Yaodong Yu, Yian Ma, Yixin Wang, Bernhard Schölkopf
PDF OpenReview
Acquiring Diverse Skills Using Curriculum Reinforcement Learning with Mixture of Experts Onur Celik, Aleksandar Taranovic, Gerhard Neumann
PDF OpenReview
Active Preference Optimization for Sample Efficient RLHF Nirjhar Das, Souradip Chakraborty, Aldo Pacchiano, Sayak Ray Chowdhury
PDF OpenReview
Active Propulsion Noise Shaping for Multi-Rotor Aircraft Localization Tamir Shor, Gabriele Serussi, Tom Hirshberg, Chaim Baskin, Alex M. Bronstein
PDF OpenReview
AdaInf: Adaptive Inference for Resource-Constrained Foundation Models Zhuoyan Xu, Khoi Duc Nguyen, Preeti Mukherjee, Somali Chaterji, Yingyu Liang, Yin Li
PDF OpenReview
Adam Exploits $\ell_\infty$-Geometry of Loss Landscape via Coordinate-Wise Adaptivity Shuo Xie, Mohamad Amin Mohamadi, Zhiyuan Li
PDF OpenReview
Adam-Mini: Use Fewer Learning Rates to Gain More Yushun Zhang, Congliang Chen, Ziniu Li, Tian Ding, Chenwei Wu, Yinyu Ye, Zhi-Quan Luo, Ruoyu Sun
PDF OpenReview
AdaMeM: Memory Efficient Momentum for Adafactor Nikhil Vyas, Depen Morwani, Sham M. Kakade
PDF OpenReview
AdaNF: Quantization Group Adaptive NormalFloat for Low Bit Fine-Tuning of LLMs Yeojoon Youn, Sehoon Kim, Suhong Moon, Sang Keun Choe, Ce Zhang
PDF OpenReview
Adapting LLM Agents with Universal Feedback in Communication Kuan Wang, Yadong Lu, Michael Santacroce, Yeyun Gong, Chao Zhang, Yelong Shen
PDF OpenReview
Adaptive $q$-Network: On-the-Fly Target Selection for Deep Reinforcement Learning Théo Vincent, Fabian Wahren, Jan Peters, Boris Belousov, Carlo D'Eramo
PDF OpenReview
Adaptive Concept Bottleneck for Foundation Models Jihye Choi, Jayaram Raghuram, Yixuan Li, Suman Banerjee, Somesh Jha
PDF OpenReview
Adaptive Experimental Design for Policy Learning: Contextual Best Arm Identification Masahiro Kato, Kyohei Okumura, Takuya Ishihara, Toru Kitagawa
PDF OpenReview
Adaptive Foundation Models for Online Decisions: HyperAgent with Fast Incremental Uncertainty Estimation Yingru Li, Jiawei Xu, Zhi-Quan Luo
PDF OpenReview
Adaptive Model Pruning in Federated Learning Through Loss Exploration Christian Internò, Elena Raponi, Niki van Stein, Thomas Bäck, Markus Olhofer, Yaochu Jin, Barbara Hammer
PDF OpenReview
Adaptive Sampling for Continuous Group Equivariant Neural Networks Berfin Inal, Gabriele Cesa
PDF OpenReview
Adaptive Two-Level Quasi-Monte Carlo for Soft Actor-Critic Du Ouyang, Zhenpeng Shi, Aodong Guo, Huaze Tang, Hejin Wang, Chao Wang, Wenbo Ding
PDF OpenReview
AdaptiveBackdoor: Backdoored Language Model Agents That Detect Human Overseers Heng Wang, Ruiqi Zhong, Jiaxin Wen, Jacob Steinhardt
PDF OpenReview
AdaptiveBackdoor: Backdoored Language Model Agents That Detect Human Overseers Heng Wang, Ruiqi Zhong, Jiaxin Wen, Jacob Steinhardt
PDF OpenReview
AdsorbDiff: Adsorbate Placement via Conditional Denoising Diffusion Adeesh Kolluru, John R. Kitchin
PDF OpenReview
Advancing LLM Reasoning Generalists with Preference Trees Lifan Yuan, Ganqu Cui, Hanbin Wang, Ning Ding, Xingyao Wang, Jia Deng, Boji Shan, Huimin Chen, Ruobing Xie, Yankai Lin, Zhenghao Liu, Bowen Zhou, Hao Peng, Zhiyuan Liu, Maosong Sun
PDF OpenReview
Advantage Alignment Algorithms Juan Agustin Duque, Milad Aghajohari, Tim Cooijmans, Tianyu Zhang, Aaron Courville
PDF OpenReview
Adversarial Circuit Evaluation Niels uit de Bos, Adrià Garriga-Alonso
PDF OpenReview
Adversarial Multi-Dueling Bandits Pratik Gajane
PDF OpenReview
Adversarial Robustness Limits via Scaling-Law and Human-Alignment Studies Brian R. Bartoldson, James Diffenderfer, Konstantinos Parasyris, Bhavya Kailkhura
PDF OpenReview
Adversarial Robustness Limits via Scaling-Law and Human-Alignment Studies Brian R. Bartoldson, James Diffenderfer, Konstantinos Parasyris, Bhavya Kailkhura
PDF OpenReview
Adversarial Training with Synthesized Data: A Path to Robust and Generalizable Neural Networks Reza Bayat, Irina Rish
PDF OpenReview
Adversarially Robust CLIP Models Induce Better (Robust) Perceptual Metrics Francesco Croce, Christian Schlarmann, Naman Deep Singh, Matthias Hein
PDF OpenReview
AI Agents with Formal Security Guarantees Mislav Balunovic, Luca Beurer-Kellner, Marc Fischer, Martin Vechev
PDF OpenReview
AI Alignment with Changing and Influenceable Reward Functions Micah Carroll, Davis Foote, Anand Siththaranjan, Stuart Russell, Anca Dragan
PDF OpenReview
AI Alignment with Changing and Influenceable Reward Functions Micah Carroll, Davis Foote, Anand Siththaranjan, Stuart Russell, Anca Dragan
PDF OpenReview
AI for an Inverse Problem: Physical Model Solving Quantum Gravity Koji Hashimoto, Koshiro Matsuo, Masaki Murata, Gakuto Ogiwara, Daichi Takeda
PDF OpenReview
Aligned Diffusion Models for Retrosynthesis Najwa Laabid, Severi Rissanen, Markus Heinonen, Arno Solin, Vikas Garg
PDF OpenReview
Aligned Diffusion Models for Retrosynthesis Najwa Laabid, Severi Rissanen, Markus Heinonen, Arno Solin, Vikas Garg
PDF OpenReview
Aligning Crowd Feedback via Distributional Preference Reward Modeling Dexun Li, Cong Zhang, Kuicai Dong, Derrick Goh Xin Deik, Ruiming Tang, Yong Liu
PDF OpenReview
Aligning Large Language Models with Representation Editing: A Control Perspective Lingkai Kong, Haorui Wang, Wenhao Mu, Yuanqi Du, Yuchen Zhuang, Yifei Zhou, Yue Song, Rongzhi Zhang, Kai Wang, Chao Zhang
PDF OpenReview
Alignment Calibration: Machine Unlearning for Contrastive Learning Under Auditing Yihan Wang, Yiwei Lu, Guojun Zhang, Franziska Boenisch, Adam Dziedzic, Yaoliang Yu, Xiao-Shan Gao
PDF OpenReview
Alignment Is All You Need: A Training-Free Augmentation Strategy for Pose-Guided Video Generation XiaoyuJin, Zunnan Xu, Mingwen Ou, Wenming Yang
PDF OpenReview
Alignment of MPNNs and Graph Transformers Bao Nguyen, Anjana Yodaiken, Petar Veličković
PDF OpenReview
All Roads Lead to Rome? Exploring Representational Similarities Between Latent Spaces of Generative Image Models Charumathi Badrinath, Usha Bhalla, Alex Oesterling, Suraj Srinivas, Himabindu Lakkaraju
PDF OpenReview
All Roads Lead to Rome? Exploring Representational Similarities Between Latent Spaces of Generative Image Models Charumathi Badrinath, Usha Bhalla, Alex Oesterling, Suraj Srinivas, Himabindu Lakkaraju
PDF OpenReview
All Roads Lead to Rome? Exploring Representational Similarities Between Latent Spaces of Generative Image Models Charumathi Badrinath, Usha Bhalla, Alex Oesterling, Suraj Srinivas, Himabindu Lakkaraju
PDF OpenReview
Altared Environments: The Role of Normative Infrastructure in AI Alignment Rakshit Trivedi, Nikhil Chandak, Carter Blair, Atrisha Sarkar, Tehilla Weltman, Dylan Hadfield-Menell, Gillian K Hadfield
PDF OpenReview
AMBER: An Entropy Maximizing Environment Design Algorithm for Inverse Reinforcement Learning Paul Nitschke, Lars Lien Ankile, Eura Nofshin, Siddharth Swaroop, Finale Doshi-Velez, Weiwei Pan
PDF OpenReview
Amortized Active Causal Induction with Deep Reinforcement Learning Yashas Annadani, Panagiotis Tigas, Stefan Bauer, Adam Foster
PDF OpenReview
Amortized Probabilistic Detection of Communities in Graphs Yueqi Wang, Yoonho Lee, Pallab Basu, Juho Lee, Yee Whye Teh, Liam Paninski, Ari Pakman
PDF OpenReview
An Advanced Physics-Informed Neural Operator for Comprehensive Design Optimization of Highly-Nonlinear Systems: An Aerospace Composites Processing Case Study Milad Ramezankhani, Anirudh Deodhar, Rishi Yash Parekh, Dagnachew Birru
PDF OpenReview
An Adversarial Example for Direct Logit Attribution: Memory Management in GELU-4L Jett Janiak, Can Rager, James Dao, Yeu-Tong Lau
PDF OpenReview
An Analytical Approach to Enhancing DNN Efficiency and Accuracy Using Approximate Multiplication Salar Shakibhamedan, Anice Jahanjoo, Amin Aminifar, Nima Amirafshar, Nima TaheriNejad, Axel Jantsch
PDF OpenReview
An Auditing Test to Detect Behavioral Shift in Language Models Leo Richter, Nitin Agrawal, Xuanli He, Pasquale Minervini, Matt Kusner
PDF OpenReview
An Embodied Generalist Agent in 3D World Jiangyong Huang, Silong Yong, Xiaojian Ma, Xiongkun Linghu, Puhao Li, Yan Wang, Qing Li, Song-Chun Zhu, Baoxiong Jia, Siyuan Huang
PDF OpenReview
An Empirical Study into Clustering of Unseen Datasets with Self-Supervised Foundation Models Scott C. Lowe, Joakim Bruslund Haurum, Sageev Oore, Thomas B. Moeslund, Graham W. Taylor
PDF OpenReview
An Equivariant Flow Matching Framework for Learning Molecular Crystallization Shengchao Liu, Liang Yan, Hongyu Guo, Anima Anandkumar
PDF OpenReview
An Exactly Solvable Model for Emergence and Scaling Laws Yoonsoo Nam, Nayara Fonseca, Seok Hyeong Lee, Chris Mingard, Ard A. Louis
PDF OpenReview
An In-Context Learning Theoretic Analysis of Chain-of-Thought Chenxiao Yang, Zhiyuan Li, David Wipf
PDF OpenReview
An Information-Theoretic Study of Lying in LLMs Ann-Kathrin Dombrowski, Guillaume Corlouer
PDF OpenReview
An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models Yangchen Pan, Junfeng Wen, Chenjun Xiao, Philip Torr
PDF OpenReview
Analysing Feature Learning of Gradient Descent Using Periodic Functions Jaehui Hwang, Taeyoung Kim, Hongseok Yang
PDF OpenReview
Analysis of Atom-Level Pretraining with QM Data for Graph Neural Networks Molecular Property Models Jose Arjona-Medina, Ramil Nugmanov
PDF OpenReview
Analyzing & Eliminating Learning Rate Warmup in GPT Pre-Training Atli Kosson, Bettina Messmer, Martin Jaggi
PDF OpenReview
Analyzing and Improving Surrogate Gradient Training in Binary Neural Networks Using Dynamical Systems Theory Rainer Engelken, Larry Abbott
PDF OpenReview
Analyzing GFlowNets: Stability, Expressiveness, and Assessment Tiago Silva, Eliezer de Souza da Silva, Rodrigo Barreto Alves, Luiz Max Carvalho, Amauri H Souza, Samuel Kaski, Vikas Garg, Diego Mesquita
PDF OpenReview
Analyzing the Generalization and Reliability of Steering Vectors Daniel Chee Hian Tan, David Chanin, Aengus Lynch, Adrià Garriga-Alonso, Dimitrios Kanoulas, Brooks Paige, Robert Kirk
PDF OpenReview
Anthropocentric Bias and the Possibility of Artificial Cognition Raphaël Millière, Charles Rathkopf
PDF OpenReview
Antigen-Specific Antibody Design via Direct Energy-Based Preference Optimization Xiangxin Zhou, Dongyu Xue, Ruizhe Chen, Zaixiang Zheng, Liang Wang, Quanquan Gu
PDF OpenReview
Approximate Natural Gradient in Gaussian Processes with Non-Log-Concave Likelihoods Marcelo Hartmann
PDF OpenReview
Are Large Language Models Chameleons? Mingmeng Geng, Sihong He, Roberto Trotta
PDF OpenReview
Are Protein Language Models Compute Optimal? Yaiza Serrano, Alvaro Ciudad Serrano, Alexis Molina
PDF OpenReview
AROMA: Preserving Spatial Structure for Latent PDE Modeling with Local Neural Fields Louis Serrano, Thomas X Wang, Etienne Le Naour, Jean-Noël Vittaut, Patrick Gallinari
PDF OpenReview
AsEP: Benchmarking Deep Learning Methods for Antibody-Specific Epitope Prediction ChuNan Liu, Lilian Denzler, Yihong Chen, Brooks Paige, Andrew CR Martin
PDF OpenReview
Assessing the Viability of Generative Modeling in Simulated Astronomical Observations Patrick Janulewicz, Laurence Perreault-Levasseur, Tracy Webb
PDF OpenReview
Assessing the Zero-Shot Capabilities of LLMs for Action Evaluation in RL Eduardo Pignatelli, Johan Ferret, Davide Paglieri, Samuel Coward, Tim Rocktäschel, Edward Grefenstette, Laura Toni
PDF OpenReview
AssistanceZero: Scalably Solving Assistance Games Cassidy Laidlaw, Eli Bronstein, Timothy Guo, Dylan Feng, Lukas Berglund, Justin Svegliato, Stuart Russell, Anca Dragan
PDF OpenReview
AstroPT: Scaling Large Observation Models for Astronomy Michael J. Smith, Ryan J. Roberts, Eirini Angeloudi, Marc Huertas-Company
PDF OpenReview
Asymptotic Dynamics for Delayed Feature Learning in a Toy Model Blake Bordelon, Tanishq Kumar, Samuel J. Gershman, Cengiz Pehlevan
PDF OpenReview
Asynchronous Local-SGD Training for Language Modeling Bo Liu, Rachita Chhaparia, Arthur Douillard, Satyen Kale, Andrei Alex Rusu, Jiajun Shen, Arthur Szlam, MarcAurelio Ranzato
PDF OpenReview
Asynchrony Invariance Loss Functions for Graph Neural Networks Pablo Monteagudo-Lago, Arielle Rosinski, Andrew Joseph Dudzik, Petar Veličković
PDF OpenReview
Attacking Large Language Models with Projected Gradient Descent Simon Geisler, Tom Wollschläger, M. H. I. Abdalla, Johannes Gasteiger, Stephan Günnemann
PDF OpenReview
Attention Is All You Need but You Don’t Need All of It for Inference of Large Language Models Georgy Tyukin, Gbetondji Jean-Sebastien Dovonon, Jean Kaddour, Pasquale Minervini
PDF OpenReview
Attention with Markov: A Curious Case of Single-Layer Transformers Ashok Vardhan Makkuva, Marco Bondaschi, Alliot Nagle, Adway Girish, Hyeji Kim, Martin Jaggi, Michael Gastpar
PDF OpenReview
Augmenting Evolutionary Models with Structure-Based Retrieval Yining Huang, Zuobai Zhang, Jian Tang, Debora Susan Marks, Pascal Notin
PDF OpenReview
AutoGuide: Automated Generation and Selection of Context-Aware Guidelines for Large Language Model Agents Yao Fu, Dong-Ki Kim, Jaekyeom Kim, Sungryull Sohn, Lajanugen Logeswaran, Kyunghoon Bae, Honglak Lee
PDF OpenReview
Automatic Domain Adaptation by Transformers in In-Context Learning Ryuichiro Hataya, Kota Matsui, Masaaki Imaizumi
PDF OpenReview
Automatic Jailbreaking of the Text-to-Image Generative AI Systems Minseon Kim, Hyomin Lee, Boqing Gong, Huishuai Zhang, Sung Ju Hwang
PDF OpenReview
Automatic Pseudo-Harmful Prompt Generation for Evaluating False Refusals in Large Language Models Bang An, Sicheng Zhu, Ruiyi Zhang, Michael-Andrei Panaitescu-Liess, Yuancheng Xu, Furong Huang
PDF OpenReview
Automatically Identifying Local and Global Circuits with Linear Computation Graphs Xuyang Ge, Fukang Zhu, Wentao Shu, Junxuan Wang, Zhengfu He, Xipeng Qiu
PDF OpenReview
Baba Is AI: Break the Rules to Beat the Benchmark Nathan Cloos, Meagan Jens, Michelangelo Naim, Yen-Ling Kuo, Ignacio Cases, Andrei Barbu, Christopher J Cueva
PDF OpenReview
Babysit a Language Model from Scratch: Interactive Language Learning by Trials and Demonstrations Ziqiao Ma, Zekun Wang, Joyce Chai
PDF OpenReview
BAM! Just like That: Simple and Efficient Parameter Upcycling for Mixture of Experts Qizhen Zhang, Nikolas Gritsch, Dwaraknath Gnaneshwar, Simon Guo, David Cairuz, Bharat Venkitesh, Jakob Nicolaus Foerster, Phil Blunsom, Sebastian Ruder, Ahmet Üstün, Acyr Locatelli
PDF OpenReview
Bandits with Abstention Under Expert Advice Stephen Pasteris, Alberto Rumi, Maximilian Thiessen, Shota Saito, Atsushi Miyauchi, Fabio Vitale, Mark Herbster
PDF OpenReview
Bandits with Preference Feedback: A Stackelberg Game Perspective Barna Pásztor, Parnian Kassraie, Andreas Krause
PDF OpenReview
Base-Change at Prediction: Inference-Time Update of Fine-Tuned Models Daiki Chijiwa, Taku Hasegawa, Kyosuke Nishida, Kuniko Saito, Susumu Takeuchi
PDF OpenReview
Batch Learning via Log-Sum-Exponential Estimator from Logged Bandit Feedback Armin Behnamnia, Gholamali Aminian, Alireza Aghaei, Chengchun Shi, Vincent Y. F. Tan, Hamid R. Rabiee
PDF OpenReview
Batch-Effect Invariant Graph Neural Networks for Predicting Chemotherapy Response in Triple-Negative Breast Cancer Patients Asif Khan, Giuseppe Torrisi, Luciana Luque, Claudia Owczarek, Maddy Parsons, Chris Sander, Linus Schumacher
PDF OpenReview
Batched Fixed-Confidence Pure Exploration for Bandits with Switching Constraints Newton Mwai, Milad Malekipirbazari, Fredrik D. Johansson
PDF OpenReview
Bayesian Optimization for the Discovery of Redox Active Quinones Giacomo De Gobbi, Reyhan Yagmur, Janine Maier, Stefan Spirk, Robert Peharz
PDF OpenReview
Bayesian Reward Models for LLM Alignment Adam X. Yang, Maxime Robeyns, Thomas Coste, Zhengyan Shi, Jun Wang, Haitham Bou Ammar, Laurence Aitchison
PDF OpenReview
Bayesian-LoRA: LoRA Based Parameter Efficient Fine-Tuning Using Optimal Quantization Levels and Rank Values Trough Differentiable Bayesian Gates Cristian Meo, Ksenia Sycheva, Anirudh Goyal, Justin Dauwels
PDF OpenReview
BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents That Solve Fuzzy Tasks Stephanie Milani, Anssi Kanervisto, Karolis Jucys, Sander V Schulhoff, Brandon Houghton, Rohin Shah
PDF OpenReview
Behavior Generation with Latent Actions Seungjae Lee, Yibin Wang, Haritheja Etukuru, H. Jin Kim, Nur Muhammad Mahi Shafiullah, Lerrel Pinto
PDF OpenReview
Behavioral Bias of Vision-Language Models: A Behavioral Finance View Yuhang Xiao, Yudilin, Ming-Chang Chiu
PDF OpenReview
BELLS: A Framework Towards Future Proof Benchmarks for the Evaluation of LLM Safeguards Diego Dorn, Alexandre Variengien, Charbel-Raphael Segerie, Vincent Corruble
PDF OpenReview
Benchmarking Autoregressive Conditional Diffusion Models for Turbulent Flow Simulation Georg Kohl, Liwei Chen, Nils Thuerey
PDF OpenReview
Benchmarking Mental State Representations in Language Models Matteo Bortoletto, Constantin Ruhdorfer, Lei Shi, Andreas Bulling
PDF OpenReview
Benchmarking Probabilistic Machine Learning in Protein FItness Landscape Predictions Ningning Chen, Wenkai Han, Sai T. Reddy
PDF OpenReview
Benchmarking Robust Self-Supervised Learning Across Diverse Downstream Tasks Antoni Kowalczuk, Jan Dubiński, Atiyeh Ashari Ghomi, Yi Sui, George Stein, Jiapeng Wu, Jesse C. Cresswell, Franziska Boenisch, Adam Dziedzic
PDF OpenReview
Benchmarking Uncertainty Disentanglement: Specialized Uncertainties for Specialized Tasks Bálint Mucsányi, Michael Kirchhof, Seong Joon Oh
PDF OpenReview
BenchMARL: Benchmarking Multi-Agent Reinforcement Learning Matteo Bettini, Amanda Prorok, Vincent Moens
PDF OpenReview
Beyond Model Collapse: Scaling up with Synthesized Data Requires Reinforcement Yunzhen Feng, Elvis Dohmatob, Pu Yang, Francois Charton, Julia Kempe
PDF OpenReview
Beyond Thumbs Up/Down: Untangling Challenges of Fine-Grained Feedback for Text-to-Image Generation Katherine M. Collins, Najoung Kim, Yonatan Bitton, Verena Rieser, Shayegan Omidshafiei, Yushi Hu, Sherol Chen, Senjuti Dutta, Minsuk Chang, Kimin Lee, Youwei Liang, Georgina Evans, Sahil Singla, Gang Li, Adrian Weller, Junfeng He, Deepak Ramachandran, Krishnamurthy Dj Dvijotham
PDF OpenReview
Bias Begets Bias: The Impact of Biased Embeddings on Diffusion Models Sahil Kuchlous, Marvin Li, Jeffrey George Wang
PDF OpenReview
Bias Transmission in Large Language Models: Evidence from Gender-Occupation Bias in GPT-4 Kirsten Morehouse, Weiwei Pan, Juan Manuel Contreras, Mahzarin R. Banaji
PDF OpenReview
Bias-Inducing Geometries: Exactly Solvable Data Model with Fairness Implications Stefano Sarao Mannelli, Federica Gerace, Negar Rostamzadeh, Luca Saglietti
PDF OpenReview
Bidirectional Consistency Models Liangchen Li, Jiajun He
PDF OpenReview
Bigger, Regularized, Optimistic: Scaling for Compute and Sample-Efficient Continuous Control Michal Nauman, Mateusz Ostaszewski, Krzysztof Jankowski, Piotr Miłoś, Marek Cygan
PDF OpenReview
Bilevel Optimization with Lower-Level Contextual MDPs Vinzenz Thoma, Barna Pásztor, Andreas Krause, Giorgia Ramponi, Yifan Hu
PDF OpenReview
Bilingual Adaptation of Monolingual Foundation Models Gurpreet Gosal, Yishi Xu, Gokulakrishnan Ramakrishnan, Rituraj Joshi, Avraham Sheinin, Zhiming Chen, Biswajit Mishra, Sunil Kumar Sahu, Neha Sengupta, Natalia Vassilieva, Joel Hestness, Samujjwal Ghosh, Bokang Jia, Onkar Arun Pandit, Satheesh Katipomu, Samta Kamboj, Rahul Pal, Parvez Mullah, Soundar Balaji Doraiswamy, Karim Chami, Preslav Nakov
PDF OpenReview
BioinformaticsBench: A Collaboratively Built Large Language Model Benchmark for Bioinformatics Reasoning Varuni Sarwal, Seungmo Lee, Rosemary He, Aingela Kattapuram, Xiaoxuan Wang, Eleazar Eskin, Wei Wang, Serghei Mangul
PDF OpenReview
BiPer: Binary Neural Networks Using a Periodic Function Edwin Vargas, Claudia V. Correa, Carlos Hinojosa, Henry Arguello
PDF OpenReview
Bisimulation Metrics Are Optimal Transport Distances, and Can Be Computed Efficiently Sergio Calo, Anders Jonsson, Gergely Neu, Ludovic Schwartz, Javier Segovia-Aguas
PDF OpenReview
Black-Box Detection of Language Model Watermarks Thibaud Gloaguen, Nikola Jovanović, Robin Staab, Martin Vechev
PDF OpenReview
Black-Box Detection of Language Model Watermarks Thibaud Gloaguen, Nikola Jovanović, Robin Staab, Martin Vechev
PDF OpenReview
Block Verification Accelerates Speculative Decoding Ziteng Sun, Uri Mendlovic, Yaniv Leviathan, Asaf Aharoni, Ahmad Beirami, Jae Hun Ro, Ananda Theertha Suresh
PDF OpenReview
BMapEst: Estimation of Brain Tissue Probability Maps Using a Differentiable MRI Simulator Utkarsh Gupta, Emmanouil Nikolakakis, Moritz Zaiss, Razvan Marinescu
PDF OpenReview
BOFormer: Learning to Solve Multi-Objective Bayesian Optimization via Non-Markovian RL Yu Heng Hung, Kai-Jie Lin, Yu-Heng Lin, Chien-Yi Wang, Ping-Chun Hsieh
PDF OpenReview
Boolean Logic for Low-Energy Deep Learning Van Minh Nguyen, Cristian Ocampo, Aymen Askri, Ba-Hien Tran
PDF OpenReview
Boost Your Crystal Model with Denoising Pre-Training Shuaike Shen, Ke Liu, Muzhi Zhu, Hao Chen
PDF OpenReview
Bootstrapping Language Models with DPO Implicit Rewards Changyu Chen, Zichen Liu, Chao Du, Tianyu Pang, Qian Liu, Arunesh Sinha, Pradeep Varakantham, Min Lin
PDF OpenReview
Borrowing Treasures from Neighbors: In-Context Learning for Multimodal Learning with Missing Modalities and Data Scarcity Zhuo Zhi, Ziquan Liu, Moe Elbadawi, Adam Daneshmend, Mine Orlu, Abdul W Basit, Andreas Demosthenous, Miguel R. D. Rodrigues
PDF OpenReview
Boundary Between Noise and Information Applied to Filtering Neural Network Weight Matrices Max Staats, Matthias Thamm, Bernd Rosenow
PDF OpenReview
BPNAS: Bayesian Progressive Neural Architecture Search Hyunwoong Chang, Anirban Samaddar, Sandeep Madireddy
PDF OpenReview
Bridging Distributional and Risk-Sensitive Reinforcement Learning: Balancing Statistical, Computational, and Risk Considerations Hao Liang
PDF OpenReview
Bridging Distributionally Robust Learning and Offline RL: An Approach to Mitigate Distribution Shift and Partial Data Coverage Kishan Panaganti, Zaiyan Xu, Dileep Kalathil, Mohammad Ghavamzadeh
PDF OpenReview
BUILD: Buffer-Free Incremental Learning with OOD Detection for the Wild Srishti Gupta, Daniele Angioni, Lea Schönherr, Ambra Demontis, Battista Biggio
PDF OpenReview
Bundle Neural Networks for Message Diffusion on Graphs Jacob Bamberger, Federico Barbero, Xiaowen Dong, Michael M. Bronstein
PDF OpenReview
CADO: Cost-Aware Diffusion Solvers for Combinatorial Optimization Through RL Fine-Tuning Deunsol Yoon, Hyungseok Song, Kanghoon Lee, Woohyung Lim
PDF OpenReview
Caduceus: Bi-Directional Equivariant Long-Range DNA Sequence Modeling Yair Schiff, Chia Hsiang Kao, Aaron Gokaslan, Tri Dao, Albert Gu, Volodymyr Kuleshov
PDF OpenReview
Caduceus: Bi-Directional Equivariant Long-Range DNA Sequence Modeling Yair Schiff, Chia Hsiang Kao, Aaron Gokaslan, Tri Dao, Albert Gu, Volodymyr Kuleshov
PDF OpenReview
Caduceus: Bi-Directional Equivariant Long-Range DNA Sequence Modeling Yair Schiff, Chia Hsiang Kao, Aaron Gokaslan, Tri Dao, Albert Gu, Volodymyr Kuleshov
PDF OpenReview
Calibrated Self-Rewarding Vision Language Models Yiyang Zhou, Zhiyuan Fan, Dongjie Cheng, Sihan Yang, Zhaorun Chen, Chenhang Cui, Xiyao Wang, Yun Li, Linjun Zhang, Huaxiu Yao
PDF OpenReview
CAMELoT: Towards Large Language Models with Training-Free Consolidated Associative Memory Zexue He, Leonid Karlinsky, Donghyun Kim, Julian McAuley, Dmitry Krotov, Rogerio Feris
PDF OpenReview
Can Editing LLMs Inject Harm? Canyu Chen, Baixiang Huang, Zekun Li, Zhaorun Chen, Shiyang Lai, Xiongxiao Xu, Jia-Chen Gu, Jindong Gu, Huaxiu Yao, Chaowei Xiao, Xifeng Yan, William Yang Wang, Philip Torr, Dawn Song, Kai Shu
PDF OpenReview
Can Editing LLMs Inject Harm? Canyu Chen, Baixiang Huang, Zekun Li, Zhaorun Chen, Shiyang Lai, Xiongxiao Xu, Jia-Chen Gu, Jindong Gu, Huaxiu Yao, Chaowei Xiao, Xifeng Yan, William Yang Wang, Philip Torr, Dawn Song, Kai Shu
PDF OpenReview
Can Go AIs Be Adversarially Robust? Tom Tseng, Euan McLean, Kellin Pelrine, Tony Tong Wang, Adam Gleave
PDF OpenReview
Can Language Models Safeguard Themselves, Instantly and for Free? Dyah Adila, Changho Shin, Yijing Zhang, Frederic Sala
PDF OpenReview
Can Large Language Models Explore In-Context? Akshay Krishnamurthy, Keegan Harris, Dylan J Foster, Cyril Zhang, Aleksandrs Slivkins
PDF OpenReview
Can Learned Optimization Make Reinforcement Learning Less Difficult? Alexander D. Goldie, Chris Lu, Matthew Thomas Jackson, Shimon Whiteson, Jakob Nicolaus Foerster
PDF OpenReview
Can LLMs Enhance Performance Prediction for Deep Learning Models? Karthick Panner Selvam, Phitchaya Mangpo Phothilimthana, Sami Abu-El-Haija, Bryan Perozzi, Mats Brorsson
PDF OpenReview
Can LLMs Predict the Convergence of Stochastic Gradient Descent? Oussama Zekri, Abdelhakim Benechehab, Ievgen Redko
PDF OpenReview
Can Mamba In-Context Learn Task Mixtures? Yingcong Li, Xupeng Wei, Haonan Zhao, Taigao Ma
PDF OpenReview
Can Models Learn Skill Composition from Examples? Haoyu Zhao, Simran Kaur, Dingli Yu, Anirudh Goyal, Sanjeev Arora
PDF OpenReview
Can Transformers Solve Least Squares to High Precision? Jerry Weihong Liu, Jessica Grogan, Owen M Dugan, Simran Arora, Atri Rudra, Christopher Re
PDF OpenReview
Can Transformers Solve Least Squares to High Precision? Jerry Weihong Liu, Jessica Grogan, Owen M Dugan, Simran Arora, Atri Rudra, Christopher Re
PDF OpenReview
Can Watermarking Large Language Models Prevent Copyrighted Text Generation and Hide Training Data? Michael-Andrei Panaitescu-Liess, Zora Che, Bang An, Yuancheng Xu, Pankayaraj Pathmanathan, Souradip Chakraborty, Sicheng Zhu, Tom Goldstein, Furong Huang
PDF OpenReview
CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models Peng Xia, Ze Chen, Juanxi Tian, Gong Yangrui, Ruibo Hou, Yue Xu, Zhenbang Wu, Zhiyuan Fan, Yiyang Zhou, Kangyu Zhu, Wenhao Zheng, Zhaoyang Wang, Xiao Wang, Xuchao Zhang, Chetan Bansal, Marc Niethammer, Junzhou Huang, Hongtu Zhu, Yun Li, Jimeng Sun, Zongyuan Ge, Gang Li, James Zou, Huaxiu Yao
PDF OpenReview
Cascade Reward Sampling for Efficient Decoding-Time Alignment Bolian Li, Yifan Wang, Ananth Grama, Ruqi Zhang
PDF OpenReview
Catastrophic Goodhart: Regularizing RLHF with KL Divergence Does Not Mitigate Heavy-Tailed Reward Misspecification Thomas Kwa, Drake Thomas, Adrià Garriga-Alonso
PDF OpenReview
Causal Bandits: The Pareto Optimal Frontier of Adaptivity, a Reduction to Linear Bandits, and Limitations Around Unknown Marginals Ziyi Liu, Idan Attias, Daniel M. Roy
PDF OpenReview
Causal Discovery over High-Dimensional Structured Hypothesis Spaces with Causal Graph Partitioning Ashka Shah, Adela Frances DePavia, Nathaniel C Hudson, Ian Foster, Rick Stevens
PDF OpenReview
CD-POS: Long Context Generalization in LLMs Through Continuous and Discrete Position Synthesis Zhiyuan Hu, Yuliang Liu, Jinman Zhao, Suyuchen Wang, WangYan, Wei Shen, Chao Yin, Bryan Hooi
PDF OpenReview
Cell Morphology-Guided Small Molecule Generation with GFlowNets Stephen Zhewen Lu, Ziqing Lu, Ehsan Hajiramezanali, Tommaso Biancalani, Yoshua Bengio, Gabriele Scalia, Michał Koziarski
PDF OpenReview
Cell Morphology-Guided Small Molecule Generation with GFlowNets Stephen Zhewen Lu, Ziqing Lu, Ehsan Hajiramezanali, Tommaso Biancalani, Yoshua Bengio, Gabriele Scalia, Michał Koziarski
PDF OpenReview
Cell Morphology-Guided Small Molecule Generation with GFlowNets Stephen Zhewen Lu, Ziqing Lu, Ehsan Hajiramezanali, Tommaso Biancalani, Yoshua Bengio, Gabriele Scalia, Michał Koziarski
PDF OpenReview
CellFlows: Inferring Splicing Kinetics from Latent and Mechanistic Cellular Dynamics Sei Chang, Zaiqian Chen, Bianca Dumitrascu, David A. Knowles
PDF OpenReview
Certifiably Robust RAG Against Retrieval Corruption Chong Xiang, Tong Wu, Zexuan Zhong, David Wagner, Danqi Chen, Prateek Mittal
PDF OpenReview
Certified Robustness in NLP Under Bounded Levenshtein Distance Elias Abad Rocamora, Grigorios Chrysos, Volkan Cevher
PDF OpenReview
Certifying Robustness to Adaptive Data Poisoning Avinandan Bose, Madeleine Udell, Laurent Lessard, Maryam Fazel, Krishnamurthy Dj Dvijotham
PDF OpenReview
CGMTorch: A Framework for Gradient-Based Design of Computational Granular Metamaterials Atoosa Parsa, Corey OHern, Rebecca Kramer-Bottiglio, Josh Bongard
PDF OpenReview
Chain of LoRA: Efficient Fine-Tuning of Language Models via Residual Learning Wenhan Xia, Chengwei Qin, Elad Hazan
PDF OpenReview
Chained Information-Theoretic Bounds and Tight Regret Rate for Linear Bandit Problems Amaury Gouverneur, Borja Rodríguez Gálvez, Tobias Oechtering, Mikael Skoglund
PDF OpenReview
Chained Tuning Leads to Biased Forgetting Megan Ung, Alicia Yi Sun, Samuel Bell, Levent Sagun, Adina Williams
PDF OpenReview
Chained Tuning Leads to Biased Forgetting Megan Ung, Alicia Yi Sun, Samuel Bell, Levent Sagun, Adina Williams
PDF OpenReview
Challenges in Mechanistically Interpreting Model Representations Satvik Golechha, James Dao
PDF OpenReview
Characterizing Prompt Compression Methods for Long Context Inference Siddharth Jha, Lutfi Eren Erdogan, Sehoon Kim, Kurt Keutzer, Amir Gholami
PDF OpenReview
CharED: Character-Wise Ensemble Decoding for Large Language Models Kevin Gu, Eva Tuecke, Dmitriy A Katz, Raya Horesh, David Alvarez-Melis, Mikhail Yurochkin
PDF OpenReview
Chemical Language Modeling with Structured State Spaces Rıza Özçelik, Sarah de Ruiter, Emanuele Criscuolo, Francesca Grisoni
PDF OpenReview
CLAM: Unifying Finetuning, Quantization, and Pruning by Chaining LLM Adapter Modules Neelay Velingker, Jason Liu, Amish Sethi, William Dodds, Zhiqiu Xu, Saikat Dutta, Mayur Naik, Eric Wong
PDF OpenReview
Class-Aware Initialization of Early Exits for Pre-Training Large Language Models Alperen Gormez, Erdem Koyuncu
PDF OpenReview
Classification of Freshwater Snails of the Genus Radomaniola with Multimodal Triplet Networks Dennis Vetter, Muhammad Ahsan, Diana Delicado, Thomas A. Neubauer, Thomas Wilke, Gemma Roig
PDF OpenReview
Closed Form of the Hessian Spectrum for Some Neural Networks Sidak Pal Singh, Thomas Hofmann
PDF OpenReview
Closed-Form Test Functions for Biophysical Sequence Optimization Algorithms Samuel Don Stanton, Robert G Alberstein, Nathan C. Frey, Andrew Martin Watkins, Kyunghyun Cho
PDF OpenReview
Cluster-Norm for Unsupervised Probing of Knowledge Walter Laurito, Sharan Maiya, Grégoire Dhimoïla, Owen Ho Wan Yeung, Kaarel Hänni
PDF OpenReview
CO2: Precise Attention Score Observation for Improving KV Cache Replacement in Large Language Model Meguru Yamazaki, Shivaram Venkataraman
PDF OpenReview
Coarse-to-Fine Semi-Structured Pruning of Graph Convolutional Networks for Skeleton-Based Recognition Hichem Sahbi
PDF OpenReview
Code Agents Are State of the Art Software Testers Niels Mündler, Mark Niklas Mueller, Jingxuan He, Martin Vechev
PDF OpenReview
Code Agents Are State of the Art Software Testers Niels Mündler, Mark Niklas Mueller, Jingxuan He, Martin Vechev
PDF OpenReview
CodonMPNN for Organism Specific and Codon Optimal Inverse Folding Hannes Stark, Umesh Padia, Julia Balla, Cameron Diao
PDF OpenReview
CodonMPNN for Organism Specific and Codon Optimal Inverse Folding Hannes Stark, Umesh Padia, Julia Balla, Cameron Diao
PDF OpenReview
CogErgLLM: Exploring Large Language Model Systems Design Perspective Using Cognitive Ergonomics Azmine Toushik Wasi
PDF OpenReview
Cognitive Assessment of Language Models Daniel McDuff, David Munday, Xin Liu, Isaac Galatzer-Levy
PDF OpenReview
Cognitive Flexibility of Large Language Models Sean M Kennedy, Robert D Nowak
PDF OpenReview
Cognitive Modeling with Scaffolded LLMs: A Case Study of Referential Expression Generation Polina Tsvilodub, Michael Franke, Fausto Carcassi
PDF OpenReview
Collaborative Learning Under Strategic Behavior: Mechanisms for Eliciting Feedback in Principal-Agent Bandit Games Ramakrishnan K, Arpit Agarwal, Lakshminarayanan Subramanian, Maximilian Nickel
PDF OpenReview
Collective Variable Free Transition Path Sampling with Generative Flow Network Kiyoung Seong, Seonghyun Park, Seonghwan Kim, Woo Youn Kim, Sungsoo Ahn
PDF OpenReview
Collusion of Reinforcement Learning-Based Pricing Algorithms in Episodic Markets Paul Friedrich, Barna Pásztor, Giorgia Ramponi
PDF OpenReview
Color Style Transfer with Modulated Flows Maria Larchenko, Alexander Lobashev, Dmitry Guskov, Vladimir Vladimirovich Palyulin
PDF OpenReview
Combining Graph Attention and Recurrent Neural Networks in a Variational Autoencoder for Molecular Representation Learning and Drug Design Alex T. Müller, Kenneth Atz, Michael Reutlinger, Nicolas Zorn
PDF OpenReview
Combining Neural Networks and Symbolic Regression for Analytical Lyapunov Function Discovery Jie Feng, Haohan Zou, Yuanyuan Shi
PDF OpenReview
Combining Pre-Trained LoRA Modules Improves Few-Shot Adaptation of Foundation Models to New Tasks Nader Asadi, Mahdi Beitollahi, Yasser H. Khalil, Yinchuan Li, Guojun Zhang, Xi Chen
PDF OpenReview
Combining Reconstruction and Contrastive Methods for Multimodal Representations in RL Philipp Becker, Sebastian Mossburger, Fabian Otto, Gerhard Neumann
PDF OpenReview
Comgra: A Tool for Analyzing and Debugging Neural Networks Florian Dietz, Sophie Fellenz, Dietrich Klakow, Marius Kloft
PDF OpenReview
Communication Efficient Federated Learning with Differentiated Aggregation Peyman Gholami, Hulya Seferoglu
PDF OpenReview
Commute-Time-Optimised Graphs for GNNs Igor Sterner, Shiye Su, Petar Veličković
PDF OpenReview
Compact Proofs of Model Performance via Mechanistic Interpretability Jason Gross, Rajashree Agrawal, Thomas Kwa, Euan Ong, Chun Hei Yip, Alex Gibson, Soufiane Noubir, Lawrence Chan
PDF OpenReview
Comparing Bad Apples to Good Oranges: Aligning Large Language Models via Joint Preference Optimization Hritik Bansal, Ashima Suvarna, Gantavya Bhatt, Nanyun Peng, Kai-Wei Chang, Aditya Grover
PDF OpenReview
Comparing Comparisons: Informative and Easy Human Feedback with Distinguishability Queries Xuening Feng, Zhaohui Jiang, Timo Kaufmann, Eyke Hüllermeier, Paul Weng, Yifei Zhu
PDF OpenReview
Compatible Gradient Approximations for Actor-Critic Algorithms Baturay Saglam, Dionysis Kalogerias
PDF OpenReview
CompeteAI: Understanding the Competition Dynamics of Large Language Model-Based Agents Qinlin Zhao, Jindong Wang, Yixuan Zhang, Yiqiao Jin, Kaijie Zhu, Hao Chen, Xing Xie
PDF OpenReview
Composable Contracts for Multi-Agent Coordination Christy Chen, Louis Parker
PDF OpenReview
Compositional Communication with LLMs and Reasoning About Chemical Structures Dmitry Zubarev, Sarathkrishna Swaminathan
PDF OpenReview
Compress Then Serve: Serving Thousands of LoRA Adapters with Little Overhead Rickard Brüel Gabrielsson, Jiacheng Zhu, Onkar Bhardwaj, Leshem Choshen, Kristjan Greenewald, Mikhail Yurochkin, Justin Solomon
PDF OpenReview
Compressing the Latent Space of Single-Sequence Protein Predictors for Multimodal Generation Amy X. Lu, Wilson Yan, Vladimir Gligorijevic, Pieter Abbeel, Kevin K Yang, Nathan C. Frey
PDF OpenReview
Concept-Based Interpretable Reinforcement Learning with Limited to No Human Labels Zhuorui Ye, Stephanie Milani, Fei Fang, Geoffrey J. Gordon
PDF OpenReview
Concept-Based Interpretable Reinforcement Learning with Limited to No Human Labels Zhuorui Ye, Stephanie Milani, Fei Fang, Geoff Gordon
PDF OpenReview
Conditional Common Entropy for Instrumental Variable Testing and Partial Identification Ziwei Jiang, Murat Kocaoglu
PDF OpenReview
Conditional Flow Matching for Time Series Modelling Ella Tamir, Najwa Laabid, Markus Heinonen, Vikas Garg, Arno Solin
PDF OpenReview
Conditional Generative Models Are Sufficient to Sample from Any Causal Effect Estimand Md Musfiqur Rahman, Matt Jordan, Murat Kocaoglu
PDF OpenReview
Conditional Meta-Reinforcement Learning with State Representation Yuxuan Sun, Laura Toni, Yiannis Andreopoulos
PDF OpenReview
Confidence Regulation Neurons in Language Models Alessandro Stolfo, Ben Peng Wu, Wes Gurnee, Yonatan Belinkov, Xingyi Song, Mrinmaya Sachan, Neel Nanda
PDF OpenReview
Conformalized Credal Set Predictors Alireza Javanmardi, David Stutz, Eyke Hüllermeier
PDF OpenReview
Consistency Checks for Language Model Forecasters Abhimanyu Pallavi Sudhir, Alejandro Alvarez, Adam Shen, Daniel Paleka
PDF OpenReview
Consistency Checks for Language Model Forecasters Abhimanyu Pallavi Sudhir, Alejandro Alvarez, Adam Shen, Daniel Paleka
PDF OpenReview
Consistency Models with Learned Idempotent Boundary Conditions Gianluigi Silvestri, Luca Ambrogioni
PDF OpenReview
Consistent Validation for Predictive Methods in Spatial Settings David R. Burt, Yunyi Shen, Tamara Broderick
PDF OpenReview
Constructing Artificial Life and Materials Scientists with Accelerated AI Using Deep AndersoNN Saleem Abdul Fattah Ahmed Al Dajani, David Keyes
PDF OpenReview
Constructing Gauge-Invariant Neural Networks for Scientific Applications Manos Theodosis, Demba E. Ba, Nima Dehmamy
PDF OpenReview
Constructing Gauge-Invariant Neural Networks for Scientific Applications Manos Theodosis, Demba E. Ba, Nima Dehmamy
PDF OpenReview
ContextCite: Attributing Model Generation to Context Benjamin Cohen-Wang, Harshay Shah, Kristian Georgiev, Aleksander Madry
PDF OpenReview
ContextCite: Attributing Model Generation to Context Benjamin Cohen-Wang, Harshay Shah, Kristian Georgiev, Aleksander Madry
PDF OpenReview
Contextualized Hybrid Ensemble Q-Learning: Learning Fast with Control Priors Emma Cramer, Bernd Frauenknecht, Ramil Sabirov, Sebastian Trimpe
PDF OpenReview
Continual Deep Learning on the Edge via Stochastic Local Competition Among Subnetworks Theodoros Christophides, Kyriakos Tolias, Sotirios Chatzis
PDF OpenReview
Contrastive Sparse Autoencoders for Interpreting Planning of Chess-Playing Agents Yoann Poupart
PDF OpenReview
Controlling Large Language Model Agents with Entropic Activation Steering Nate Rahn, Pierluca D'Oro, Marc G Bellemare
PDF OpenReview
CoordConformer: Heterogenous EEG Datasets Decoding Using Transformers Sharat Patil, Robin Tibor Schirrmeister, Frank Hutter, Tonio Ball
PDF OpenReview
Coordination Failure in Cooperative Offline MARL Callum Rhys Tilbury, Juan Claude Formanek, Louise Beyers, Jonathan Phillip Shock, Arnu Pretorius
PDF OpenReview
Correlated Noise in Epoch-Based Stochastic Gradient Descent: Implications for Weight Variances Marcel Kühn, Bernd Rosenow
PDF OpenReview
CoSy: Evaluating Textual Explanations of Neurons Laura Kopf, Philine Lou Bommer, Anna Hedström, Sebastian Lapuschkin, Marina MC Höhne, Kirill Bykov
PDF OpenReview
CoSy: Evaluating Textual Explanations of Neurons Laura Kopf, Philine Lou Bommer, Anna Hedström, Sebastian Lapuschkin, Marina MC Höhne, Kirill Bykov
PDF OpenReview
CPeSFA: Empowering SFs for Policy Learning and Transfer in Continuous Action Spaces Yining Li, Tianpei Yang, Wei Guo, Jianye Hao, Yan Zheng
PDF OpenReview
Crafting Large Language Models for Enhanced Interpretability Chung-En Sun, Tuomas Oikarinen, Tsui-Wei Weng
PDF OpenReview
Cramming Protein Language Model Training in 24 GPU Hours Nathan C. Frey, Taylor Joren, Aya Abdelsalam Ismail, Allen Goodman, Richard Bonneau, Kyunghyun Cho, Vladimir Gligorijevic
PDF OpenReview
Cross-Domain Knowledge Transfer for RL via Preference Consistency Ting-Hsuan Huang, Ping-Chun Hsieh
PDF OpenReview
Cross-Lingual QA: A Key to Unlocking In-Context Cross-Lingual Performance Sunkyoung Kim, Dayeon Ki, Yireun Kim, Jinsik Lee
PDF OpenReview
Cross-Modality Matching and Prediction of Perturbation Responses with Labeled Gromov-Wasserstein Optimal Transport Jayoung Ryu, Romain Lopez, Charlotte Bunne, Luca Pinello, Aviv Regev
PDF OpenReview
Cross-Modality Matching and Prediction of Perturbation Responses with Labeled Gromov-Wasserstein Optimal Transport Jayoung Ryu, Romain Lopez, Charlotte Bunne, Luca Pinello, Aviv Regev
PDF OpenReview
DACO: Towards Application-Driven and Comprehensive Data Analysis via Code Generation Xueqing Wu, Rui Zheng, Jingzhen Sha, Te-Lin Wu, Hanyu Zhou, Tang Mohan, Kai-Wei Chang, Nanyun Peng, Haoran Huang
PDF OpenReview
DARE: The Deep Adaptive Regulator for Control of Uncertain Continuous-Time Systems Harrison Waldon, Fayçal Drissi, Yannick Limmer, Uljad Berdica, Jakob Nicolaus Foerster, Alvaro Cartea
PDF OpenReview
DASH: Warm-Starting Neural Network Training Without Loss of Plasticity Under Stationarity Baekrok Shin, Junsoo Oh, Hanseul Cho, Chulhee Yun
PDF OpenReview
Data as a Consumable Resource Dar Gilboa, Siddhartha Jain, Jarrod Ryan McClean
PDF OpenReview
Data Mixture Inference: What Do BPE Tokenizers Reveal About Their Training Data? Jonathan Hayase, Alisa Liu, Yejin Choi, Sewoong Oh, Noah A. Smith
PDF OpenReview
Deciphering the Definition of Adversarial Robustness for Post-Hoc OOD Detectors Peter Lorenz, Mario Ruben Fernandez, Jens Müller, Ullrich Koethe
PDF OpenReview
DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning Jianxiong Li, Jinliang Zheng, Yinan Zheng, Liyuan Mao, Xiao Hu, Sijie Cheng, Haoyi Niu, Jihao Liu, Yu Liu, Jingjing Liu, Ya-Qin Zhang, Xianyuan Zhan
PDF OpenReview
Decoder Ensembling for Learned Latent Geometries Stas Syrota, Pablo Moreno-Muñoz, Søren Hauberg
PDF OpenReview
Decoding Chemical Predictions: Group Contribution Methods for XAI Gabriel Cathoud, Vignesh Ram Somnath, Luis Macedo, Kjell Jorner
PDF OpenReview
Decoding-Time Language Model Alignment with Multiple Objectives Ruizhe Shi, Yifang Chen, Yushi Hu, Alisa Liu, Hannaneh Hajishirzi, Noah A. Smith, Simon Shaolei Du
PDF OpenReview
Decomposed Evaluations of Geographic Disparities in Text-to-Image Models Abhishek Sureddy, Dishant Padalia, Nandhinee Periyakaruppan, Oindrila Saha, Adina Williams, Adriana Romero-Soriano, Megan Richards, Polina Kirichenko, Melissa Hall
PDF OpenReview
Decomposed Evaluations of Geographic Disparities in Text-to-Image Models Abhishek Sureddy, Dishant Padalia, Nandhinee Periyakaruppan, Oindrila Saha, Adina Williams, Adriana Romero-Soriano, Megan Richards, Polina Kirichenko, Melissa Hall
PDF OpenReview
Decomposed Linear Dynamical Systems (dLDS) for Identifying the Latent Dynamics Underlying High-Dimensional Time-Series Noga Mudrik, Yenho Chen, Eva Yezerets, Christopher John Rozell, Adam Shabti Charles
PDF OpenReview
Decomposing and Editing Predictions by Modeling Model Computation Harshay Shah, Andrew Ilyas, Aleksander Madry
PDF OpenReview
Decomposing and Interpreting Image Representations via Text in ViTs Beyond CLIP Sriram Balasubramanian, Samyadeep Basu, Soheil Feizi
PDF OpenReview
Decoupled Differentiable Neural Architecture Search: Memory-Efficient Differentiable NAS via Disentangled Search Space Libin Hou
PDF OpenReview
Decoupled Stochastic Gradient Descent for N-Player Games Ali Zindari, Parham Yazdkhasti, Tatjana Chavdarova, Sebastian U Stich
PDF OpenReview
Deep Content Understanding Toward Entity and Aspect Target Sentiment Analysis on Foundation Models Vorakit Vorakitphan, Milos Basic, Guilhaume Leroy Meline
PDF OpenReview
Deep Learning for Protein-Ligand Docking: Are We There yet? Alex Morehead, Nabin Giri, Jian Liu, Jianlin Cheng
PDF OpenReview
Deep Networks Always Grok and Here Is Why Ahmed Imtiaz Humayun, Randall Balestriero, Richard Baraniuk
PDF OpenReview
Deep Reinforcement Learning for Equilibrium Computation in Multi-Stage Auctions and Contests Fabian Raoul Pieroth, Nils Kohring, Martin Bichler
PDF OpenReview
Deep Supramolecular Language Processing for Co-Crystal Prediction Rebecca Birolo, Rıza Özçelik, Andrea Aramini, Michele R. Chierotti, Roberto Gobetto, Francesca Grisoni
PDF OpenReview
DeePC-Hunt: Data-Enabled Predictive Control Hyperparameter Tuning via Differentiable Optimization Michael Cummins, Alberto Padoan, Keith Moffat, John Lygeros, Florian Dorfler
PDF OpenReview
Defending Against Unknown Corrupted Agents: Reinforcement Learning of Adversarially Robust Nash Equilibria Andi Nika, Jonathan Nöther, Adish Singla, Goran Radanovic
PDF OpenReview
Delay Embedding Theory of Neural Sequence Models Mitchell Ostrow, Adam Joseph Eisen, Ila R Fiete
PDF OpenReview
Delayed Adversarial Attacks on Stochastic Multi-Armed Bandits Pierriccardo Olivieri, Matteo Castiglioni, Nicola Gatti
PDF OpenReview
Demonstrations in In-Context Learning for LLMs with Large Label Space Zhan Li, Fanghui Liu, Volkan Cevher, Grigorios Chrysos
PDF OpenReview
Demystifying Amortized Causal Discovery with Transformers Francesco Montagna, Max Cairney-Leeming, Dhanya Sridhar, Francesco Locatello
PDF OpenReview
Denoising Diffusion Variational Inference: Diffusion Models as Expressive Variational Posteriors Wasu Top Piriyakulkij, Yingheng Wang, Volodymyr Kuleshov
PDF OpenReview
Describe-and-Dissect: Interpreting Neurons in Vision Networks with Language Models Nicholas Bai, Rahul Ajay Iyer, Tuomas Oikarinen, Tsui-Wei Weng
PDF OpenReview
DETAIL: Task DEmonsTration Attribution for Interpretable In-Context Learning Zijian Zhou, Xiaoqiang Lin, Xinyi Xu, Alok Prakash, Daniela Rus, Bryan Kian Hsiang Low
PDF OpenReview
Detecting Critical Treatment Effect Bias in Small Subgroups Piersilvio De Bartolomeis, Javier Abad, Konstantin Donhauser, Fanny Yang
PDF OpenReview
Detrimental Memories in Transfer Learning Amal Alnouri, Timothy J Wroge, Bilal Alsallakh
PDF OpenReview
Diagnosing and Fixing Common Problems in Bayesian Optimization for Molecule Design Austin Tripp, José Miguel Hernández-Lobato
PDF OpenReview
Differentiable Approximations of Fair OWA Optimization My H Dinh, James Kotary, Ferdinando Fioretto
PDF OpenReview
Differentiable Cluster Graph Neural Network Yanfei Dong, Mohammed Haroon Dupty, Lambert Deng, Zhuanghua Liu, Yong Liang Goh, Wee Sun Lee
PDF OpenReview
Differentiable Cost-Parameterized Monge mAP Estimators Samuel Howard, George Deligiannidis, Patrick Rebeschini, James Thornton
PDF OpenReview
Differentiable Iterated Function Systems Cory Braker Scott
PDF OpenReview
Differentiable Local Intrinsic Dimension Estimation with Diffusion Models Hamidreza Kamkari, Brendan Leigh Ross, Rasa Hosseinzadeh, Jesse C. Cresswell, Gabriel Loaiza-Ganem
PDF OpenReview
Differentiable Mapper for Topological Optimization of Data Representation Ziyad Oulhaj, Mathieu Carrière, Bertrand Michel
PDF OpenReview
Differentiable Short-Time Fourier Transform: A Time-Frequency Layer with Learnable Parameters Maxime Leiber, Yosra Marnissi, Axel Barrau
PDF OpenReview
Differentiable Soft Min-Max Loss to Restrict Weight Range for Model Quantization Arnav Kundu, Chungkuk Yoo, Minsik Cho, Saurabh Adya
PDF OpenReview
Differentiable Weighted Automata Anand Balakrishnan, Jyotirmoy V. Deshmukh
PDF OpenReview
Differentiable Wireless Simulation with Geometric Transformers Thomas Hehn, Markus Peschl, Tribhuvanesh Orekondy, Arash Behboodi, Johann Brehmer
PDF OpenReview
DiffFit: Differentiable Fitting of Molecule Structures to a Cryo-EM mAP Deng Luo, Zainab Alsuwaykit, Dawar Khan, Ondrej Strnad, Tobias Isenberg, Ivan Viola
PDF OpenReview
Diffusion Domain Expansion: Learning to Coordinate Pre-Trained Diffusion Models Egor Lifar, Semyon Savkin, Timur Garipov, Shangyuan Tong, Tommi Jaakkola
PDF OpenReview
Diffusion Models with Group Equivariance Haoye Lu, Spencer Szabados, Yaoliang Yu
PDF OpenReview
Diffusion-Based Episodes Augmentation for Offline Multi-Agent Reinforcement Learning Jihwan Oh, Sungnyun Kim, Gahee Kim, SeongHwan Kim, Se-Young Yun
PDF OpenReview
DiffusionBlend: Learning 3D Image Prior Through Position-Aware Diffusion Score Blending for 3D Computed Tomography Reconstruction Bowen Song, Jason Hu, Zhaoxu Luo, Jeffrey A Fessler, Liyue Shen
PDF OpenReview
DiffusionGuard: A Robust Defense Against Malicious Diffusion-Based Image Editing June Suk Choi, Kyungmin Lee, Jongheon Jeong, Saining Xie, Jinwoo Shin, Kimin Lee
PDF OpenReview
DiffusionPDE: Generative PDE-Solving Under Partial Observation Jiahe Huang, Guandao Yang, Zichen Wang, Jeong Joon Park
PDF OpenReview
DigiRL: Training In-the-Wild Device-Control Agents with Autonomous Reinforcement Learning Hao Bai, Yifei Zhou, Mert Cemri, Jiayi Pan, Alane Suhr, Sergey Levine, Aviral Kumar
PDF OpenReview
DigiRL: Training In-the-Wild Device-Control Agents with Autonomous Reinforcement Learning Hao Bai, Yifei Zhou, Mert Cemri, Jiayi Pan, Alane Suhr, Sergey Levine, Aviral Kumar
PDF OpenReview
DigiRL: Training In-the-Wild Device-Control Agents with Autonomous Reinforcement Learning Yifei Zhou, Hao Bai, Mert Cemri, Jiayi Pan, Alane Suhr, Sergey Levine, Aviral Kumar
PDF OpenReview
DiLoCo: Distributed Low-Communication Training of Language Models Arthur Douillard, Qixuan Feng, Andrei Alex Rusu, Rachita Chhaparia, Yani Donchev, Adhiguna Kuncoro, MarcAurelio Ranzato, Arthur Szlam, Jiajun Shen
PDF OpenReview
DiMViS: Diffusion-Based Multi-View Synthesis Giuseppe Di Giacomo, Giulio Franzese, Tania Cerquitelli, Carla Fabiana Chiasserini, Pietro Michiardi
PDF OpenReview
Dirac--Bianconi Graph Neural Networks - Enabling Long-Range Graph Predictions Christian Nauck, Rohan Gorantla, Michael Lindner, Konstantin Schürholt, Antonia S J S Mey, Frank Hellmann
PDF OpenReview
Discovering Preference Optimization Algorithms with and for Large Language Models Chris Lu, Samuel Holt, Claudio Fanconi, Alex James Chan, Jakob Nicolaus Foerster, Mihaela van der Schaar, Robert Tjarko Lange
PDF OpenReview
Discrete Diffusion Posterior Sampling for Protein Design Mert Cemri, Ajil Jalal, Kannan Ramchandran
PDF OpenReview
Disentangled Representation Learning Through Geometry Preservation with the Gromov-Monge Gap Théo Uscidda, Luca Eyring, Karsten Roth, Fabian J Theis, Zeynep Akata, Marco Cuturi
PDF OpenReview
Disjoint Processing Mechanisms of Hierarchical and Linear Grammars in Large Language Models Aruna Sankaranarayanan, Dylan Hadfield-Menell, Aaron Mueller
PDF OpenReview
Dissecting Query-Key Interaction in Vision Transformers Xu Pan, Aaron Philip, Ziqian Xie, Odelia Schwartz
PDF OpenReview
DistilDIRE: A Small, Fast, Cheap and Lightweight Diffusion Synthesized Deepfake Detection Yewon Lim, Changyeon Lee, Aerin Kim, Oren Etzioni
PDF OpenReview
Distillation Based Robustness Verification with PAC Guarantees Patrick Indri, Peter Blohm, Anagha Athavale, Ezio Bartocci, Georg Weissenbacher, Matteo Maffei, Dejan Nickovic, Thomas Gärtner, Sagar Malhotra
PDF OpenReview
Distilling LLMs’ Decomposition Abilities into Compact Language Models Denis Tarasov, Kumar Shridhar
PDF OpenReview
Distilling LLMs’ Decomposition Abilities into Compact Language Models Denis Tarasov, Kumar Shridhar
PDF OpenReview
Distributional Monte-Carlo Planning with Thompson Sampling in Stochastic Environments Tuan Quang Dam, Brahim Driss, Odalric-Ambrym Maillard
PDF OpenReview
Distributional Preference Alignment of LLMs via Optimal Transport Igor Melnyk, Youssef Mroueh, Brian Belgodere, Mattia Rigotti, Apoorva Nitsure, Mikhail Yurochkin, Kristjan Greenewald, Jiri Navratil, Jarret Ross
PDF OpenReview
Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal Algorithm Miao Lu, Han Zhong, Tong Zhang, Jose Blanchet
PDF OpenReview
DiveR-CT: Diversity-Enhanced Red Teaming with Relaxing Constraints Andrew Zhao, Quentin Xu, Matthieu Lin, Shenzhi Wang, Yong-jin Liu, Zilong Zheng, Gao Huang
PDF OpenReview
Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation Idan Schwartz, Guy Yariv, Itai Gat, Yossi Adi, Sagie Benaim, Lior Wolf
PDF OpenReview
Do LLM Agents Have Regret? a Case Study in Online Learning and Games Chanwoo Park, Xiangyu Liu, Asuman E. Ozdaglar, Kaiqing Zhang
PDF OpenReview
Do LLM Agents Have Regret? a Case Study in Online Learning and Games Chanwoo Park, Xiangyu Liu, Asuman E. Ozdaglar, Kaiqing Zhang
PDF OpenReview
Do LLMs Dream of Elephants (when Told Not to)? Latent Concept Association and Associative Memory in Transformers Yibo Jiang, Goutham Rajendran, Pradeep Kumar Ravikumar, Bryon Aragam
PDF OpenReview
Do LLMs Dream of Elephants (when Told Not to)? Latent Concept Association and Associative Memory in Transformers Yibo Jiang, Goutham Rajendran, Pradeep Kumar Ravikumar, Bryon Aragam
PDF OpenReview
Do Parameters Reveal More than Loss for Membership Inference? Anshuman Suri, Xiao Zhang, David Evans
PDF OpenReview
DocParseNet: Advanced Semantic Segmentation and OCR Embeddings for Efficient Scanned Document Annotation Ahmad Mohammadshirazi, Ali Nosratifiroozsalari, Mengxi Zhou, Dheeraj Kulshrestha, Rajiv Ramnath
PDF OpenReview
Does Editing Provide Evidence for Localization? Zihao Wang, Victor Veitch
PDF OpenReview
Does SGD Really Happen in Tiny Subspaces? Minhak Song, Kwangjun Ahn, Chulhee Yun
PDF OpenReview
Does Your Data Spark Joy? Performance Gains from Domain Upsampling at the End of Training Cody Blakeney, Mansheej Paul, Brett W. Larsen, Sean Owen, Jonathan Frankle
PDF OpenReview
Domain-Aware Fine-Tuning of Foundation Models Uğur Ali Kaplan, Yumeng Li, Margret Keuper, Anna Khoreva, Dan Zhang
PDF OpenReview
Doob's Lagrangian: A Sample-Efficient Variational Approach to Transition Path Sampling Yuanqi Du, Michael Plainer, Rob Brekelmans, Chenru Duan, Frank Noe, Carla P Gomes, Alan Aspuru-Guzik, Kirill Neklyudov
PDF OpenReview
Doob's Lagrangian: A Sample-Efficient Variational Approach to Transition Path Sampling Yuanqi Du, Michael Plainer, Rob Brekelmans, Chenru Duan, Frank Noe, Carla P Gomes, Alan Aspuru-Guzik, Kirill Neklyudov
PDF OpenReview
Doob's Lagrangian: A Sample-Efficient Variational Approach to Transition Path Sampling Yuanqi Du, Michael Plainer, Rob Brekelmans, Chenru Duan, Frank Noe, Carla P Gomes, Alan Aspuru-Guzik, Kirill Neklyudov
PDF OpenReview
DPM: Dual Preferences-Based Multi-Agent Reinforcement Learning Sehyeok Kang, Yongsik Lee, Se-Young Yun
PDF OpenReview
DPO Meets PPO: Reinforced Token Optimization for RLHF Han Zhong, Guhao Feng, Wei Xiong, Xinle Cheng, Li Zhao, Di He, Jiang Bian, Liwei Wang
PDF OpenReview
DPO-Finetuned Large Multi-Modal Planner with Retrieval-Augmented Generation @ EgoPlan Challenge ICML 2024 Kwanghyeon Lee, Mina Kang, Hyungho Na, HeeSun Bae, Byeonghu Na, Doyun Kwon, Seungjae Shin, Yeongmin Kim, Kim Taewoo, Seungmin Yun, Il-chul Moon
PDF OpenReview
DrJAX: Scalable and Differentiable MapReduce Primitives in JAX J Keith Rush, Zachary Charles, Zachary Garrett, Sean Augenstein, Nicole Elyse Mitchell
PDF OpenReview
Dual Approximation Policy Optimization Zhihan Xiong, Maryam Fazel, Lin Xiao
PDF OpenReview
Dual Risk Minimization for Robust Fine-Tuning of Zero-Shot Models Kaican Li, Weiyan Xie, Ricardo Silva, Nevin L. Zhang
PDF OpenReview
DualBind: A Dual-Loss Framework for Protein-Ligand Binding Affinity Prediction Meng Liu, Saee Gopal Paliwal
PDF OpenReview
DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning Anthony Liang, Guy Tennenholtz, ChihWei Hsu, Yinlam Chow, Erdem Biyik, Craig Boutilier
PDF OpenReview
E-ProTran: Efficient Probabilistic Transformers for Forecasting Batuhan Koyuncu, Tim Nico Bauerschmidt, Isabel Valera
PDF OpenReview
E(n) Equivariant Message Passing Cellular Networks Veljko Kovac, Erik J Bekkers, Pietro Lio, Floor Eijkelboom
PDF OpenReview
Early Period of Training Impacts Out-of-Distribution Generalization Chen Cecilia Liu, Iryna Gurevych
PDF OpenReview
EBBS: An Ensemble with Bi-Level Beam Search for Zero-Shot Machine Translation Yuqiao Wen, Behzad Shayegh, Chenyang Huang, Yanshuai Cao, Lili Mou
PDF OpenReview
ECO: Efficient Computational Optimization for Exact Machine Unlearning in Deep Neural Networks Yu-Ting Huang, Pei-Yuan Wu, Chuan-Ju Wang
PDF OpenReview
EEG2TEXT: Open Vocabulary EEG-to-Text Decoding with EEG Pre-Training and Multi-View Transformer Hanwen Liu, Daniel Hajialigol, Benny Antony, Aiguo Han, Xuan Wang
PDF OpenReview
Effect of Random Learning Rate: Theoretical Analysis of SGD Dynamics in Non-Convex Optimization via Stationary Distribution Naoki Yoshida, Shogo Nakakita, Masaaki Imaizumi
PDF OpenReview
Effective Bayesian Causal Inference via Structural Marginalisation and Autoregressive Orders Christian Toth, Christian Knoll, Franz Pernkopf, Robert Peharz
PDF OpenReview
Effective Layer Pruning Through Similarity Metric Perspective Ian Pons, Bruno Yamamoto, Anna Helena Reali Costa, Artur Jordao
PDF OpenReview
Effective Sharpness Aware Minimization Requires Layerwise Perturbation Scaling Moritz Haas, Jin Xu, Volkan Cevher, Leena Chennuru Vankadara
PDF OpenReview
Efficiency and Transferability of Inductive Mondrian Conformal Predictors for Drug-Drug Synergy Arushi GK Majha
PDF OpenReview
Efficient 3D Molecular Generation with Flow Matching and Scale Optimal Transport Ross Irwin, Alessandro Tibo, Jon Paul Janet, Simon Olsson
PDF OpenReview
Efficient Adaptive Federated Optimization Su Hyeong Lee, Sidharth Sharma, Manzil Zaheer, Tian Li
PDF OpenReview
Efficient Differentially Private Fine-Tuning of Diffusion Models Jing Liu, Andrew Lowy, Toshiaki Koike-Akino, Kieran Parsons, Ye Wang
PDF OpenReview
Efficient Document Ranking with Learnable Late Interactions Himanshu Jain, Ziwei Ji, Ankit Singh Rawat, Andreas Veit, Sadeep Jayasumana, Sashank J. Reddi, Aditya Krishna Menon, Felix Yu
PDF OpenReview
Efficient Document Ranking with Learnable Late Interactions Himanshu Jain, Ziwei Ji, Sashank J. Reddi, Ankit Singh Rawat, Felix Yu, Aditya Krishna Menon, Sadeep Jayasumana
PDF OpenReview
Efficient Evolutionary Search over Chemical Space with Large Language Models Haorui Wang, Marta Skreta, Yuanqi Du, Wenhao Gao, Lingkai Kong, Cher Tian Ser, Felix Strieth-Kalthoff, Chenru Duan, Yuchen Zhuang, Yue Yu, Yanqiao Zhu, Alan Aspuru-Guzik, Kirill Neklyudov, Chao Zhang
PDF OpenReview
Efficient Evolutionary Search over Chemical Space with Large Language Models Haorui Wang, Marta Skreta, Yuanqi Du, Wenhao Gao, Lingkai Kong, Cher Tian Ser, Felix Strieth-Kalthoff, Chenru Duan, Yuchen Zhuang, Yue Yu, Yanqiao Zhu, Alan Aspuru-Guzik, Kirill Neklyudov, Chao Zhang
PDF OpenReview
Efficient Inverse Reinforcement Learning Without Compounding Errors Nicolas Espinosa Dice, Gokul Swamy, Sanjiban Choudhury, Wen Sun
PDF OpenReview
Efficient Linear System Solver with Transformers Max Vladymyrov, Johannes von Oswald, Nolan Andrew Miller, Mark Sandler
PDF OpenReview
Efficient LLM Pruning with Global Token-Dependency Awareness and Hardware-Adapted Inference Oshin Dutta, Ritvik Gupta, Sumeet Agarwal
PDF OpenReview
Efficient Multi-Prompt Evaluation of LLMs Felipe Maia Polo, Ronald Xu, Lucas Weber, Mírian Silva, Onkar Bhardwaj, Leshem Choshen, Allysson Flavio Melo de Oliveira, Yuekai Sun, Mikhail Yurochkin
PDF OpenReview
Efficient Offline Learning of Ranking Policies via Top-$k$ Policy Decomposition Ren Kishimoto, Koichi Tanaka, Haruka Kiyohara, Yusuke Narita, Nobuyuki Shimizu, Yasuo Yamamoto, Yuta Saito
PDF OpenReview
Efficient Offline Reinforcement Learning: The Critic Is Critical Adam Jelley, Trevor McInroe, Sam Devlin, Amos Storkey
PDF OpenReview
Efficient Training of Language Models with Compact and Consistent Next Token Distributions Ashutosh Sathe, Sunita Sarawagi
PDF OpenReview
EggNet: An Evolving Graph-Based Graph Attention Network for Particle Track Reconstruction Paolo Calafiura, Jay Chan, Loic Delabrouille, Brandon Wang
PDF OpenReview
EgoSim: Egocentric Exploration in Virtual Worlds with Multi-Modal Conditioning Wei Yu, Songheng Yin, Steve Easterbrook, Animesh Garg
PDF OpenReview
EigenVI: Score-Based Variational Inference with Orthogonal Function Expansions Diana Cai, Chirag Modi, Charles Margossian, Robert M. Gower, David Blei, Lawrence K. Saul
PDF OpenReview
Eliciting Black-Box Representations from LLMs Through Self-Queries Dylan Sam, Marc Anton Finzi
PDF OpenReview
ELITR-Bench: A Meeting Assistant Benchmark for Long-Context Language Models Thibaut Thonet, Jos Rozen, Laurent Besacier
PDF OpenReview
Emergent Representations in Networks Trained with the Forward-Forward Algorithm Niccolo Tosato, Lorenzo Basile, Emanuele Ballarin, Giuseppe De Alteriis, Alberto Cazzaniga, Alessio Ansuini
PDF OpenReview
EMPO: A Clustering-Based On-Policy Algorithm for Offline Reinforcement Learing Jongeui Park, Myungsik Cho, Youngchul Sung
PDF OpenReview
End-to-End Causal Effect Estimation from Unstructured Natural Language Data Nikita Dhawan, Leonardo Cotta, Karen Ullrich, Rahul Krishnan, Chris J. Maddison
PDF OpenReview
End-to-End Differentiable Model of Robot-Terrain Interactions Ruslan Agishev, Vladimír Kubelka, Martin Pecka, Tomas Svoboda, Karel Zimmermann
PDF OpenReview
Energy-Based Hopfield Boosting for Out-of-Distribution Detection Claus Hofmann, Simon Lucas Schmid, Bernhard Lehner, Daniel Klotz, Sepp Hochreiter
PDF OpenReview
Energy-Based Hopfield Boosting for Out-of-Distribution Detection Claus Hofmann, Simon Lucas Schmid, Bernhard Lehner, Daniel Klotz, Sepp Hochreiter
PDF OpenReview
Energy-Free Guidance of Geometric Diffusion Models for 3D Molecule Inverse Design Aksh Garg, Jiaqi Han, Sanjay Nagaraj, Minkai Xu
PDF OpenReview
Energy-Free Guidance of Geometric Diffusion Models for 3D Molecule Inverse Design Jiaqi Han, Aksh Garg, Sanjay Nagaraj, Minkai Xu
PDF OpenReview
Energy-Free Guidance of Geometric Diffusion Models for 3D Molecule Inverse Design Sanjay Nagaraj, Jiaqi Han, Aksh Garg, Minkai Xu
PDF OpenReview
Enhancing Actor-Critic Decision-Making with Afterstate Models for Continuous Control Norio Kosaka
PDF OpenReview
Enhancing Concept-Based Learning with Logic Deepika Vemuri, Gautham Bellamkonda, Vineeth N. Balasubramanian
PDF OpenReview
Enhancing Concept-Based Learning with Logic Deepika Vemuri, Gautham Bellamkonda, Vineeth N. Balasubramanian
PDF OpenReview
Enhancing Fine-Grained Multi-Modal Alignment via Adapters: A Parameter-Efficient Training Framework for Referring Image Segmentation Zunnan Xu, Jiaqi Huang, Ting Liu, Yong Liu, Haonan Han, Kehong Yuan, Xiu Li
PDF OpenReview
Enhancing Intent Understanding for Ambiguous Prompt: A Human-Machine Co-Adaption Strategy Yangfan He, Yuxuan Bai, Tianyu Shi
PDF OpenReview
Enhancing LLM Complex Reasoning Capability Through Hyperbolic Geometry Menglin Yang, Aosong Feng, Bo Xiong, Jiahong Liu, Irwin King, Rex Ying
PDF OpenReview
Enhancing Multi-Tip Artifact Detection in STM Images Using Fourier Transform and Vision Transformers Tommaso Rodani, Alessio Ansuini, Alberto Cazzaniga
PDF OpenReview
Enhancing Peak Assignment in CNMR Spectroscopy: A Novel Approach Using Multimodal Alignment Hao Xu, Zhengyang Zhou, Pengyu Hong
PDF OpenReview
Enhancing Protein Design Robustness Through Noise-Informed Sequence Design Yehlin Cho, Sergey Ovchinnikov, Christopher Frank
PDF OpenReview
Enhancing Single-Cell VAE Latent Space via Semi-Supervision Meichen Gong, Konstantin Ivanov, Merja Heinäniemi, Ville Hautamaki
PDF OpenReview
Enhancing Stability for Large Models Training in Constrained Bandwidth Networks Yun Dai, Tejas Dharamsi, Pin-Lun Hsu, Tao Song, Hamed Firooz
PDF OpenReview
Enhancing the Resilience of LLMs Against Grey-Box Extractions Hanbo Huang, Yihan Li, Bowen Jiang, Bo Jiang, Lin Liu, Zhuotao Liu, Ruoyu Sun, Shiyu Liang
PDF OpenReview
Ensemble Guidance: Towards Generative 3D SBDD in Bioactive Chemical Spaces Charles Harris, Arian Rokkum Jamasb, Pietro Lio, Tom Leon Blundell
PDF OpenReview
EPD: Long-Term Memory Extraction, Context-Aware Planning and Multi-Iteration Decision @ EgoPlan Challenge ICML 2024 Letian Shi, Qi Lv, Xiang Deng, Liqiang Nie
PDF OpenReview
Equation Identification for Fluid Flows via Physics-Informed Neural Networks Alexander New, Marisel Villafañe-Delgado, Charles Shugert
PDF OpenReview
EquiTorch: A Modularized Package for Flexibly Constructing Equivariant GNNs Building upon PyTorch-Geometric Tong Wang, Chuan Chen
PDF OpenReview
Equivariant Flow Matching for Molecular Conformer Generation Majdi Hassan, Nikhil Shenoy, Jungyoon Lee, Hannes Stark, Stephan Thaler, Dominique Beaini
PDF OpenReview
Equivariant Flow Matching for Molecular Conformer Generation Majdi Hassan, Nikhil Shenoy, Jungyoon Lee, Hannes Stark, Stephan Thaler, Dominique Beaini
PDF OpenReview
Equivariant Neural Diffusion for Molecule Generation François R J Cornet, Grigory Bartosh, Mikkel N. Schmidt, Christian A. Naesseth
PDF OpenReview
Equivariant Transformer Forcefields for Molecular Conformer Generation Rui Feng, Binghong Chen, Chao Zhang
PDF OpenReview
Equivariant vs. Invariant Layers: A Comparison of Backbone and Pooling for Point Cloud Classification Abihith Kothapalli, Ashkan Shahbazi, Xinran Liu, Robert Sheng, Soheil Kolouri
PDF OpenReview
Essentially Sharp Estimates on the Entropy Regularization Error in Discounted Markov Decision Processes Johannes Müller, Semih Cayci
PDF OpenReview
Estimating Probability Densities of Tabular Data Using a Transformer Model Combined with Denoising Diffusion Henry W. Leung, Jo Bovy, Joshua S. Speagle
PDF OpenReview
Ethereum AI Agent Coordinator (EAAC): A Framework for AI Agent Activity Coordination Taehoon Kim
PDF OpenReview
Ethical-Lens: Curbing Malicious Usages of Open-Source Text-to-Image Models Yuzhu Cai, Sheng Yin, Yuxi Wei, Chenxin Xu, Weibo Mao, Felix Juefei-Xu, Siheng Chen, Yanfeng Wang
PDF OpenReview
Euler Operators for Mis-Specified Physics-Informed Neural Networks Charlie Cowen-Breen, Yongji Wang, Stephen Bates, Ching-Yao Lai
PDF OpenReview
Evaluating Self-Supervised Foundation Models in Holographic Imaging Silas Dietler, Yanick Zeder, Elias Graf, Kilian Koch, Andreas Schwendimann, Tommaso Bendinelli
PDF OpenReview
Evaluation of RAG Metrics for Question Answering in the Telecom Domain Sujoy Roychowdhury, Sumit Soman, H. G. Ranjani, Neeraj Gunda, Vansh Chhabra, Sai Krishna Bala
PDF OpenReview
EVCL: Elastic Variational Continual Learning with Weight Consolidation Hunar Batra, Ronald Clark
PDF OpenReview
Event-Based Federated Q-Learning Guner Dilsad Er, Michael Muehlebach
PDF OpenReview
EvoSBDD: Latent Evolution for Accurate and Efficient Structure-Based Drug Design Danny Reidenbach
PDF OpenReview
Exact Soft Analytical Side-Channel Attacks Using Tractable Circuits Thomas Wedenig, Rishub Nagpal, Gaëtan Cassiers, Stefan Mangard, Robert Peharz
PDF OpenReview
Explaining the Model, Protecting Your Data: Revealing and Mitigating the Data Privacy Risks of Post-Hoc Model Explanations via Membership Inference Catherine Huang, Martin Pawelczyk, Himabindu Lakkaraju
PDF OpenReview
Exploiting Activation Sparsity with Dense to Dynamic-K Mixture-of-Experts Conversion Filip Szatkowski, Bartosz Wójcik, Mikołaj Piórczyński, Simone Scardapane
PDF OpenReview
Exploiting Approximate Symmetry for Efficient Multi-Agent Reinforcement Learning Batuhan Yardim, Niao He
PDF OpenReview
Exploiting Exogenous Structure for Sample-Efficient Reinforcement Learning Jia Wan, Sean R. Sinclair, Devavrat Shah, Martin J Wainwright
PDF OpenReview
Exploiting LLM Quantization Kazuki Egashira, Mark Vero, Robin Staab, Jingxuan He, Martin Vechev
PDF OpenReview
ExPLoRA: Parameter-Efficient Extended Pre-Training to Adapt Vision Transformers Under Domain Shifts Samar Khanna, Medhanie Irgau, David B. Lobell, Stefano Ermon
PDF OpenReview
Exploration and Application of AI in Space Science Xiang Zhao, You Song
PDF OpenReview
Exploring and Improving Drafts in Blockwise Parallel Decoding Taehyeon Kim, Ananda Theertha Suresh, Kishore A Papineni, Michael Riley, Sanjiv Kumar, Adrian Benton
PDF OpenReview
Exploring Integrality Grip for Mixed-Integer Programming by MCTS Planning Defeng Liu
PDF OpenReview
Exploring Monotonicity in Early-Exiting Language Models Filipe Laitenberger, Max Belitsky, Denys Sheremet
PDF OpenReview
Exploring Neural Scaling Laws in Molecular Pretraining with Synthetic Tasks Rodrigo Hormazabal, Seung Woo Ko, Inwan Yoo, Sehui Han, Paul Bertens
PDF OpenReview
Exploring Scaling Trends in LLM Robustness Nikolaus H. R. Howe, Michał Zając, Ian R. McKenzie, Oskar John Hollinsworth, Pierre-Luc Bacon, Adam Gleave
PDF OpenReview
Exploring Sequence Landscape of Biosynthetic Gene Clusters with Protein Language Models Tatiana Malygina, Olga Kalinina
PDF OpenReview
Exploring the Development of Complexity over Depth and Time in Deep Neural Networks Hannah Pinson, Aurélien Boland, Vincent Ginis, Mykola Pechenizkiy
PDF OpenReview
Exploring the Internal Mechanisms of Music LLMs: A Study of Root and Quality via Probing and Intervention Techniques Wenye Ma, Gus Xia
PDF OpenReview
ExpoMamba: Exploiting Frequency SSM Blocks for Efficient and Effective Image Enhancement Eashan Adhikarla, Kai Zhang, John Nicholson, Brian D. Davison
PDF OpenReview
Exponential Quantum Communication Advantage in Distributed Inference and Learning Hagay Michaeli, Dar Gilboa, Daniel Soudry, Jarrod Ryan McClean
PDF OpenReview
Expressivity of Neural Networks with Fixed Weights and Learned Biases Ezekiel Williams, Avery Hee-Woon Ryoo, Thomas Jiralerspong, Alexandre Payeur, Matthew G Perich, Luca Mazzucato, Guillaume Lajoie
PDF OpenReview
Extracting Finite State Machines from Transformers Rik Adriaensen, Jaron Maene
PDF OpenReview
Extracting Training Data from Document-Based VQA Models Francesco Pinto, Nathalie Rauschmayr, Florian Tramèr, Philip Torr, Federico Tombari
PDF OpenReview
Extrapolative Protein Design Through Triplet-Based Preference Learning Mostafa Karimi, Sharmi Banerjee, Tommi Jaakkola, Bella Dubrov, Shang Shang, Ron Benson
PDF OpenReview
Fairness Through Controlled (Un)Awareness in Node Embeddings Dennis Vetter, Jasper Forth, Gemma Roig, Holger Dell
PDF OpenReview
Fairness Through Partial Awareness: Evaluation of the Addition of Demographic Information for Bias Mitigation Methods Chung Peng Lee, Rachel Hong, Jamie Heather Morgenstern
PDF OpenReview
FairPFN: Transformers Can Do Counterfactual Fairness Jake Robertson, Noah Hollmann, Noor Awad, Frank Hutter
PDF OpenReview
Faithful and Fast Influence Function via Advanced Sampling Jungyeon Koh, Hyeonsu Lyu, Jonggyu Jang, Hyun Jong Yang
PDF OpenReview
Fast Adaptation and Robust Quantization of Multi-Modal Foundation Models from Associative Memory: A Case Study in SpeechLM Shang Wu, Yen-Ju Lu, Haozheng Luo, Jerry Yao-Chieh Hu, Jiayi Wang, Najim Dehak, Jesus Villalba, Han Liu
PDF OpenReview
Fast and Memory-Efficient Multi-Sequence Generation via Structured Masking Daniel Mingyi Israel, Siyan Zhao, Guy Van den Broeck, Aditya Grover
PDF OpenReview
Fast Machine Unlearning via Robust Training Youssef Allouah, Joshua Kazdan, Rachid Guerraoui, Sanmi Koyejo
PDF OpenReview
Fast Training Dataset Attribution via In-Context Learning Milad Fotouhi, Mohammad Taha Bahadori, Seyi Feyisetan, Payman Arabshahi, David Heckerman
PDF OpenReview
Fast yet Safe: Early-Exiting with Risk Control Metod Jazbec, Alexander Timans, Tin Hadži Veljković, Kaspar Sakmann, Dan Zhang, Christian A. Naesseth, Eric Nalisnick
PDF OpenReview
Fast yet Safe: Early-Exiting with Risk Control Metod Jazbec, Alexander Timans, Tin Hadži Veljković, Kaspar Sakmann, Dan Zhang, Christian A. Naesseth, Eric Nalisnick
PDF OpenReview
Fast-Forward FARGO: Accelerating Protoplanetary Disk Simulations with Limited Data Valentina Tardugno Poleo, David W Hogg, Shirley Ho
PDF OpenReview
FastDecode: High-Throughput LLM Serving Through Disaggregating Attention Computation Jiaao He, Kezhao Huang, Jidong Zhai
PDF OpenReview
Feature Learning Dynamics Under Grokking in a Sparse Parity Task Javier Sanguino Bautiste, Gregor Bachmann, Bobby He, Lorenzo Noci, Thomas Hofmann
PDF OpenReview
Federated Fine-Tuning of Vision Foundation Models via Probabilistic Masking Vasileios Tsouvalas, Yuki M Asano, Aaqib Saeed
PDF OpenReview
Fewer Truncations Improve Language Modeling Hantian Ding, Zijian Wang, Giovanni Paolini, Varun Kumar, Anoop Deoras, Dan Roth, Stefano Soatto
PDF OpenReview
Filling in the Gaps: LLM-Based Structured Data Generation from Semi-Structured Scientific Data Hanbum Ko, Hongjun Yang, Sehui Han, Sungwoong Kim, Sungbin Lim, Rodrigo Hormazabal
PDF OpenReview
Filtered Direct Preference Optimization Tetsuro Morimura, Mitsuki Sakamoto, Yuu Jinnai, Kenshi Abe, Kaito Ariu
PDF OpenReview
Finding NeMo: Localizing Neurons Responsible for Memorization in Diffusion Models Lukas Struppek, Dominik Hintersdorf, Kristian Kersting, Adam Dziedzic, Franziska Boenisch
PDF OpenReview
Finding Structure-Property Relationships for Molecular Property Predictions with Globally Explainable AI Jonas Teufel, Pascal Friederich
PDF OpenReview
Finding Visual Task Vectors Alberto Hojel, Yutong Bai, Trevor Darrell, Amir Globerson, Amir Bar
PDF OpenReview
Fine-Grained Analysis of In-Context Linear Estimation Yingcong Li, Ankit Singh Rawat, Samet Oymak
PDF OpenReview
Fine-Grained Analysis of In-Context Linear Estimation: Data, Architecture, and Beyond Yingcong Li, Ankit Singh Rawat, Samet Oymak
PDF OpenReview
Fine-Tuned Network Relies on Generic Representation to Solve Unseen Cognitive Task Dongyan Lin
PDF OpenReview
Fine-Tuning Large Language Models with User-Level Differential Privacy Zachary Charles, Arun Ganesh, Ryan McKenna, Hugh Brendan McMahan, Nicole Elyse Mitchell, Krishna Pillutla, J Keith Rush
PDF OpenReview
Fine-Tuning Medical Language Models for Enhanced Long-Contextual Understanding and Domain Expertise Qimin Yang, Rongshengwang, Chen Jiexin, Runqi Su, Tao Tan
PDF OpenReview
Fine-Tuning the ESM2 Protein Language Model to Understand the Functional Impact of Missense Variants Ali Saadat, Jacques Fellay
PDF OpenReview
Fine-Tuning with Uncertainty-Aware Priors Makes Vision and Language Foundation Models More Reliable Tim G. J. Rudner, Xiang Pan, Yucen Lily Li, Ravid Shwartz-Ziv, Andrew Gordon Wilson
PDF OpenReview
Finite Sample Identification: From Frequency to Time Domain Anastasios Tsiamis, Mohamed Abdalmoaty, Roy S. Smith, John Lygeros
PDF OpenReview
Finite-Time Convergence to an $\epsilon$-Efficient Nash Equilibrium in Potential Games Anna Maria Maddux, Reda Ouhamma, Maryam Kamgarpour
PDF OpenReview
Fisher-Aware Quantization for DETR Detectors with Critical-Category Objectives Huanrui Yang, Yafeng Huang, Zhen Dong, Denis A Gudovskiy, Tomoyuki Okuno, Yohei Nakata, Yuan Du, Kurt Keutzer, Shanghang Zhang
PDF OpenReview
Flexible Docking via Unbalanced Flow Matching Gabriele Corso, Vignesh Ram Somnath, Noah Getz, Regina Barzilay, Tommi Jaakkola, Andreas Krause
PDF OpenReview
Flexible Docking via Unbalanced Flow Matching Gabriele Corso, Vignesh Ram Somnath, Noah Getz, Regina Barzilay, Tommi Jaakkola, Andreas Krause
PDF OpenReview
FlowBack: A Flow-Matching Approach for Generative Backmapping of Macromolecules Michael Jones, Smayan Khanna, Andrew Ferguson
PDF OpenReview
FoMu-SSL: Foundation Model-Guided Multi-Sensor Self-Supervised Learning for Remote Sensing Dabin Seo, Haeji Jung, Jinkyu Kim
PDF OpenReview
Forecasting Smog Clouds with Deep Learning: A Proof-of-Concept Valentijn Oldenburg, Juan Cardenas-Cartagena, Matias Valdenegro-Toro
PDF OpenReview
Fourier Neural Operator Based Surrogates for $\textrm{CO}_2$ Storage in Realistic Geologies Anirban Chandra, Marius Koch, Suraj Pawar, Aniruddha Panda, Kamyar Azizzadenesheli, Jeroen Snippe, Faruk O. Alpak, Farah Hariri, Clement Etienam, Pandu Devarakota, Anima Anandkumar, Detlef Hohl
PDF OpenReview
Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos Jiahe Liu, Youran Qu, Qi Yan, Xiaohui Zeng, Lele Wang, Renjie Liao
PDF OpenReview
Free-Energy Equilibria: Toward a Theory of Interactions Between Boundedly-Rational Agents David Hyland, Tomáš Gavenčiak, Lancelot Da Costa, Conor Heins, Vojtech Kovarik, Julian Gutierrez, Michael J. Wooldridge, Jan Kulveit
PDF OpenReview
From AlexNet to Transformers: Measuring the Non-Linearity of Deep Neural Networks with Affine Optimal Transport Quentin Bouniot, Ievgen Redko, Anton Mallasto, Charlotte Laclau, Oliver Struckmeier, Karol Arndt, Markus Heinonen, Ville Kyrki, Samuel Kaski
PDF OpenReview
From Graph Diffusion to Graph Classification Jia Jun Cheng Xian, Sadegh Mahdavi, Renjie Liao, Oliver Schulte
PDF OpenReview
From Laboratory to Everyday Life: Personalized Stress Prediction via Smartwatches Batuhan Koyuncu, Aleyna Dilan Kıran, Katja Heilmann, Laith Hamid, Anja Buder, Veronika Engert, Martin Walter, Isabel Valera
PDF OpenReview
From Text to Pixel: Advancing Long-Context Understanding in MLLMs Yujie Lu, Xiujun Li, Tsu-Jui Fu, Miguel Eckstein, William Yang Wang
PDF OpenReview
From Words to Worlds: Compositionality for Cognitive Architectures Ruchira Dhar, Anders Søgaard
PDF OpenReview
Function Space Diversity for Uncertainty Prediction via Repulsive Last-Layer Ensembles Sophie Steger, Christian Knoll, Bernhard Klein, Holger Fröning, Franz Pernkopf
PDF OpenReview
Functional Acceleration for Policy Mirror Descent Veronica Chelu, Doina Precup
PDF OpenReview
Fundamental Limits of Prompt Compression: A Rate-Distortion Framework for Black-Box Language Models Adway Girish, Alliot Nagle, Ashok Vardhan Makkuva, Marco Bondaschi, Michael Gastpar, Hyeji Kim
PDF OpenReview
Fundamental Limits of Weak Learnability in High-Dimensional Multi-Index Models Emanuele Troiani, Yatin Dandi, Leonardo Defilippis, Lenka Zdeborova, Bruno Loureiro, Florent Krzakala
PDF OpenReview
FusionDTI: Fine-Grained Binding Discovery with Token-Level Fusion for Drug-Target Interaction Zhaohan Meng, Zaiqiao Meng, Iadh Ounis
PDF OpenReview
FusOn-pLM: A Fusion Oncoprotein-Specific Language Model via Focused Probabilistic Masking Sophia Vincoff, Shrey Goel, Kseniia Kholina, Pranam Chatterjee
PDF OpenReview
Future-Proof Vaccine Design with a Generative Model of Antibody Cross-Reactivity Noor Youssef, Sarah Gurev, Hannah Rivka Pierce-Hoffman, Alexander A Cohen, Luis F Caldera, Pamela J Bjorkman, Debora Susan Marks
PDF OpenReview
Games for AI-Control: Models of Safety Evaluations of AI Deployment Protocols Charlie Griffin, Buck Shlegeris, Alessandro Abate
PDF OpenReview
Gaussian Process-Based Representation Learning via Timeseries Symmetries Petar Bevanda, Max Beier, Armin Lederer, Alexandre Capone, Stefan Georg Sosnowski, Sandra Hirche
PDF OpenReview
Gene Regulatory Network Inference from Pre-Trained Single-Cell Transcriptomics Transformer with Joint Graph Learning Sindhura Kommu, Yizhi Wang, Yue Wang, Xuan Wang
PDF OpenReview
Gene-Centric Evaluation of Causal Variant Prediction for DNA Models Chantriolnt-Andreas Kapourani, Alice Del Vecchio, Agnieszka Dobrowolska, Andrew Anighoro, Edith M. Hessel, Lindsay Edwards, Cristian Regep
PDF OpenReview
Generalization vs. Memorization: Tracing Language Models' Capabilities Back to Pretraining Data Antonis Antoniades, Xinyi Wang, Yanai Elazar, Alfonso Amayuelas, Alon Albalak, Kexun Zhang, William Yang Wang
PDF OpenReview
Generalized Linear Bandits with Limited Adaptivity Ayush Sawarni, Nirjhar Das, Siddharth Barman, Gaurav Sinha
PDF OpenReview
Generalizing Convolution to Point Clouds Davide Bacciu, Francesco Landolfi
PDF OpenReview
Generalizing Offline Alignment Theoretical Paradigm with Diverse Divergence Constraints Haoyuan Sun, Yuxin Zheng, Yifei Zhao, Yongzhe Chang, Xueqian Wang
PDF OpenReview
Generated Audio Detectors Are Not Robust in Real-World Conditions Soumya Shaw, Ben Nassi, Lea Schönherr
PDF OpenReview
Generating Fine-Grained Causality in Climate Time Series Data for Forecasting and Anomaly Detection Dongqi Fu, Yada Zhu, Hanghang Tong, Kommy Weldemariam, Onkar Bhardwaj, Jingrui He
PDF OpenReview
Generating Potent Poisons and Backdoors from Scratch with Guided Diffusion Hossein Souri, Arpit Bansal, Hamid Kazemi, Liam H Fowl, Aniruddha Saha, Jonas Geiping, Andrew Gordon Wilson, Rama Chellappa, Tom Goldstein, Micah Goldblum
PDF OpenReview
Generation and Human-Expert Evaluation of Interesting Research Ideas Using Knowledge Graphs and Large Language Models Xuemei Gu, Mario Krenn
PDF OpenReview
Generation Constraint Scaling Can Mitigate Hallucination Georgios Kollias, Payel Das, Subhajit Chaudhury
PDF OpenReview
Generative Acceleration of Molecular Dynamics Simulations for Solid-State Electrolytes Juno Nam, Sulin Liu, Gavin Winter, Rafael Gomez-Bombarelli
PDF OpenReview
Generative Autoencoding of Dropout Patterns Shunta Maeda
PDF OpenReview
Generative Classifiers Avoid Shortcut Solutions Alexander Cong Li, Ananya Kumar, Deepak Pathak
PDF OpenReview
Generative Design of Decision Tree Policies for Reinforcement Learning Jacob Pettit, Chak Shing Lee, Jiachen Yang, Alex Ho, Daniel Faissol, Brenden K. Petersen, Mikel Landajuela
PDF OpenReview
Generative Fractional Diffusion Models Gabriel Nobis, Maximilian Springenberg, Marco Aversa, Michael Detzel, Rembert Daems, Roderick Murray-Smith, Shinichi Nakajima, Sebastian Lapuschkin, Stefano Ermon, Tolga Birdal, Manfred Opper, Christoph Knochenhauer, Luis Oala, Wojciech Samek
PDF OpenReview
Generative Model for Small Molecules with Latent Space RL Fine-Tuning to Protein Targets Ulrich Armel Mbou Sob, Qiulin Li, Miguel Arbesú, Oliver Bent, Andries Petrus Smit, Arnu Pretorius
PDF OpenReview
Generative Modeling of Molecular Dynamics Trajectories Bowen Jing, Hannes Stark, Tommi Jaakkola, Bonnie Berger
PDF OpenReview
Geometric Algebra Based Encoding for Graph Prompting Sotirios Panagiotis Chytas, Rudrasis Chakraborty, Vikas Singh
PDF OpenReview
Geometric Algebra Transformers for Large 3D Meshes via Cross-Attention Julian Suk, Pim De Haan, Baris Imre, Jelmer M. Wolterink
PDF OpenReview
Geometric Median Matching for Robust Data Pruning Anish Acharya, Inderjit S Dhillon, Sujay Sanghavi
PDF OpenReview
Geometric Self-Supervised Pretraining on 3D Protein Structures Using Subgraphs Michail Chatzianastasis, George Dasoulas, Michalis Vazirgiannis
PDF OpenReview
Geometric Wireless Simulation with Equivariant Transformers Thomas Hehn, Markus Peschl, Tribhuvanesh Orekondy, Arash Behboodi, Johann Brehmer
PDF OpenReview
Geometry Aware Deep Learning for Integrated Closed-Shell and Open-Shell Systems Beom Seok Kang, Vignesh C Bhethanabotla, Mohammadamin Tavakoli, William Goddard, Anima Anandkumar
PDF OpenReview
Geometry Fidelity for Spherical Images Anders Christensen, Nooshin Mojab, Khushman Patel, Karan Ahuja, Zeynep Akata, Ole Winther, Mar Gonzalez-Franco, Andrea Colaco
PDF OpenReview
Geometry-Aware Autoencoders for Metric Learning and Generative Modeling on Data Manifolds Xingzhi Sun, Danqi Liao, Kincaid MacDonald, Yanlei Zhang, Guillaume Huguet, Guy Wolf, Ian Adelstein, Tim G. J. Rudner, Smita Krishnaswamy
PDF OpenReview
Geometry-Informed Neural Networks Arturs Berzins, Andreas Radler, Sebastian Sanokowski, Sepp Hochreiter, Johannes Brandstetter
PDF OpenReview
GeomVerse: A Systematic Evaluation of Large Models for Geometric Reasoning Mehran Kazemi, Hamidreza Alvari, Ankit Anand, Jialin Wu, Xi Chen, Radu Soricut
PDF OpenReview
Get It Cooperating: Enhancing Generative Agent Cooperation with Commitment Devices Feng Yan, Qitian Jason Hu, Nan Jiang, Xinyuan Sun
PDF OpenReview
Get Rich Quick: Exact Solutions Reveal How Unbalanced Initializations Promote Rapid Feature Learning Daniel Kunin, Allan Raventos, Clémentine Carla Juliette Dominé, Feng Chen, David Klindt, Andrew M Saxe, Surya Ganguli
PDF OpenReview
Getting More Juice Out of the SFT Data: Reward Learning from Human Demonstration Improves SFT for LLM Alignment Jiaxiang Li, Siliang Zeng, Hoi To Wai, Chenliang Li, Alfredo Garcia, Mingyi Hong
PDF OpenReview
GLAD: Improving Latent Graph Generative Modeling with Simple Quantization Van Khoa Nguyen, Yoann Boget, Frantzeska Lavda, Alexandros Kalousis
PDF OpenReview
Glauber Generative Model: Discrete Diffusion Models via Binary Classification Harshit Varma, Dheeraj Mysore Nagaraj, Karthikeyan Shanmugam
PDF OpenReview
GLAudio Listens to the Sound of the Graph Aurelio Sulser, Johann Wenckstern, Clara Kümpel
PDF OpenReview
Gone with the Bits: Benchmarking Bias in Facial Phenotype Degradation Under Low-Rate Neural Compression Tian Qiu, Arjun Nichani, Rasta Tadayon, Haewon Jeong
PDF OpenReview
GPT-HyperAgent: Scalable Uncertainty Estimation and Exploration for Foundation Model Decisions Yingru Li, Jiawei Xu, Zhi-Quan Luo
PDF OpenReview
GPTVQ: The Blessing of Dimensionality for LLM Quantization Mart Van Baalen, Andrey Kuzmin, Markus Nagel, Peter Couperus, Artem Bolshakov, Cedric Bastoul, Eric Mahurin, Tijmen Blankevoort, Paul Whatmough
PDF OpenReview
Gradient Descent Induces Alignment Between Weights and the Pre-Activation Tangents for Deep Non-Linear Networks Daniel Beaglehole, Ioannis Mitliagkas, Atish Agarwala
PDF OpenReview
Gradient Descent Robustly Learns the Intrinsic Dimension of Data in Training Convolutional Neural Networks Chenyang Zhang, Gao Peifeng, Difan Zou, Yuan Cao
PDF OpenReview
Gradient Descent with Polyak’s Momentum Finds Flatter Minima via Large Catapults Prin Phunyaphibarn, Junghyun Lee, Bohan Wang, Huishuai Zhang, Chulhee Yun
PDF OpenReview
Gradient Dissent in Language Model Training and Saturation Andrei Mircea, Ekaterina Lobacheva, Irina Rish
PDF OpenReview
Gradient-Based Discrete Sampling with Automatic Cyclical Scheduling Patrick Pynadath, Riddhiman Bhattacharya, Arun Narayanan Hariharan, Ruqi Zhang
PDF OpenReview
Graph Convolutional Networks for Learning Laplace-Beltrami Operators Yingying Wu, Roger Fu, Richard Peng, Qifeng Chen
PDF OpenReview
Graph Multi-Similarity Learning for Molecular Property Prediction Hao Xu, Zhengyang Zhou, Pengyu Hong
PDF OpenReview
Graph-Based Retriever Captures the Long Tail of Biomedical Knowledge Julien Delile, Srayanta Mukherjee, Anton Van Pamel, Leonid Zhukov
PDF OpenReview
Graph2Token: Make LLMs Understand Molecule Graphs Runze Wang, Mingqi Yang, Yanming Shen
PDF OpenReview
GraphBPE: Molecular Graphs Meet Byte-Pair Encoding Yuchen Shen, Barnabas Poczos
PDF OpenReview
GraphKAN: Graph Kolmogorov Arnold Network for Small Molecule-Protein Interaction Predictions Tashin Ahmed, Md Habibur Rahman Sifat
PDF OpenReview
Grappa - A Machine Learned Molecular Mechanics Force Field Leif Seute, Eric Hartmann, Jan Stuehmer, Frauke Gräter
PDF OpenReview
GRASS: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients Aashiq Muhamed, Oscar Li, David Woodruff, Mona T. Diab, Virginia Smith
PDF OpenReview
GROD: Enhancing Generalization of Transformer with Out-of-Distribution Detection Yijin Zhou, Yu Guang Wang
PDF OpenReview
Grokked Transformers Are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization Boshi Wang, Xiang Yue, Yu Su, Huan Sun
PDF OpenReview
Grokking and the Geometry of Circuit Formation Ahmed Imtiaz Humayun, Randall Balestriero, Richard Baraniuk
PDF OpenReview
Grokking, Rank Minimization and Generalization in Deep Learning David Yunis, Kumar Kshitij Patel, Samuel Wheeler, Pedro Henrique Pamplona Savarese, Gal Vardi, Karen Livescu, Michael Maire, Matthew Walter
PDF OpenReview
GROOT-1.5: Learning to Follow Multi-Modal Instructions from Weak Supervision Shaofei Cai, Bowei Zhang, Zihao Wang, Xiaojian Ma, Anji Liu, Yitao Liang
PDF OpenReview
Growing Q-Networks: Solving Continuous Control Tasks with Adaptive Control Resolution Tim Seyde, Peter Werner, Wilko Schwarting, Markus Wulfmeier, Daniela Rus
PDF OpenReview
Hallmarks of Optimization Trajectories in Neural Networks and LLMs: Directional Exploration and Redundancy Sidak Pal Singh, Bobby He, Thomas Hofmann, Bernhard Schölkopf
PDF OpenReview
Handling Delay in Reinforcement Learning Caused by Parallel Computations of Neurons Ivan Anokhin, Rishav Rishav, Stephen Chung, Irina Rish, Samira Ebrahimi Kahou
PDF OpenReview
Hardware-Efficient Quantization for Green Custom Foundation Models Toshiaki Koike-Akino, Chang Meng, Volkan Cevher, Giovanni De Micheli
PDF OpenReview
Have Faith in Faithfulness: Going Beyond Circuit Overlap When Finding Model Mechanisms Michael Hanna, Sandro Pezzelle, Yonatan Belinkov
PDF OpenReview
Heterogeneous Federated Zeroth-Order Optimization Using Gradient Surrogates Yao Shu, Xiaoqiang Lin, Zhongxiang Dai, Bryan Kian Hsiang Low
PDF OpenReview
Hidden Learning Dynamics of Capability Before Behavior in Diffusion Models Core Francisco Park, Maya Okawa, Andrew Lee, Ekdeep Singh Lubana, Hidenori Tanaka
PDF OpenReview
Hierarchical Contrastive Learning for Enzyme Function Prediction Soorin Yim, Doyeong Hwang, Kiyoung Kim, Sehui Han
PDF OpenReview
Hierarchical Reinforcement Learning and Model Predictive Control for Strategic Motion Planning in Autonomous Racing Rudolf Reiter, Jasper Hoffmann, Joschka Boedecker, Moritz Diehl
PDF OpenReview
Hierarchical State Space Models for Continuous Sequence-to-Sequence Modeling Raunaq Bhirangi, Chenyu Wang, Venkatesh Pattabiraman, Carmel Majidi, Abhinav Gupta, Tess Hellebrekers, Lerrel Pinto
PDF OpenReview
High-Resolution in Silico Painting with Generative Models Trang Le
PDF OpenReview
Higher Order and Self-Referential Evolution for Population-Based Methods Samuel Coward, Chris Lu, Alistair Letcher, Minqi Jiang, Jack Parker-Holder, Jakob Nicolaus Foerster
PDF OpenReview
HLSTransform: Energy-Efficient Llama 2 Inference on FPGAs via High Level Synthesis Darren Yan Key, Andy He, Mason Bulling, Andrew Chang, Skyler Shapiro, Everett Lee
PDF OpenReview
How Consensus-Based Optimization Can Be Interpreted as a Stochastic Relaxation of Gradient Descent Konstantin Riedl, Timo Klock, Carina Geldhauser, Massimo Fornasier
PDF OpenReview
How Do Llamas Process Multilingual Text? a Latent Exploration Through Activation Patching Clément Dumas, Veniamin Veselovsky, Giovanni Monea, Robert West, Chris Wendler
PDF OpenReview
How Do Nonlinear Transformers Acquire Generalization-Guaranteed CoT Ability? Hongkang Li, Meng Wang, Songtao Lu, Xiaodong Cui, Pin-Yu Chen
PDF OpenReview
How Do Nonlinear Transformers Acquire Generalization-Guaranteed CoT Ability? Hongkang Li, Meng Wang, Songtao Lu, Xiaodong Cui, Pin-Yu Chen
PDF OpenReview
How Do Transformers "Do" Physics? Investigating the Simple Harmonic Oscillator Subhash Kantamneni, Ziming Liu, Max Tegmark
PDF OpenReview
How Do Transformers Fill in the Blanks? a Case Study on Matrix Completion Pulkit Gopalani, Ekdeep Singh Lubana, Wei Hu
PDF OpenReview
How Do Transformers Fill in the Blanks? a Case Study on Matrix Completion Pulkit Gopalani, Ekdeep Singh Lubana, Wei Hu
PDF OpenReview
How Do Transformers Fill in the Blanks? a Case Study on Matrix Completion Pulkit Gopalani, Ekdeep Singh Lubana, Wei Hu
PDF OpenReview
How Does Return Distribution in Distributional Reinforcement Learning Help Optimization? Ke Sun, Bei Jiang, Linglong Kong
PDF OpenReview
How Transformers Learn Diverse Attention Correlations in Masked Vision Pretraining Yu Huang, Zixin Wen, Yuejie Chi, Yingbin Liang
PDF OpenReview
How Transformers Utilize Multi-Head Attention in In-Context Learning? a Case Study on Sparse Linear Regression Xingwu Chen, Lei Zhao, Difan Zou
PDF OpenReview
How Truncating Weights Improves Reasoning in Language Models Lei Chen, Joan Bruna, Alberto Bietti
PDF OpenReview
How Truncating Weights Improves Reasoning in Language Models Lei Chen, Joan Bruna, Alberto Bietti
PDF OpenReview
Humans Linguistically Align to Their Conversational Partners, and Language Models Should Too Rachel Ostrand, Sara E Berger
PDF OpenReview
Hummer: Towards Limited Competitive Preference Dataset Li Jiang, Yusen Wu, Junwu Xiong, Jingqing Ruan, Qingpei Guo, Zujie Wen, Jun Zhou, Xiaotie Deng
PDF OpenReview
Hummer: Towards Limited Competitive Preference Dataset Li Jiang, Yusen Wu, Junwu Xiong, Jingqing Ruan, Yichuan Ding, Qingpei Guo, Zujie Wen, Jun Zhou, Xiaotie Deng
PDF OpenReview
Hybrid Recurrent Models Support Emergent Descriptions for Hierarchical Planning and Control Poppy Collis, Ryan Singh, Paul Kinghorn, Christopher Buckley
PDF OpenReview
Hydragen: High-Throughput LLM Inference with Shared Prefixes Jordan Juravsky, Bradley Brown, Ryan Saul Ehrlich, Daniel Y Fu, Christopher Re, Azalia Mirhoseini
PDF OpenReview
Hyperspectral Unmixing for Raman Spectroscopy via Physics-Constrained Autoencoders Dimitar Georgiev, Álvaro Fernández-Galiana, Simon Vilms Pedersen, Georgios Papadopoulos, Ruoxiao Xie, Molly M. Stevens, Mauricio Barahona
PDF OpenReview
Hyperspectral Unmixing for Raman Spectroscopy via Physics-Constrained Autoencoders Dimitar Georgiev, Álvaro Fernández-Galiana, Simon Vilms Pedersen, Georgios Papadopoulos, Ruoxiao Xie, Molly M. Stevens, Mauricio Barahona
PDF OpenReview
Hypothesis Testing the Circuit Hypothesis in LLMs Claudia Shi, Nicolas Beltran-Velez, Achille Nazaret, Carolina Zheng, Adrià Garriga-Alonso, Andrew Jesson, Maggie Makar, David Blei
PDF OpenReview
Identifiable Latent Bandits: Combining Observational Data and Exploration for Personalized Healthcare Ahmet Zahid Balcıoğlu, Emil Carlsson, Fredrik D. Johansson
PDF OpenReview
Identifying Biological Priors and Structure in Single-Cell Foundation Models Flavia Pedrocchi, Stefan Stark, Gunnar Ratsch, Amir Joudaki
PDF OpenReview
Identifying Functionally Important Features with End-to-End Sparse Dictionary Learning Dan Braun, Jordan Taylor, Nicholas Goldowsky-Dill, Lee Sharkey
PDF OpenReview
Identifying Latent State Transition in Non-Linear Dynamical Systems Çağlar Hızlı, Çagatay Yildiz, Matthias Bethge, S. T. John, Pekka Marttinen
PDF OpenReview
Impact4Cast: Forecasting High-Impact Research Topics via Machine Learning on Evolving Knowledge Graphs Xuemei Gu, Mario Krenn
PDF OpenReview
Implementability of Information Elicitation Mechanisms with Pre-Trained Language Models Zachary Robertson, Hannah Cha, Andrew Sheha, Sanmi Koyejo
PDF OpenReview
Implicit Diffusion: Efficient Optimization Through Stochastic Sampling Pierre Marion, Anna Korba, Peter Bartlett, Mathieu Blondel, Valentin De Bortoli, Arnaud Doucet, Felipe Llinares-López, Courtney Paquette, Quentin Berthet
PDF OpenReview
Implicit Optimization Bias of Next-Token Prediction in Linear Models Christos Thrampoulidis
PDF OpenReview
Implicit Optimization Bias of Next-Token Prediction in Linear Models Christos Thrampoulidis
PDF OpenReview
Implicit Regularization of Sharpness-Aware Minimization for Scale-Invariant Problems Bingcong Li, Liang Zhang, Niao He
PDF OpenReview
ImportanceWeighted Multi-Draft Speculative Sampling Ashish J Khisti, Arash Behravesh, Hassan Dbouk, Arash Behboodi, Roland Memisevic, Christos Louizos
PDF OpenReview
Improve Temporal Awareness of LLMs for Domain-General Sequential Recommendation Zhendong Chu, Zichao Wang, Ruiyi Zhang, Yangfeng Ji, Hongning Wang, Tong Sun
PDF OpenReview
Improved Algorithms for Adversarial Bandits with Unbounded Losses Mingyu Chen, Xuezhou Zhang
PDF OpenReview
Improved Algorithms for Contextual Dynamic Pricing Matilde Tullii, Solenne Gaucher, Nadav Merlis, Vianney Perchet
PDF OpenReview
Improved Algorithms for Kernel Matrix-Vector Multiplication Piotr Indyk, Michael Kapralov, Kshiteej Sheth, Tal Wagner
PDF OpenReview
Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses Xiaosen Zheng, Tianyu Pang, Chao Du, Qian Liu, Jing Jiang, Min Lin
PDF OpenReview
Improving AlphaFlow for Efficient Protein Ensembles Generation Shaoning Li, Mingyu Li, Yusong Wang, Xinheng He, Zhang Jian, Nanning Zheng, Pheng-Ann Heng
PDF OpenReview
Improving Consistency Models with Generator-Induced Coupling Thibaut Issenhuth, Ludovic Dos Santos, Jean-Yves Franceschi, Alain Rakotomamonjy
PDF OpenReview
Improving Equivariant Networks with Probabilistic Symmetry Breaking Hannah Lawrence, Vasco Portilheiro, Yan Zhang, Sékou-Oumar Kaba
PDF OpenReview
Improving Flow Matching for Posterior Inference with Physics-Based Controls Benjamin Holzschuh, Nils Thuerey
PDF OpenReview
Improving Fragment-Based Deep Molecular Generative Models Panukorn Taleongpong, Brooks Paige
PDF OpenReview
Improving GFlowNets for Text-to-Image Diffusion Alignment Dinghuai Zhang, Yizhe Zhang, Jiatao Gu, Ruixiang Zhang, Joshua M. Susskind, Navdeep Jaitly, Shuangfei Zhai
PDF OpenReview
Improving GFlowNets for Text-to-Image Diffusion Alignment Dinghuai Zhang, Yizhe Zhang, Jiatao Gu, Ruixiang Zhang, Joshua M. Susskind, Navdeep Jaitly, Shuangfei Zhai
PDF OpenReview
Improving GFlowNets with Monte Carlo Tree Search Nikita Morozov, Daniil Tiapkin, Sergey Samsonov, Alexey Naumov, Dmitry Vetrov
PDF OpenReview
Improving Graph-Language Alignment with Hierarchical Graph Tokenization Yongqiang Chen, Quanming Yao, Juzheng Zhang, James Cheng, Yatao Bian
PDF OpenReview
Improving Molecular Modeling with Geometric GNNs: An Empirical Study Ali Ramlaoui, Théo Saulus, Basile Terver, Victor Schmidt, David Rolnick, Fragkiskos D. Malliaros, Alexandre AGM Duval
PDF OpenReview
Improving Performance Prediction of Electrolyte Formulations with Transformer-Based Molecular Representation Model Indra Priyadarsini, Vidushi Sharma, Seiji Takeda, Akihiro Kishimoto, Lisa Hamada, Hajime Shinohara
PDF OpenReview
Improving Route Development Using Convergent Retrosynthesis Planning Paula Torren-Peraire, Jonas Verhoeven, Dorota Herman, Hugo Ceulemans, Igor V. Tetko, Jörg K. Wegner
PDF OpenReview
Improving Self Consistency in LLMs Through Probabilistic Tokenization Ashutosh Sathe, Divyanshu Aggarwal, Sunayana Sitaram
PDF OpenReview
Improving Sparse Decomposition of Language Model Activations with Gated Sparse Autoencoders Senthooran Rajamanoharan, Arthur Conmy, Lewis Smith, Tom Lieberum, Vikrant Varma, Janos Kramar, Rohin Shah, Neel Nanda
PDF OpenReview
Improving the Accuracy of Coarse-Grained Partial Differential Equations with Grid-Based Reinforcement Learning Jan-Philipp von Bassewitz, Sebastian Kaltenbach, Petros Koumoutsakos
PDF OpenReview
Improving the Efficiency of Self-Supervised Adversarial Training Through Latent Clustering-Based Selection Somrita Ghosh, Yuelin Xu, Xiao Zhang
PDF OpenReview
In Defense of Structural Sparse Adapters for Concurrent LLM Serving Junda Su, Zirui Liu, Zeju Qiu, Weiyang Liu, Zhaozhuo Xu
PDF OpenReview
In Search for Architectures and Loss Functions in Multi-Objective Reinforcement Learning Mikhail Terekhov, Caglar Gulcehre
PDF OpenReview
In Search of Forgotten Domain Generalization Prasanna Mayilvahanan, Roland S. Zimmermann, Thaddäus Wiedemer, Evgenia Rusak, Attila Juhos, Matthias Bethge, Wieland Brendel
PDF OpenReview
In-Context Generalization to New Tasks from Unlabeled Observation Data Anthony Liang, Pavel Czempin, Yutai Zhou, Stephen Tu, Erdem Biyik
PDF OpenReview
In-Context Learning from Training on Unstructured Data: The Role of Co-Occurrence, Positional Information, and Training Data Structure Kevin Christian Wibisono, Yixin Wang
PDF OpenReview
In-Context Learning from Training on Unstructured Data: The Role of Co-Occurrence, Positional Information, and Training Data Structure Kevin Christian Wibisono, Yixin Wang
PDF OpenReview
In-Context Learning Improves Compositional Understanding of Vision-Language Models Matteo Nulli, Anesa Ibrahimi, Avik Pal, Hoshe Lee, Ivona Najdenkoska
PDF OpenReview
In-Context Learning in Presence of Spurious Correlations Hrayr Harutyunyan, Rafayel Darbinyan, Samvel Karapetyan, Hrant Khachatrian
PDF OpenReview
In-Context Learning May Not Elicit Trustworthy Reasoning: A-Not-B Errors in Pretrained Language Models Pengrui Han, Peiyang Song, Haofei Yu, Jiaxuan You
PDF OpenReview
In-Context Learning of Energy Functions Rylan Schaeffer, Mikail Khona, Sanmi Koyejo
PDF OpenReview
In-Context Learning with Long-Context Models: An In-Depth Exploration Amanda Bertsch, Maor Ivgi, Uri Alon, Jonathan Berant, Matthew R. Gormley, Graham Neubig
PDF OpenReview
In-Context Learning with Representations: Contextual Generalization of Trained Transformers Tong Yang, Yu Huang, Yingbin Liang, Yuejie Chi
PDF OpenReview
In-Context Learning with Topological Information for LLM-Based Knowledge Graph Completion Udari Madhushani Sehwag, Kassiani Papasotiriou, Jared Vann, Sumitra Ganesh
PDF OpenReview
In-Context Learning, Can It Break Safety? Sophie Xhonneux, David Dobre, Michael Noukhovitch, Jian Tang, Gauthier Gidel, Dhanya Sridhar
PDF OpenReview
In-Context Principle Learning from Mistakes Tianjun Zhang, Aman Madaan, Luyu Gao, Steven Zhang, Swaroop Mishra, Yiming Yang, Niket Tandon, Uri Alon
PDF OpenReview
In-Context Reinforcement Learning Without Optimal Action Labels Juncheng Dong, Moyang Guo, Ethan X Fang, Zhuoran Yang, Vahid Tarokh
PDF OpenReview
In-Context Symmetries: Self-Supervised Learning Through Contextual World Models Sharut Gupta, Chenyu Wang, Yifei Wang, Tommi Jaakkola, Stefanie Jegelka
PDF OpenReview
Incorporating Stability into Flow Matching Christopher Iliffe Sprague, Arne Elofsson, Hossein Azizpour
PDF OpenReview
Inference Performance Optimization for Large Language Models on CPUs Pujiang He, Shan Zhou, Wenhuan Huang, Changqing Li, Duyi Wang, Bin Guo, Chen Meng, Sheng Gui, Weifei Yu, Yi Xie
PDF OpenReview
Inferring Physiological Properties of Motor Neurons Using Neural Posterior Estimation Pranav Mamidanna, Dario Farina
PDF OpenReview
InfLLM: Training-Free Long-Context Extrapolation for LLMs with an Efficient Context Memory Chaojun Xiao, Pengle Zhang, Xu Han, Guangxuan Xiao, Yankai Lin, Zhengyan Zhang, Zhiyuan Liu, Maosong Sun
PDF OpenReview
InfoNCE: Identifying the Gap Between Theory and Practice Evgenia Rusak, Patrik Reizinger, Attila Juhos, Oliver Bringmann, Roland S. Zimmermann, Wieland Brendel
PDF OpenReview
InfoNCE: Identifying the Gap Between Theory and Practice Evgenia Rusak, Patrik Reizinger, Attila Juhos, Oliver Bringmann, Roland S. Zimmermann, Wieland Brendel
PDF OpenReview
Information Theoretic Guarantees for Policy Alignment in Large Language Models Youssef Mroueh
PDF OpenReview
Information-Theoretic Progress Measures Reveal Grokking Is an Emergent Phase Transition Kenzo Clauw, Daniele Marinazzo, Sebastiano Stramaglia
PDF OpenReview
Informed Meta-Learning Kasia Kobalczyk, Mihaela van der Schaar
PDF OpenReview
Informed Meta-Learning Kasia Kobalczyk, Mihaela van der Schaar
PDF OpenReview
Injecting Hierarchical Biological Priors into Graph Neural Networks for Flow Cytometry Prediction Fatemeh Nassajian Mojarrad, Lorenzo Bini, Thomas Matthes, Stephane Marchand-Maillet
PDF OpenReview
Inpainting Crystal Structure Generations with Score-Based Denoising Xinzhe Dai, Peichen Zhong, Bowen Deng, Yifan Chen, Gerbrand Ceder
PDF OpenReview
Inpainting Galaxy Counts onto N-Body Simulations over Multiple Cosmologies and Astrophysics Antoine Bourdin, Ronan Legin, Matthew Ho, Alexandre Adam, Yashar Hezaveh, Laurence Perreault-Levasseur
PDF OpenReview
InstructBooth: Instruction-Following Personalized Text-to-Image Generation Daewon Chae, Nokyung Park, Jinkyu Kim, Kimin Lee
PDF OpenReview
Instruction Tuning with Loss over Instructions Zhengyan Shi, Adam X. Yang, Bin Wu, Laurence Aitchison, Emine Yilmaz, Aldo Lipani
PDF OpenReview
Instruction-Guided Visual Masking Jinliang Zheng, Jianxiong Li, Sijie Cheng, Yinan Zheng, Jiaming Li, Jihao Liu, Yu Liu, Jingjing Liu, Xianyuan Zhan
PDF OpenReview
Integrating Chemistry Knowledge in Large Language Models via Prompt Engineering Hongxuan Liu, Haoyu Yin, Zhiyao Luo, Xiaonan Wang
PDF OpenReview
Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models Cong Lu, Shengran Hu, Jeff Clune
PDF OpenReview
Interactome-Scale Comparison of Co-Immunoprecipitation and Yeast Two-Hybrid Assays for Protein Interaction Prediction Kapil Devkota, Lenore Cowen, Rohit Singh
PDF OpenReview
InterpBench: Semi-Synthetic Transformers for Evaluating Mechanistic Interpretability Techniques Rohan Gupta, Iván Arcuschin, Thomas Kwa, Adrià Garriga-Alonso
PDF OpenReview
Interpolated-MLPs: Controllable Inductive Bias Sean Wu, Jordan Hong, Keybai, Gregor Bachmann
PDF OpenReview
Interpretability Analysis on a Pathology Foundation Model Reveals Biologically Relevant Embeddings Across Modalities Nhat Le, Ciyue Shen, Chintan Shah, Blake Martin, Daniel Shenker, Harshith Padigela, Jennifer A. Hipp, Sean Grullon, John Abel, Harsha Vardhan Pokkalla, Dinkar Juyal
PDF OpenReview
Interpretability in Action: Exploratory Analysis of VPT, a Minecraft Agent Karolis Jucys, George Adamopoulos, Mehrab Hamidi, Stephanie Milani, Mohammad Reza Samsami, Artem Zholus, Sonia Joseph, Blake Aaron Richards, Irina Rish, Özgür Şimşek
PDF OpenReview
Interpreting Attention Layer Outputs with Sparse Autoencoders Connor Kissane, Robert Krzyzanowski, Joseph Isaac Bloom, Arthur Conmy, Neel Nanda
PDF OpenReview
Inverse Reinforcement Learning from Demonstrations for LLM Alignment Hao Sun, Mihaela van der Schaar
PDF OpenReview
InversionView: A General-Purpose Method for Reading Information from Neural Activations Xinting Huang, Madhur Panwar, Navin Goyal, Michael Hahn
PDF OpenReview
Invertible Temper Modeling Using Normalizing Flows and the Effects of Structure Preserving Loss Tegan Emerson, Henry Kvinge, Keerti Sahithi Kappagantula, Sylvia Howland
PDF OpenReview
Investigating Generalization Behaviours of Generative Flow Networks Lazar Atanackovic, Emmanuel Bengio
PDF OpenReview
Investigating the Indirect Object Identification Circuit in Mamba Danielle Ensign, Adrià Garriga-Alonso
PDF OpenReview
Investigating the Interpretability of Biometric Face Templates Using Gated Sparse Autoencoders and Differentiable Image Parametrizations Peter Rot, Klemen Grm
PDF OpenReview
Is a Good Description Worth a Thousand Pictures? Reducing Multimodal Alignment to Text-Based, Unimodal Alignment Amin Memarian, Touraj Laleh, Irina Rish, Ardavan S. Nobandegani
PDF OpenReview
Is ChatGPT Transforming Academics' Writing Style? Mingmeng Geng, Roberto Trotta
PDF OpenReview
Is Model Collapse Inevitable? Breaking the Curse of Recursion by Accumulating Real and Synthetic Data Matthias Gerstgrasser, Rylan Schaeffer, Apratim Dey, Rafael Rafailov, Tomasz Korbak, Henry Sleight, Rajashree Agrawal, John Hughes, Dhruv Bhandarkar Pai, Andrey Gromov, Dan Roberts, Diyi Yang, David L. Donoho, Sanmi Koyejo
PDF OpenReview
Is My Data Safe? Predicting Instance-Level Membership Inference Success for White-Box and Black-Box Attacks Tobias Leemann, Bardh Prenkaj, Gjergji Kasneci
PDF OpenReview
Is Persona Enough for Personality? Using ChatGPT to Reconstruct an Agent's Latent Personality from Simple Descriptions Yongyi Ji, Zhisheng Tang, Mayank Kejriwal
PDF OpenReview
Is Poisoning a Real Threat to LLM Alignment? Maybe More so than You Think Pankayaraj Pathmanathan, Souradip Chakraborty, Xiangyu Liu, Yongyuan Liang, Furong Huang
PDF OpenReview
Is Self-Knowledge and Action Consistent or Not: Investigating Large Language Model's Personality Yiming Ai, Zhiwei He, Ziyin Zhang, Wenhong Zhu, Hongkun Hao, Kai Yu, Lingjun Chen, Rui Wang
PDF OpenReview
Is Transformer a Stochastic Parrot? a Case Study in Simple Arithmetic Task Peixu Wang, Chen Yu, Yu Ming
PDF OpenReview
Is Value Functions Estimation with Classification Plug-and-Play for Offline Reinforcement Learning? Denis Tarasov, Kirill Brilliantov, Dmitrii Kharlapenko
PDF OpenReview
Is Value Learning Really the Main Bottleneck in Offline RL? Seohong Park, Kevin Frans, Sergey Levine, Aviral Kumar
PDF OpenReview
It Takes Two: On the Seamlessness Between Reward and Policy Model in RLHF TaiMing Lu, Lingfeng Shen, Xinyu Yang, Weiting Tan, Beidi Chen, Huaxiu Yao
PDF OpenReview
Iteration Head: A Mechanistic Study of Chain-of-Thought Vivien Cabannes, Charles Arnal, Wassim Bouaziz, Xingyu Alice Yang, Francois Charton, Julia Kempe
PDF OpenReview
Iterative Sizing Field Prediction for Adaptive Mesh Generation from Expert Demonstrations Niklas Freymuth, Philipp Dahlinger, Tobias Würth, Philipp Becker, Aleksandar Taranovic, Onno Grönheim, Luise Kärger, Gerhard Neumann
PDF OpenReview
Iterative Theory of Mind Assay of Multimodal AI Models Rohini Elora Das, Rajarshi Das, Niharika Maity, Sreerupa Das
PDF OpenReview
iWISDM: Assessing Instruction Following in Multimodal Models at Scale Xiaoxuan Lei, Lucas Gomez, Hao Yuan Bai, Pouya Bashivan
PDF OpenReview
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent Quentin Gallouédec, Edward Emanuel Beeching, Clément Romac, Emmanuel Dellandrea
PDF OpenReview
Jafar: An Open-Source Genie Reimplemention in JAX Timon Willi, Matthew Thomas Jackson, Jakob Nicolaus Foerster
PDF OpenReview
JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models Patrick Chao, Edoardo Debenedetti, Alexander Robey, Maksym Andriushchenko, Francesco Croce, Vikash Sehwag, Edgar Dobriban, Nicolas Flammarion, George J. Pappas, Florian Tramèr, Hamed Hassani, Eric Wong
PDF OpenReview
Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks Maksym Andriushchenko, Francesco Croce, Nicolas Flammarion
PDF OpenReview
Janus: An Efficient and Expressive Subquadratic Architecture for Modeling Biological Sequences Krithik Ramesh, Sameed Muneeb Siddiqui, Michael Mitzenmacher, Pardis Sabeti
PDF OpenReview
Jina CLIP: Your CLIP Model Is Also Your Text Retriever Han Xiao, Georgios Mastrapas, Bo Wang
PDF OpenReview
Jogging the Memory of Unlearned Models Through Targeted Relearning Attacks Shengyuan Hu, Yiwei Fu, Steven Wu, Virginia Smith
PDF OpenReview
Joint Diffusion Processes as an Inductive Bias in Sheaf Neural Networks Ferran Hernandez Caralt, Guillermo Bernardez, Iulia Duta, Eduard Alarcon, Pietro Lio
PDF OpenReview
Just Read Twice: Closing the Recall Gap for Recurrent Language Models Simran Arora, Aman Timalsina, Aaryan Singhal, Sabri Eyuboglu, Xinyi Zhao, Ashish Rao, Atri Rudra, Christopher Re
PDF OpenReview
Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling Jiatao Gu, Ying Shen, Shuangfei Zhai, Yizhe Zhang, Navdeep Jaitly, Joshua M. Susskind
PDF OpenReview
KalMamba: Towards Efficient Probabilistic State Space Models for RL Under Uncertainty Philipp Becker, Niklas Freymuth, Gerhard Neumann
PDF OpenReview
Knowledge Graph Extraction from Total Synthesis Documents Andres M Bran, Zlatko Jončev, Philippe Schwaller
PDF OpenReview
Landscaping Linear Mode Connectivity Sidak Pal Singh, Linara Adilova, Michael Kamp, Asja Fischer, Bernhard Schölkopf, Thomas Hofmann
PDF OpenReview
Language Adaptation on a Tight Academic Compute Budget: Tokenizer Swapping Works and Pure Bfloat16 Is Enough Konstantin Dobler, Gerard de Melo
PDF OpenReview
Language Alignment via Nash-Learning and Adaptive Feedback Ari Azarafrooz, Farshid Faal
PDF OpenReview
Language Model-in-the-Loop: Data Optimal Approach to Recommend Actions in Text Games Arjun V Sudhakar, Prasanna Parthasarathi, Janarthanan Rajendran, Sarath Chandar
PDF OpenReview
Language Models Linearly Represent Sentiment Curt Tigges, Oskar John Hollinsworth, Atticus Geiger, Neel Nanda
PDF OpenReview
Large Language Models Are Bad Game Theoretic Reasoners: Evaluating Performance and Bias in Two-Player Non-Zero-Sum Games Nathan Herr, Fernando Acero, Roberta Raileanu, Maria Perez-Ortiz, Zhibin Li
PDF OpenReview
Large Language Models Are Frame-Level Directors for Zero-Shot Text-to-Video Generation Susung Hong, Junyoung Seo, Heeseong Shin, Sunghwan Hong, Seungryong Kim
PDF OpenReview
Large Language Models Are Not Inverse Thinkers Quite yet Haoran Zhao
PDF OpenReview
Large Language Models as Misleading Assistants in Conversation Betty Li Hou, Kejian Shi, Jason Phang, James Aung, Steven Adler, Rosie Campbell
PDF OpenReview
Large Language Models Can Self-Correct with Minimal Effort Zhenyu Wu, Qingkai Zeng, Zhihan Zhang, Zhaoxuan Tan, Chao Shen, Meng Jiang
PDF OpenReview
Large Language Models for Automated Open-Domain Scientific Hypotheses Discovery Zonglin Yang, Xinya Du, Junxian Li, Jie Zheng, Soujanya Poria, Erik Cambria
PDF OpenReview
Large Language Models Lack Understanding of Character Composition of Words Andrew Shin, Kunitake Kaneko
PDF OpenReview
Large-Scale Discovery of Experimental Designs in Super-Resolution Microscopy with XLuminA Carla Rodríguez, Sören Arlt, Leonhard Möckl, Mario Krenn
PDF OpenReview
Latent Functional Maps Marco Fumero, Marco Pegoraro, Valentino Maiorca, Francesco Locatello, Emanuele Rodolà
PDF OpenReview
Latent Functional Maps Marco Fumero, Marco Pegoraro, Valentino Maiorca, Francesco Locatello, Emanuele Rodolà
PDF OpenReview
Latent-Guided Equivariant Diffusion for Controlled Structure-Based De Novo Ligand Generation Tuan Le, Julian Cremer, Djork-Arné Clevert, Kristof T Schütt
PDF OpenReview
LAuReL: Learned Augmented Residual Layer Gaurav Menghani, Ravi Kumar, Sanjiv Kumar
PDF OpenReview
LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference Qichen Fu, Minsik Cho, Thomas Merth, Sachin Mehta, Mohammad Rastegari, Mahyar Najibi
PDF OpenReview
Lean4trace: Data Augmentation for Neural Theorem Proving in Lean Vasilii Nesterov, Yermek Kapushev, Mikhail Burtsev
PDF OpenReview
Learnability of Parameter-Bounded Bayes Nets Arnab Bhattacharyya, Davin Choo, Sutanu Gayen, Dimitrios Myrisiotis
PDF OpenReview
Learned Best-Effort LLM Serving Siddharth Jha, Coleman Richard Charles Hooper, Xiaoxuan Liu, Sehoon Kim, Kurt Keutzer
PDF OpenReview
Learning and Unlearning of Fabricated Knowledge in Language Models Chen Sun, Nolan Andrew Miller, Andrey Zhmoginov, Max Vladymyrov, Mark Sandler
PDF OpenReview
Learning Cure Kinetics of Frontal Polymerization PDEs Using Differentiable Simulations Pengfei Cai, Qibang Liu, Philippe Geubelle, Rafael Gomez-Bombarelli
PDF OpenReview
Learning Diffeomorphic Lyapunov Functions from Data Samuel Tesfazgi, Leonhard Sprandl, Sandra Hirche
PDF OpenReview
Learning Efficient Recursive Numeral Systems via Reinforcement Learning Jonathan David Thomas, Andrea Silvi, Devdatt Dubhashi, Emil Carlsson, Moa Johansson
PDF OpenReview
Learning Fast and Slow: Representations for In-Context Weight Modulation Andrey Zhmoginov, Jihwan Lee, Max Vladymyrov, Mark Sandler
PDF OpenReview
Learning Generative Population Models from Multiple Clinical Datasets via Probabilistic Programming João Loula, Katherine M. Collins, Ulrich Schaechtle, Joshua B. Tenenbaum, Adrian Weller, Feras Saad, Timothy J. O'Donnell, Vikash Mansinghka
PDF OpenReview
Learning High-Dimensional Mixed Models via Amortized Variational Inference Priscilla Ong, Manuel Haussmann, Harri Lähdesmäki
PDF OpenReview
Learning HJB Viscosity Solutions with PINNs for Continuous-Time Reinforcement Learning Alena Shilova, Thomas Delliaux, Philippe Preux, Bruno Raffin
PDF OpenReview
Learning In-Context Decision Making with Synthetic MDPs Akarsh Kumar, Chris Lu, Louis Kirsch, Phillip Isola
PDF OpenReview
Learning Latent Graph Structures and Their Uncertainty Alessandro Manenti, Daniele Zambon, Cesare Alippi
PDF OpenReview
Learning Long Timescale in Molecular Dynamics by Nano-GPT Yuan Yao, Wenqi Zeng
PDF OpenReview
Learning Multi-Index Models with Neural Networks via Mean-Field Langevin Dynamics Alireza Mousavi-Hosseini, Denny Wu, Murat A Erdogdu
PDF OpenReview
Learning Nash Equilibria in Zero-Sum Markov Games: A Single-Timescale Algorithm Under Weak Reachability Reda Ouhamma, Maryam Kamgarpour
PDF OpenReview
Learning Sequence Models Through Consolidation Eleanor Spens, Neil Burgess
PDF OpenReview
Learning Set Functions with Implicit Differentiation Gözde Özcan, Chengzhi Shi, Stratis Ioannidis
PDF OpenReview
Learning Stable Allocations of Strictly Convex Stochastic Cooperative Games Nam Phuong Tran, The-Anh Ta, Shuqing Shi, Debmalya Mandal, Yali Du, Long Tran-Thanh
PDF OpenReview
Learning Symmetries via Weight-Sharing with Doubly Stochastic Tensors Putri A Van der Linden, Alejandro García Castellanos, Sharvaree Vadgama, Thijs P. Kuipers, Erik J Bekkers
PDF OpenReview
Learning Syntax Without Planting Trees: Understanding When and Why Transformers Generalize Hierarchically Kabir Ahuja, Vidhisha Balachandran, Madhur Panwar, Tianxing He, Noah A. Smith, Navin Goyal, Yulia Tsvetkov
PDF OpenReview
Learning Task Representations from In-Context Learning Baturay Saglam, Zhuoran Yang, Dionysis Kalogerias, Amin Karbasi
PDF OpenReview
Learning the Boundary-to-Domain Mapping Using Lifting Product Fourier Neural Operators for Partial Differential Equations Aditya Kashi, Arka Daw, Muralikrishnan Gopalakrishnan Meena, Hao Lu
PDF OpenReview
Learning the Eye of the Beholder: Statistical Modeling and Estimation for Personalized Color Perception Xuanzhou Chen, Austin Xu, Jingyan Wang, Ashwin Pananjady
PDF OpenReview
Learning to Assist Humans Without Inferring Rewards Vivek Myers, Evan Ellis, Benjamin Eysenbach, Sergey Levine, Anca Dragan
PDF OpenReview
Learning to Design Data-Structures: A Case Study of Nearest Neighbor Search Omar Salemohamed, Vatsal Sharan, Shivam Garg, Laurent Charlin, Gregory Valiant
PDF OpenReview
Learning to Explore with Lagrangians for Bandits Under Unknown Constraints Udvas Das, Debabrota Basu
PDF OpenReview
Learning to Grok: Emergence of In-Context Learning and Skill Composition in Modular Arithmetic Tasks Tianyu He, Darshil Doshi, Aritra Das, Andrey Gromov
PDF OpenReview
Learning to Reason by Failing: Offline RL on Sub-Optimal Rollouts Scales Synthetic Data by 8x Amrith Setlur, Saurabh Garg, Xinyang Geng, Naman Garg, Virginia Smith, Aviral Kumar
PDF OpenReview
Learning to Reduce: Towards Improving Performance of Large Language Models on Structured Data Younghun Lee, Sungchul Kim, Ryan A. Rossi, Tong Yu, Xiang Chen
PDF OpenReview
Learning to Steer Markovian Agents Under Model Uncertainty Jiawei Huang, Vinzenz Thoma, Zebang Shen, Heinrich H. Nax, Niao He
PDF OpenReview
Learning When to Trust the Expert for Guided Exploration in RL Felix Schulz, Jasper Hoffmann, Yuan Zhang, Joschka Boedecker
PDF OpenReview
LEGENT: Open Platform for Embodied Agents Zhili Cheng, Jinyi Hu, Zhitong Wang, Yuge Tu, Shengding Hu, An Liu, Pengkai Li, Lei Shi, Zhiyuan Liu, Maosong Sun
PDF OpenReview
Leveraging Generative Foundation Models for Domain Generalization Sobhan Hemati, Mahdi Beitollahi, Amir Hossein Estiri, Bassel Al Omari, Xi Chen, Guojun Zhang
PDF OpenReview
Leveraging Multi-Color Spaces as a Defense Mechanism Against Model Inversion Attack Sofiane Ouaari, Ali Burak Ünal, Mete Akgün, Nico Pfeifer
PDF OpenReview
Leveraging Topological Guidance for Improved Knowledge Distillation Eun Som Jeon, Rahul Khurana, Aishani Pathak, Pavan K. Turaga
PDF OpenReview
Lift Your Molecules: Molecular Graph Generation in Latent Euclidean Space Mohamed Amine Ketata, Nicholas Gao, Johanna Sommer, Tom Wollschläger, Stephan Günnemann
PDF OpenReview
Lifted Residual Score Estimation Tejas Jayashankar, Jongha Jon Ryu, Xiangxiang Xu, Gregory W. Wornell
PDF OpenReview
LIFTED: Multimodal Mixture-of-Experts for Clinical Trial Outcome Prediction Wenhao Zheng, Dongshen Peng, Hongxia Xu, Yun Li, Hongtu Zhu, Tianfan Fu, Huaxiu Yao
PDF OpenReview
Likelihood-Based Fine-Tuning of Protein Language Models for Few-Shot Fitness Prediction and Design Alex Hawkins-Hooker, Jakub Kmec, Oliver Bent, Paul Duckworth
PDF OpenReview
Likelihood-Based Fine-Tuning of Protein Language Models for Few-Shot Fitness Prediction and Design Alex Hawkins-Hooker, Jakub Kmec, Oliver Bent, Paul Duckworth
PDF OpenReview
Limitations of scRNA-Seq Zero-Imputation Methods for Network Inference Ankit Bhardwaj, Joshua Weiner, Preetha Balasubramanian, Lakshmi Subramanian
PDF OpenReview
Linear Transformers Are Versatile In-Context Learners Max Vladymyrov, Johannes von Oswald, Mark Sandler, Rong Ge
PDF OpenReview
Linear Weight Interpolation Leads to Transient Performance Gains Gaurav Iyer, Gintare Karolina Dziugaite, David Rolnick
PDF OpenReview
Liouna: Biologically Plausible Learning for Efficient Pre-Training of Transferrable Deep Models Fady Rezk, Antreas Antoniou, Henry Gouk, Timothy Hospedales
PDF OpenReview
LLM Circuit Analyses Are Consistent Across Training and Scale Curt Tigges, Michael Hanna, Qinan Yu, Stella Biderman
PDF OpenReview
LLM Processes: Numerical Predictive Distributions Conditioned on Natural Language James Requeima, John F Bronskill, Dami Choi, Richard E. Turner, David Duvenaud
PDF OpenReview
LLM Sample: Part Average and Part Ideal Sarath Sivaprasad, Pramod Kaushik, Sahar Abdelnabi, Mario Fritz
PDF OpenReview
LLM Task Interference: Impact of Task-Switch in Conversational History Akash Gupta, Ivaxi Sheth, Vyas Raina, Mark Gales, Mario Fritz
PDF OpenReview
LLM-Informed Discrete Prompt Optimization Zeeshan Memon, Muhammad Arham, Adnan Ul-Hasan, Faisal Shafait
PDF OpenReview
LLM3: Large Language Model-Based Task and Motion Planning with Motion Failure Reasoning Shu Wang, Muzhi Han, Ziyuan Jiao, Zeyu Zhang, Ying Nian Wu, Song-Chun Zhu, Hangxin Liu
PDF OpenReview
LLMs at the Bargaining Table Yuan Deng, Vahab Mirrokni, Renato Paes Leme, Hanrui Zhang, Song Zuo
PDF OpenReview
LLMs Learn Governing Principles of Dynamical Systems, Revealing an In-Context Neural Scaling Law Toni J.B. Liu, Nicolas Boulle, Raphaël Sarfati, Christopher Earls
PDF OpenReview
Local Lateral Connectivity Is Sufficient for Replicating Cortex-like Topographical Organization in Deep Neural Networks Xinyu Qian, Amirozhan Dehghani, Asa Borzabadifarahani, Pouya Bashivan
PDF OpenReview
Local to Global: Learning Dynamics and Effect of Initialization for Transformers Ashok Vardhan Makkuva, Marco Bondaschi, Chanakya Ekbote, Adway Girish, Alliot Nagle, Hyeji Kim, Michael Gastpar
PDF OpenReview
Localized Zeroth-Order Prompt Optimization Wenyang Hu, Yao Shu, Zongmin Yu, Zhaoxuan Wu, Xiaoqiang Lin, Zhongxiang Dai, See-Kiong Ng, Bryan Kian Hsiang Low
PDF OpenReview
Localizing Auditory Concepts in CNNs Pratyaksh Gautam, Makarand Tapaswi, Vinoo Alluri
PDF OpenReview
Locally Interdependent Multi-Agent MDP: Theoretical Framework for Decentralized Agents with Dynamic Dependencies Alex DeWeese, Guannan Qu
PDF OpenReview
Logical Distillation of Graph Neural Networks Alexander Pluska, Pascal Welke, Thomas Gärtner, Sagar Malhotra
PDF OpenReview
Long Context Understanding Using Self-Generated Synthetic Data Jerry Li, Subhro Das, Aude Oliva, Dmitry Krotov, Leonid Karlinsky, Rogerio Feris
PDF OpenReview
Long-Context Vision Large Language Models: Empirical Insights and a Baseline Yongshuo Zong, Ismail Elezi, Yongxin Yang, Jiankang Deng, Timothy Hospedales
PDF OpenReview
Long-Horizon Planning for Multi-Agent Robots in Partially Observable Environments Siddharth Nayak, Adelmo Morrison Orozco, Marina Ten Have, Jackson Zhang, Vittal Thirumalai, Darren Chen, Aditya Kapoor, Eric Robinson, Karthik Gopalakrishnan, James Harrison, Anuj Mahajan, Brian Ichter, Hamsa Balakrishnan
PDF OpenReview
LongAlign: A Recipe for Long Context Alignment of Large Language Models Yushi Bai, Xin Lv, Jiajie Zhang, Yuze He, Ji Qi, Lei Hou, Jie Tang, Yuxiao Dong, Juanzi Li
PDF OpenReview
Look Before You Leap: A Universal Emergent Decomposition of Retrieval Tasks in Language Models Alexandre Variengien, Eric Winsor
PDF OpenReview
Looking at Deep Learning Phenomena Through a Telescoping Lens Alan Jeffares, Alicia Curth, Mihaela van der Schaar
PDF OpenReview
LoQT: Low Rank Adapters for Quantized Training Sebastian Bugge Loeschcke, Mads Toftrup, Michael Kastoryano, Serge Belongie, Vésteinn Snæbjarnarson
PDF OpenReview
LoRD: Low-Rank Decomposition of Monolingual Code LLMs for One-Shot Compression Ayush Kaushal, Tejas Vaidhya, Irina Rish
PDF OpenReview
Lorentzian Residual Neural Networks Neil He, Menglin Yang, Rex Ying
PDF OpenReview
Loss in the Crowd: Hidden Breakthroughs in Language Model Training Sara Kangaslahti, Elan Rosenfeld, Naomi Saphra
PDF OpenReview
Loss Landscape Geometry Reveals Stagewise Development of Transformers George Wang, Matthew Farrugia-Roberts, Jesse Hoogland, Liam Carroll, Susan Wei, Daniel Murfet
PDF OpenReview
Lost in Translation: The Algorithmic Gap Between LMs and the Brain Tosato Tommaso, Tikeng Notsawo Pascal Junior, Helbling Saskia, Irina Rish, Guillaume Dumas
PDF OpenReview
Lottery Ticket Adaptation: Mitigating Destructive Interference in LLMs Ashwinee Panda, Berivan Isik, Xiangyu Qi, Sanmi Koyejo, Tsachy Weissman, Prateek Mittal
PDF OpenReview
Lottery Ticket Adaptation: Mitigating Destructive Interference in LLMs Ashwinee Panda, Berivan Isik, Xiangyu Qi, Sanmi Koyejo, Tsachy Weissman, Prateek Mittal
PDF OpenReview
Lottery Ticket Adaptation: Mitigating Destructive Interference in LLMs Ashwinee Panda, Berivan Isik, Xiangyu Qi, Sanmi Koyejo, Tsachy Weissman, Prateek Mittal
PDF OpenReview
Low Rank Quantization-Aware Training for LLMs Yelysei Bondarenko, Riccardo Del Chiaro, Markus Nagel
PDF OpenReview
Low-Rank Linearization of Large Language Models Michael Zhang, Aaryan Singhal, Benjamin Frederick Spector, Simran Arora, Christopher Re
PDF OpenReview
Lowering PyTorch's Memory Consumption for Selective Differentiation Samarth Bhatia, Felix Dangel
PDF OpenReview
Machine Learning Nominal Max Oxygen Consumption from Wearable Reflective Pulse Oximetry with Density Functional Theory Saleem Abdul Fattah Ahmed Al Dajani, Frédéric Laquai
PDF OpenReview
MAGNOLIA: Matching Algorithms via GNNs for Online Value-to-Go Approximation Alexandre Hayderi, Amin Saberi, Ellen Vitercik, Anders Wikum
PDF OpenReview
Mamba-PTQ: Outlier Channels in Recurrent Large Language Models Alessandro Pierro, Steven Abreu
PDF OpenReview
Manifold-Constrained Nucleus-Level Denoising Diffusion Model for Structure-Based Drug Design Shengchao Liu, Liang Yan, Weitao Du, Weiyang Liu, Hongyu Guo, Christian Borgs, Jennifer T Chayes, Anima Anandkumar
PDF OpenReview
Manipulating Feature Visualizations with Gradient Slingshots Dilyara Bareeva, Marina MC Höhne, Alexander Warnecke, Lukas Pirch, Klaus Robert Muller, Konrad Rieck, Kirill Bykov
PDF OpenReview
Manipulating Feature Visualizations with Gradient Slingshots Dilyara Bareeva, Marina MC Höhne, Alexander Warnecke, Lukas Pirch, Klaus Robert Muller, Konrad Rieck, Kirill Bykov
PDF OpenReview
Many-Shot In-Context Learning Rishabh Agarwal, Avi Singh, Lei M Zhang, Bernd Bohnet, Luis Rosias, Stephanie C.Y. Chan, Biao Zhang, Ankesh Anand, Zaheer Abbas, Azade Nova, John D Co-Reyes, Eric Chu, Feryal Behbahani, Aleksandra Faust, Hugo Larochelle
PDF OpenReview
Many-Shot In-Context Learning Rishabh Agarwal, Avi Singh, Lei M Zhang, Bernd Bohnet, Luis Rosias, Stephanie C.Y. Chan, Biao Zhang, Aleksandra Faust, Hugo Larochelle
PDF OpenReview
Many-Shot In-Context Learning for Molecular Inverse Design Saeed Moayedpour, Alejandro Corrochano-Navarro, Faryad Sahneh, Alexander Koetter, Jiří Vymětal, Lorenzo Kogler Anele, Pablo Mas, Yasser Jangjoo, Sizhen Li, Michael Bailey, Marc Bianciotto, Hans Matter, Christoph Grebner, Gerhard Hessler, Ziv Bar-Joseph, Sven Jager
PDF OpenReview
Many-Shot In-Context Learning in Multimodal Foundation Models Yixing Jiang, Jeremy Andrew Irvin, Ji Hun Wang, Muhammad Ahmed Chaudhry, Jonathan H Chen, Andrew Y. Ng
PDF OpenReview
Many-to-Many Image Generation with Auto-Regressive Diffusion Models Ying Shen, Yizhe Zhang, Shuangfei Zhai, Lifu Huang, Joshua M. Susskind, Jiatao Gu
PDF OpenReview
MAP-THOR: Benchmarking Long-Horizon Multi-Agent Planning Frameworks in Partially Observable Environments Siddharth Nayak, Adelmo Morrison Orozco, Marina Ten Have, Vittal Thirumalai, Jackson Zhang, Darren Chen, Aditya Kapoor, Eric Robinson, Karthik Gopalakrishnan, Brian Ichter, James Harrison, Anuj Mahajan, Hamsa Balakrishnan
PDF OpenReview
MaPPing Your Model: Assessing the Impact of Adversarial Attacks on LLM-Based Programming Assistants John Heibel, Daniel Lowd
PDF OpenReview
Marginal Fairness Sliced Wasserstein Barycenter Khai Nguyen, Hai Nguyen, Nhat Ho
PDF OpenReview
Markov Persuasion Processes: How to Persuade Multiple Agents from Scratch Francesco Bacchiocchi, Francesco Emanuele Stradi, Matteo Castiglioni, Nicola Gatti, Alberto Marchesi
PDF OpenReview
Marrying Causal Representation Learning with Dynamical Systems for Science Dingling Yao, Caroline Muller, Francesco Locatello
PDF OpenReview
Masking in Molecular Graphs Leveraging Reaction Context Jiannan Yang, Veronika Thost, Tengfei Ma
PDF OpenReview
Matching Domain Experts by Training from Scratch on Domain Knowledge Xiaoliang Luo, Guangzhi Sun, Bradley C. Love
PDF OpenReview
Mathematical Models of Computation in Superposition Kaarel Hänni, Jake Mendel, Dmitry Vaintrob, Lawrence Chan
PDF OpenReview
MaxMin-RLHF: Towards Equitable Alignment of Large Language Models with Diverse Human Preferences Souradip Chakraborty, Jiahao Qiu, Hui Yuan, Alec Koppel, Furong Huang, Dinesh Manocha, Amrit Bedi, Mengdi Wang
PDF OpenReview
Measuring Goal-Directedness Matt MacDermott, James Fox, Francesco Belardinelli, Tom Everitt
PDF OpenReview
Measuring Progress in Dictionary Learning for Language Model Interpretability with Board Game Models Adam Karvonen, Benjamin Wright, Can Rager, Rico Angell, Jannik Brinkmann, Logan Riggs Smith, Claudio Mayrink Verdun, David Bau, Samuel Marks
PDF OpenReview
Mechanism Design for Large Language Models Paul Duetting, Vahab Mirrokni, Renato Paes Leme, Haifeng Xu, Song Zuo
PDF OpenReview
Mechanistic Interpretability of Binary and Ternary Transformer Networks Jason Li
PDF OpenReview
Medical Unlearnable Examples: Securing Medical Data from Unauthorized Training via Sparsity-Aware Local Masking Weixiang Sun, Yixin Liu, Zhiling Yan, Kaidi Xu, Lichao Sun
PDF OpenReview
Memory and Bandwidth Are All You Need for Fully Sharded Data Parallel Jiangtao Wang, Jan Ebert, Oleg Filatov, Stefan Kesselheim
PDF OpenReview
Merging Improves Self-Critique Against Jailbreak Attacks Victor Gallego
PDF OpenReview
Merging Text Transformer Models from Different Initializations Neha Verma, Maha Elbayad
PDF OpenReview
MESS: Modern Electronic Structure Simulations Hatem Helal, Andrew W Fitzgibbon
PDF OpenReview
Message-Passing Monte Carlo: Generating Low-Discrepancy Point Sets via Graph Neural Networks T. Konstantin Rusch, Nathan Kirk, Michael M. Bronstein, Christiane Lemieux, Daniela Rus
PDF OpenReview
Meta Flow Matching: Integrating Vector Fields on the Wasserstein Manifold Lazar Atanackovic, Xi Zhang, Brandon Amos, Mathieu Blanchette, Leo J Lee, Yoshua Bengio, Alexander Tong, Kirill Neklyudov
PDF OpenReview
Meta-Designing Quantum Experiments with Language Models Sören Arlt, Haonan Duan, Felix Li, Sang Michael Xie, Yuhuai Wu, Mario Krenn
PDF OpenReview
Meta-Optimization for Deep Learning via Nonstochastic Control Xinyi Chen, Evan Dogariu, Zhou Lu, Elad Hazan
PDF OpenReview
Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving Aniket Rajiv Didolkar, Anirudh Goyal, Nan Rosemary Ke, Siyuan Guo, Michal Valko, Timothy P Lillicrap, Danilo Jimenez Rezende, Yoshua Bengio, Michael Curtis Mozer, Sanjeev Arora
PDF OpenReview
MetaGFN: Exploring Distant Modes with Adapted Metadynamics for Continuous GFlowNets Dominic Phillips, Flaviu Cipcigan
PDF OpenReview
Metric Learning for Clifford Group Equivariant Neural Networks Riccardo Ali, Paulina Kulytė, Haitz Sáez de Ocáriz Borde, Pietro Lio
PDF OpenReview
Mimicking User Data: On Mitigating Fine-Tuning Risks in Closed Large Language Models Francisco Eiras, Aleksandar Petrov, Philip Torr, M. Pawan Kumar, Adel Bibi
PDF OpenReview
Mind-to-Image: Projecting Visual Mental Imagination of the Brain from fMRI Hugo Caselles-Dupré, Charles Mellerio, Herent, Alizée Lopez-Persem, Benoît Béranger, Pierre Fautrel, Gauthier Vernier, Matthieu Cord
PDF OpenReview
MInference: Accelerating Pre-Filling for Long-Context LLMs via Dynamic Sparse Attention Huiqiang Jiang, Yucheng Li, Chengruidong Zhang, Qianhui Wu, Xufang Luo, Surin Ahn, Zhenhua Han, Amir H. Abdi, Dongsheng Li, Chin-Yew Lin, Yuqing Yang, Lili Qiu
PDF OpenReview
MINI-SEQUENCE TRANSFORMER: Optimizing Intermediate Memory for Long Sequences Training Cheng Luo, Jiawei Zhao, Zhuoming Chen, Beidi Chen, Anima Anandkumar
PDF OpenReview
Minimax Tree of Thoughts: Playing Two-Player Zero-Sum Sequential Games with Large Language Models Wei Guo, Xiaotian Hao, Jianye Hao, Yan Zheng
PDF OpenReview
MiniMol: A Parameter-Efficient Foundation Model for Molecular Learning Kerstin Klaser, Blazej Banaszewski, Samuel Maddrell-Mander, Callum McLean, Luis Müller, Ali Parviz, Shenyang Huang, Andrew W Fitzgibbon
PDF OpenReview
Missed Causes and Ambiguous Effects: Counterfactuals Pose Challenges for Interpreting Neural Networks Aaron Mueller
PDF OpenReview
Mission Impossible: A Statistical Perspective on Jailbreaking LLMs Jingtong Su, Julia Kempe, Karen Ullrich
PDF OpenReview
Misspecified $q$-Learning with Sparse Linear Function Approximation: Tight Bounds on Approximation Error Ally Yalei Du, Lin Yang, Ruosong Wang
PDF OpenReview
Mitigate Position Bias in Large Language Models via Scaling a Single Dimension Yijiong Yu, Huiqiang Jiang, Xufang Luo, Qianhui Wu, Chin-Yew Lin, Dongsheng Li, Yuqing Yang, Yongfeng Huang, Lili Qiu
PDF OpenReview
Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy Cameron Allen, Aaron T. Kirtland, Ruo Yu Tao, Sam Lobel, Daniel Scott, Nicholas Petrocelli, Omer Gottesman, Ronald Parr, Michael Littman, George Konidaris
PDF OpenReview
Mixed-Curvature Decision Trees and Random Forests Philippe Chlenski, Quentin Chu, Itsik Pe'er
PDF OpenReview
MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge? Zhaorun Chen, Yichao Du, Zichen Wen, Yiyang Zhou, Chenhang Cui, Zhenzhen Weng, Haoqin Tu, Chaoqi Wang, Zhengwei Tong, Leria Huang, Canyu Chen, Qinghao Ye, Zhihong Zhu, Yuqing Zhang, Jiawei Zhou, Zhuokai Zhao, Rafael Rafailov, Chelsea Finn, Huaxiu Yao
PDF OpenReview
Mobile and Edge Evaluation of Large Language Models Stefanos Laskaridis, Kleomenis Katevas, Lorenzo Minto, Hamed Haddadi
PDF OpenReview
Model Based Diffusion for Trajectory Optimization Chaoyi Pan, Zeji Yi, Guanya Shi, Guannan Qu
PDF OpenReview
Model Breadcrumbs: Scalable Upcycling of Finetuned Foundation Models via Sparse Task Vectors Merging MohammadReza Davari, Eugene Belilovsky
PDF OpenReview
Model-Agnostic Graph Dataset Compression with the Tree Mover’s Distance Mika Sarkin Jain, Stefanie Jegelka, Ishani Karmarkar, Luana Ruiz, Ellen Vitercik
PDF OpenReview
Modeling Bilingual Disfluencies with Large Language Models Negin Raoof, Yating Wu, Carlos Bonilla, Junyi Jessy Li, Stephanie M Grasso, Alex Dimakis, Zoi Gkalitsiou
PDF OpenReview
Modeling Droplets Dynamics in Emulsions with Graph Neural Networks Giulio Ortali, Federico Toschi, Jan-Willem van de Meent
PDF OpenReview
Modeling the Plurality of Human Preferences via Ideal Points Daiwei Chen, Yi Chen, Aniket Rege, Ramya Korlakai Vinayak
PDF OpenReview
Modeling the Plurality of Human Preferences via Ideal Points Daiwei Chen, Yi Chen, Aniket Rege, Ramya Korlakai Vinayak
PDF OpenReview
Modelling Latent Dynamical Systems with Recognition-Parametrised Models Samo Hromadka, Maneesh Sahani
PDF OpenReview
Models That Prove Their Own Correctness Noga Amit, Shafi Goldwasser, Orr Paradise, Guy N. Rothblum
PDF OpenReview
Models That Prove Their Own Correctness Noga Amit, Shafi Goldwasser, Orr Paradise, Guy N. Rothblum
PDF OpenReview
Models That Prove Their Own Correctness Noga Amit, Shafi Goldwasser, Orr Paradise, Guy N. Rothblum
PDF OpenReview
Models That Prove Their Own Correctness Noga Amit, Shafi Goldwasser, Orr Paradise, Guy N. Rothblum
PDF OpenReview
Modularity in Biologically Inspired Representations Depends on Task Variable Range Independence Will Dorrell, Kyle Hsu, Luke Hollingsworth, Jin Hwa Lee, Jiajun Wu, Chelsea Finn, Peter E. Latham, Timothy Edward John Behrens, James C. R. Whittington
PDF OpenReview
MolEval: An Evaluation Toolkit for Molecular Embeddings via LLMs Shaghayegh Sadeghi, Ali Forooghi, Jianguo Lu, Alioune Ngom
PDF OpenReview
MolGene-E: Inverse Molecular Design to Modulate Single Cell Transcriptomics Rahul Ohlan, Raswanth Murugan, Li Xie, Mohammadsadeq Mottaqi, Shuo Zhang, Lei Xie
PDF OpenReview
MONGOOSE: Path-Wise Smooth Bayesian Optimisation via Meta-Learning Adam X. Yang, Laurence Aitchison, Henry Moss
PDF OpenReview
More Details, Please: Improving Autoformalization with More Detailed Proofs Guillem Tarrach, Albert Q. Jiang, Daniel Raggi, Wenda Li, Mateja Jamnik
PDF OpenReview
MoRe Fine-Tuning with 10x Fewer Parameters Wenxuan Tan, Nicholas Roberts, Tzu-Heng Huang, Jitian Zhao, John Cooper, Samuel Guo, Chengyu Duan, Frederic Sala
PDF OpenReview
MoRe Fine-Tuning with 10x Fewer Parameters Wenxuan Tan, Nicholas Roberts, Tzu-Heng Huang, Jitian Zhao, John Cooper, Samuel Guo, Chengyu Duan, Frederic Sala
PDF OpenReview
MoReDrop: Dropout Without Dropping Li Jiang, Duo Li, Yichuan Ding, Xue Liu, Victor Wai Kin Chan
PDF OpenReview
MSA Pairing Transfomer: Protein Interaction Partner Prediction with Few-Shot Contrastive Learning Alex Hawkins-Hooker, Daniel Burkhardt Cerigo, Umberto Lupo, David Jones, Brooks Paige
PDF OpenReview
MSAGPT: Neural Prompting Protein Structure Prediction via MSA Generative Pre-Training Bo Chen, Zhilei Bei, Xingyi Cheng, Pan Li, Jie Tang, Le Song
PDF OpenReview
MSAMamba: Adapting Subquadratic Models to Long-Context DNA MSA Analysis Vishrut Thoutam, Dina Ellsworth
PDF OpenReview
MSAMamba: Adapting Subquadratic Models to Long-Context DNA MSA Analysis Vishrut Thoutam, Dina Ellsworth
PDF OpenReview
Multi-Agent Imitation Learning: Value Is Easy, Regret Is Hard Jingwu Tang, Gokul Swamy, Fei Fang, Steven Wu
PDF OpenReview
Multi-Agent Imitation Learning: Value Is Easy, Regret Is Hard Jingwu Tang, Gokul Swamy, Fei Fang, Steven Wu
PDF OpenReview
Multi-Frequency Progressive Refinement for Learned Inverse Scattering Owen Melia, Olivia Tsang, Vasileios Charisopoulos, Yuehaw Khoo, Jeremy Hoskins, Rebecca Willett
PDF OpenReview
Multi-Modal and Multi-Agent Systems Meet Rationality: A Survey Bowen Jiang, Yangxinyu Xie, Xiaomeng Wang, Weijie J Su, Camillo Jose Taylor, Tanwi Mallick
PDF OpenReview
Multi-Modal and Multi-Task Transformer for Small Molecule Drug Discovery Sai Krishna Sirumalla, David Stephen Farina Jr, Zhuoran Qiao, Daniele Alessandro Di Cesare, Felipe Costas Farias, Michael Bernard O’Connor, Peter John Bygrave, Feizhi Ding, Thomas Dresselhaus, Marcelo Gomes Pereira de Lacerda, Jason Matthew Swails, Daniel Miles, Matthew Welborn, Fred Manby, Thomas Miller
PDF OpenReview
Multi-Objective Differentiable Neural Architecture Search Rhea Sanjay Sukthanker, Arber Zela, Benedikt Staffler, Samuel Dooley, Josif Grabocka, Frank Hutter
PDF OpenReview
Multi-Objective Guidance via Importance Sampling for Target-Aware Diffusion-Based De Novo Ligand Generation Julian Cremer, Tuan Le, Frank Noe, Djork-Arné Clevert, Kristof T Schütt
PDF OpenReview
Multi-Task Extension of Geometrically Aligned Transfer Encoder Sung Moon Ko, Sumin Lee, Dae-Woong Jeong, Hyunseung Kim, Chanhui Lee, Soorin Yim, Sehui Han
PDF OpenReview
Multi-Task Training Increases Native Sequence Recovery of Antigen-Specific T-Cell Receptor Sequences Dhuvarakesh Karthikeyan, Alex Rubinsteyn
PDF OpenReview
Multilingual Compression Parity: How Efficiently Large Language Models Represent Information Across Languages? Alexander Tsvetkov, Alon Kipnis
PDF OpenReview
Multimodal Foundation World Models for Generalist Embodied Agents Pietro Mazzaglia, Tim Verbelen, Bart Dhoedt, Aaron Courville, Sai Rajeswar
PDF OpenReview
Multiple-Policy Evaluation via Density Estimation Yilei Chen, Aldo Pacchiano, Ioannis Paschalidis
PDF OpenReview
MultiScale Policy Learning for Alignment with Long Term Objectives Richa Rastogi, Yuta Saito, Thorsten Joachims
PDF OpenReview
Multivector Neurons: Better and Faster O(n)-Equivariant Clifford GNNs Cong Liu, David Ruhe, Patrick Forré
PDF OpenReview
Navigating Chemical Space with Latent Flows Guanghao Wei, Yining Huang, Chenru Duan, Yue Song, Yuanqi Du
PDF OpenReview
Navigating Trustworthiness of Deep Learning in ∆∆g Prediction : Addressing Data Bias, Model Evaluation, and Interpretation Ruochi Zhang, Ningning Chen, Fengfeng Zhou, Xin Gao
PDF OpenReview
NCIDiff: Non-Covalent Interaction-Generative Diffusion Model for Improving Reliability of 3D Molecule Generation Inside Protein Pocket Joongwon Lee, Wonho Zhung, Woo Youn Kim
PDF OpenReview
NEBULA: Neural Empirical Bayes Under Latent Representations for Efficient and Controllable Design of Molecular Libraries Ewa Nowara, Pedro O. Pinheiro, Sai Pooja Mahajan, Omar Mahmood, Andrew Martin Watkins, Saeed Saremi, Michael Maser
PDF OpenReview
NEORL: Efficient Exploration for Nonepisodic RL Bhavya Sukhija, Lenart Treven, Florian Dorfler, Stelian Coros, Andreas Krause
PDF OpenReview
Neural Collapse Versus Low-Rank Bias: Is Deep Neural Collapse Really Optimal? Peter Súkeník, Marco Mondelli, Christoph H. Lampert
PDF OpenReview
Neural Dueling Bandits Arun Verma, Zhongxiang Dai, Xiaoqiang Lin, Patrick Jaillet, Bryan Kian Hsiang Low
PDF OpenReview
Neural Incremental Data Assimilation Matthieu Blanke, Ronan Fablet, Marc Lelarge
PDF OpenReview
Neural Interactive Proofs Lewis Hammond, Sam Adam-Day
PDF OpenReview
Neural Network Learns Low-Dimensional Polynomials with SGD near the Information-Theoretic Limit Jason D. Lee, Kazusato Oko, Taiji Suzuki, Denny Wu
PDF OpenReview
Neural Ratio Estimators Meet Distributional Shift and Mode Misspecification: A Cautionary Tale from Strong Gravitational Lensing Andreas Filipp, Yashar Hezaveh, Laurence Perreault-Levasseur
PDF OpenReview
Neural Symmetry Detection for Learning Neural Network Constraints Alex Gabel, Rick Quax, Stratis Gavves
PDF OpenReview
Neural Thermodynamic Integration: Free Energies from Energy-Based Diffusion Models Bálint Máté, François Fleuret, Tristan Bereau
PDF OpenReview
Neuroplasticity and Corruption in Model Mechanisms: A Case Study of Indirect Object Identification Vishnu Kabir Chhabra, Ding Zhu, Mohammad Mahdi Khalili
PDF OpenReview
Neurosymbolic Markov Models Lennert De Smet, Gabriele Venturato, Luc De Raedt, Giuseppe Marra
PDF OpenReview
New Desiderata for Direct Preference Optimization Xiangkun Hu, Tong He, David Wipf
PDF OpenReview
No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO Skander Moalla, Andrea Miele, Razvan Pascanu, Caglar Gulcehre
PDF OpenReview
Non-Differentiable Diffusion Guidance for Improved Molecular Geometry Yuchen Shen, Chenhao Zhang, Chenghui Zhou, Sijie Fu, Newell Washburn, Barnabas Poczos
PDF OpenReview
Non-Ergodicity in Reinforcement Learning: Robustness via Ergodicity Transformations Dominik Baumann, Erfaun Noorani, James Price, Ole Peters, Colm Connaughton, Thomas B. Schön
PDF OpenReview
Non-Linear $H_\infty$ Robustness Guarantees for Neural Network Policies Daniel Urieli
PDF OpenReview
Non-Parameteric Conformal Distributionally Robust Optimization Yash Patel, Guyang Cao, Ambuj Tewari
PDF OpenReview
Nonconvex Meta-Optimization for Deep Learning Xinyi Chen, Evan Dogariu, Zhou Lu, Elad Hazan
PDF OpenReview
Not Just Pretty Pictures: Toward Interventional Data Augmentation Using Text-to-Image Generators Jianhao Yuan, Francesco Pinto, Adam Davies, Philip Torr
PDF OpenReview
NVDSL: Simplifying Tensor Cores with Python-Driven MLIR Metaprogramming Guray Ozen
PDF OpenReview
Off-Policy Evaluation from Logged Human Feedback Aniruddha Bhargava, Lalit K Jain, Branislav Kveton, Ge Liu, Subhojyoti Mukherjee
PDF OpenReview
Offline Reinforcement Learning with Pessimistic Value Priors Filippo Valdettaro, Aldo A. Faisal
PDF OpenReview
Offline RL via Feature-Occupancy Gradient Ascent Gergely Neu, Nneka Okolo
PDF OpenReview
OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents Zihao Wang, Shaofei Cai, Zhancun Mu, Haowei Lin, Ceyao Zhang, Xuejie Liu, Qing Li, Anji Liu, Xiaojian Ma, Yitao Liang
PDF OpenReview
On Conditional Sampling with Joint Flow Matching Amy Xiang Wang
PDF OpenReview
On Fairly Comparing Group Equivariant Networks Lucas Roos, Rodney Stephen Kroon
PDF OpenReview
On Language Models’ Cognitive Biases in Reading Time Prediction Patrick Haller, Lena Sophia Bolliger, Lena Ann Jäger
PDF OpenReview
On PI Controllers for Updating Lagrange Multipliers in Constrained Optimization Motahareh Sohrabi, Juan Ramirez, Tianyue H. Zhang, Simon Lacoste-Julien, Jose Gallego-Posada
PDF OpenReview
On Provable Length and Compositional Generalization Kartik Ahuja, Amin Mansouri
PDF OpenReview
On Robust Reinforcement Learning with Lipschitz-Bounded Policy Networks Nicholas H. Barbara, Ruigang Wang, Ian Manchester
PDF OpenReview
On the Calibration of Conditional-Value-at-Risk Rajeev Verma, Volker Fischer, Eric Nalisnick
PDF OpenReview
On the Difficulty of Faithful Chain-of-Thought Reasoning in Large Language Models Sree Harsha Tanneru, Dan Ley, Chirag Agarwal, Himabindu Lakkaraju
PDF OpenReview
On the Discrepancy and Connection Between Memorization and Generation in Diffusion Models Hanyu Wang, Yujin Han, Difan Zou
PDF OpenReview
On the Effectiveness of Quantum Chemistry Pre-Training for Pharmacological Property Prediction Arun Raja, Hongtao Zhao, Christian Tyrchan, Eva Nittinger, Michael M. Bronstein, Charlotte Deane, Garrett M Morris
PDF OpenReview
On the Expressive Power of Tree-Structured Probabilistic Circuits Lang Yin, Han Zhao
PDF OpenReview
On the Local Geometry of Deep Generative Manifolds Ahmed Imtiaz Humayun, Ibtihel Amara, Candice Schumann, Golnoosh Farnadi, Negar Rostamzadeh, Mohammad Havaei
PDF OpenReview
On the Matter of Embeddings Dispersion on Hyperspheres Evgeniia Tokarchuk, Hua Chang Bakker, Vlad Niculae
PDF OpenReview
On the Metastability of Learning Algorithms in Physics-Informed Neural Networks: A Case Study on Schr\"odinger Operators Alessandro Maria Selvitella
PDF OpenReview
On the Multi-Modal Vulnerability of Diffusion Models Dingcheng Yang, Yang Bai, Xiaojun Jia, Yang Liu, Xiaochun Cao, Wenjian Yu
PDF OpenReview
On the Power of Convolution Augmented Transformer Mingchen Li, Xuechen Zhang, Yixiao Huang, Samet Oymak
PDF OpenReview
On the Privacy Risks of Post-Hoc Explanations of Foundation Models Catherine Huang, Martin Pawelczyk, Himabindu Lakkaraju
PDF OpenReview
On the Robustness of Neural Networks Quantization Against Data Poisoning Attacks Yiwei Lu, Yihan Wang, Guojun Zhang, Yaoliang Yu
PDF OpenReview
On the Similarity of Circuits Across Languages: A Case Study on the Subject-Verb Agreement Task Javier Ferrando, Marta R. Costa-jussà
PDF OpenReview
On the Theory of Risk-Aware Agents: Bridging Actor-Critic and Economics Michal Nauman, Marek Cygan
PDF OpenReview
On Three-Layer Data Markets Alireza Fallah, Michael Jordan, Ali Makhdoumi, Azarakhsh Malekian
PDF OpenReview
One-Shot Safety Alignment for Large Language Models via Optimal Dualization Xinmeng Huang, Shuo Li, Edgar Dobriban, Osbert Bastani, Hamed Hassani, Dongsheng Ding
PDF OpenReview
One-Shot Safety Alignment for Large Language Models via Optimal Dualization Xinmeng Huang, Shuo Li, Edgar Dobriban, Osbert Bastani, Hamed Hassani, Dongsheng Ding
PDF OpenReview
One-Versus-Others Attention: Scalable Multimodal Integration for Biomedical Data Michal Golovanevsky, Eva Schiller, Akira A Nair, Ritambhara Singh, Carsten Eickhoff
PDF OpenReview
Online Optimization of Closed-Loop Control Systems Hao Ma, Melanie Zeilinger, Michael Muehlebach
PDF OpenReview
Online Performance Optimization of Nonlinear Systems: A Gray-Box Approach Zhiyu He, Michael Muehlebach, Saverio Bolognani, Florian Dorfler
PDF OpenReview
Open LLMs Are Necessary for Private Adaptations and Outperform Their Closed Alternatives Vincent Hanke, Tom Blanchard, Franziska Boenisch, Iyiola Emmanuel Olatunji, Michael Backes, Adam Dziedzic
PDF OpenReview
Open LLMs Are Necessary for Private Adaptations and Outperform Their Closed Alternatives Vincent Hanke, Tom Blanchard, Franziska Boenisch, Iyiola Emmanuel Olatunji, Michael Backes, Adam Dziedzic
PDF OpenReview
OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training Sami Jaghouar, Johannes Hagemann
PDF OpenReview
OpenELM: An Efficient Language Model Family with Open Training and Inference Framework Sachin Mehta, Mohammad Hossein Sekhavat, Qingqing Cao, Maxwell Horton, Yanzi Jin, Chenfan Sun, Seyed Iman Mirzadeh, Mahyar Najibi, Dmitry Belenko, Peter Zatloukal, Mohammad Rastegari
PDF OpenReview
Optimal Design for Human Feedback Subhojyoti Mukherjee, Anusha Lalitha, Kousha Kalantari, Aniket Anand Deshmukh, Ge Liu, Yifei Ma, Branislav Kveton
PDF OpenReview
Optimality of Stationary Policies in Risk-Averse Total-Reward MDPs with EVaR Xihong Su, Marek Petrik, Julien Grand-Clément
PDF OpenReview
Optimised Grouped-Query Attention Mechanism for Transformers Yuang Chen, Cheng Zhang, Xitong Gao, Robert D. Mullins, George Anthony Constantinides, Yiren Zhao
PDF OpenReview
Optimistic Asynchrony Control: Achieving Synchronous Convergence with Asynchronous Throughput for Embedding Model Training Roger Waleffe, Jason Mohoney
PDF OpenReview
Optimistic Information Directed Sampling Gergely Neu, Matteo Papini, Ludovic Schwartz
PDF OpenReview
Optimistic Verifiable Training by Controlling Hardware Nondeterminism Megha Srivastava, Simran Arora, Dan Boneh
PDF OpenReview
Oracle-Efficient Reinforcement Learning for Max Value Ensembles Marcel Hussing, Michael Kearns, Aaron Roth, Sikata Bela Sengupta, Jessica Sorrell
PDF OpenReview
Order-Optimal Instance-Dependent Bounds for Offline Reinforcement Learning with Preference Feedback Zhirui Chen, Vincent Y. F. Tan
PDF OpenReview
ORSO: Accelerating Reward Design via Online Reward Selection and Policy Optimization Chen Bo Calvin Zhang, Zhang-Wei Hong, Aldo Pacchiano, Pulkit Agrawal
PDF OpenReview
OTTER: Effortless Label Distribution Adaptation of Zero-Shot Models Changho Shin, Jitian Zhao, Sonia Cromp, Harit Vishwakarma, Frederic Sala
PDF OpenReview
Out-of-Context Prompting Boosts Fairness and Robustness in Large Language Model Predictions Leonardo Cotta, Chris J. Maddison
PDF OpenReview
Out-of-Distribution Validation for Bioactivity Prediction in Drug Discovery: Lessons from Materials Science Udit Surya Saha, Michele Vendruscolo, Anne E Carpenter, Shantanu Singh, Andreas Bender, Srijit Seal
PDF OpenReview
OutEffHop: A Principled Outlier-Efficient Attention Layer from Dense Associative Memory Models Haozheng Luo, Jerry Yao-Chieh Hu, Pei-Hsuan Chang, Hong-Yu Chen, Weijian Li, Wei-Po Wang, Han Liu
PDF OpenReview
Outliers and Calibration Sets Have Diminishing Effect on Quantization of Modern LLMs Davide Paglieri, Saurabh Dash, Tim Rocktäschel, Jack Parker-Holder
PDF OpenReview
Overcoming Knowledge Barriers: Online Imitation Learning from Observation with Pretrained World Models Xingyuan Zhang, Philip Becker-Ehmck, Patrick van der Smagt, Maximilian Karl
PDF OpenReview
Overconfident Oracles: Limitations of in Silico Sequence Design Benchmarking Shikha Surana, Nathan Grinsztajn, Timothy Atkinson, Paul Duckworth, Thomas D Barrett
PDF OpenReview
OxonFair: A Flexible Toolkit for Algorithmic Fairness Eoin D. Delaney, Zihao Fu, Sandra Wachter, Brent Mittelstadt, Chris Russell
PDF OpenReview
PAIR: Boosting the Predictive Power of Protein Representations with a Corpus of Text Annotations Haonan Duan, Marta Skreta, Leonardo Cotta, Ella Miray Rajaonson, Nikita Dhawan, Alan Aspuru-Guzik, Chris J. Maddison
PDF OpenReview
PanSAM: Zero-Shot, Prompt-Free Pancreas Segmentation in CT Imaging Abolfazl Malekahmadi, Mohammad Taha Teimuri Jervakani, Armin Behnamnia, Zahra Dehghanian, Amir Shamloo, Hamid R. Rabiee
PDF OpenReview
Parallelising Differentiable Algorithms Removes the Scalar Bottleneck: A Case Study Euan Ong, Ferenc Huszár, Pietro Lio, Petar Veličković
PDF OpenReview
Parameter Tuning and Modeling of a Rotary Kiln Using Physics-Informed Neural Networks Janak M. Patel, Vishal Sudam Jadhav, Anirudh Deodhar, Shirish Karande, Venkataramana Runkana
PDF OpenReview
Parameter-Efficient Quantized Mixture-of-Experts Meets Vision-Language Instruction Tuning for Semiconductor Electron Micrograph Analysis Sagar Srinivas Sakhinana, Sannidhi Gowri Naga Krishna Geethan, Chidaksh Ravuru, Venkataramana Runkana
PDF OpenReview
Parameter-Efficient Quantized Mixture-of-Experts Meets Vision-Language Instruction Tuning for Semiconductor Electron Micrograph Analysis Sagar Srinivas Sakhinana, Sannidhi Gowri Naga Krishna Geethan, Chidaksh Ravuru, Venkataramana Runkana
PDF OpenReview
Partial Structure Discovery Is Sufficient for No-Regret Learning in Causal Bandits Muhammad Qasim Elahi, Mahsa Ghasemi, Murat Kocaoglu
PDF OpenReview
Partially Observable Multi-Agent Reinforcement Learning Using Mean Field Control Kai Cui, Sascha H. Hauck, Christian Fabian, Heinz Koeppl
PDF OpenReview
Path Complex Neural Network for Molecular Property Prediction Longlong Li, Xiang Liu, Guanghui Wang, Yu Guang Wang, Kelin Xia
PDF OpenReview
PathoLM: Identifying Pathogenicity from the DNA Sequence Through the Genome Foundation Model Sajib Acharjee Dip
PDF OpenReview
Penzai + Treescope: A Toolkit for Interpreting, Visualizing, and Editing Models as Data Daniel D. Johnson
PDF OpenReview
Performance Control in Early Exiting to Deploy Large Models at the Same Cost of Smaller Ones Mehrnaz Mofakhami, Reza Bayat, Ioannis Mitliagkas, Joao Monteiro, Valentina Zantedeschi
PDF OpenReview
Performative Prediction on Games and Mechanism Design António Góis, Mehrnaz Mofakhami, Fernando P. Santos, Simon Lacoste-Julien, Gauthier Gidel
PDF OpenReview
Permutation Tree Invariant Neural Architectures Johannes Urban, Sebastian Tschiatschek, Nils Morten Kriege
PDF OpenReview
PhaseEvo: Towards Unified Long-Context Prompt Optimization for Large Language Models Wendi Cui, Jiaxin Zhang, Zhuohang Li, Hao Sun, Damien Lopez, Kamalika Das, Bradley A. Malin, Sricharan Kumar
PDF OpenReview
Physical Backdoor Attack Can Jeopardize Driving with Vision-Large-Language Models Zhenyang Ni, Rui Ye, Yuxi Wei, Zhen Xiang, Yanfeng Wang, Siheng Chen
PDF OpenReview
Physics-Informed Neural Networks for Derivative-Constrained PDEs Kentaro Hoshisashi, Carolyn E. Phelan, Paolo Barucca
PDF OpenReview
Physics-Informed Weakly Supervised Learning for Interatomic Potentials Makoto Takamoto, Viktor Zaverkin, Mathias Niepert
PDF OpenReview
PICT: Adaptive GPU Accelerated Differentiable Fluid Simulation for Machine Learning Aleksandra Franz, Nils Thuerey
PDF OpenReview
PIED: Physics-Informed Experimental Design for Inverse Problems Apivich Hemachandra, Gregory Kang Ruey Lau, See-Kiong Ng, Bryan Kian Hsiang Low
PDF OpenReview
Pink Noise LQR: How Does Colored Noise Affect the Optimal Policy in RL? Jakob Hollenstein, Marko Zaric, Samuele Tosatto, Justus Piater
PDF OpenReview
PINNACLE: PINN Adaptive ColLocation and Experimental Points Selection Gregory Kang Ruey Lau, Apivich Hemachandra, See-Kiong Ng, Bryan Kian Hsiang Low
PDF OpenReview
PIPER: Primitive-Informed Preference-Based Hierarchical Reinforcement Learning via Hindsight Relabeling Utsav Singh, Wesley A. Suttle, Brian M. Sadler, Vinay P. Namboodiri, Amrit Singh Bedi
PDF OpenReview
PIPER: Primitive-Informed Preference-Based Hierarchical Reinforcement Learning via Hindsight Relabeling Utsav Singh, Wesley A. Suttle, Brian M. Sadler, Vinay P. Namboodiri, Amrit Bedi
PDF OpenReview
PIXART-Δ: Fast and Controllable Image Generation with Latent Consistency Models Junsong Chen, Simian Luo, Enze Xie
PDF OpenReview
Planning Behavior in a Recurrent Neural Network That Plays Sokoban Adrià Garriga-Alonso, Mohammad Taufeeque, Adam Gleave
PDF OpenReview
Playing Large Games with Oracles and AI Debate Xinyi Chen, Angelica Chen, Dean Foster, Elad Hazan
PDF OpenReview
PLINDER: The Protein-Ligand Interactions Dataset and Evaluation Resource Janani Durairaj, Yusuf Adeshina, Zhonglin Cao, Xuejin Zhang, Vladas Oleinikovas, Thomas Duignan, Zachary McClure, Xavier Robin, Emanuele Rossi, Guoqing Zhou, Srimukh Prasad Veccham, Clemens Isert, Yuxing Peng, Prabindh Sundareson, Mehmet Akdel, Gabriele Corso, Hannes Stark, Zachary Wayne Carpenter, Michael M. Bronstein, Emine Kucukbenli, Torsten Schwede, Luca Naef
PDF OpenReview
PLUTO: Pathology-Universal Transformer Dinkar Juyal, Harshith Padigela, Chintan Shah, Daniel Shenker, Natalia Harguindeguy, Yi Liu, Blake Martin, Yibo Zhang, Michael Nercessian, Miles Markey, Isaac Finberg, Kelsey Luu, Daniel Borders, Syed Ashar Javed, Emma L Krause, Raymond Biju, Aashish Sood, Allen Ma, Jackson Nyman, John Shamshoian, Guillaume Chhor, Darpan Sanghavi, Marc Thibault, Limin Yu, Fedaa Najdawi, Jennifer A. Hipp, Darren Fahy, Benjamin Glass, Eric E. Walk, John Abel, Harsha Vardhan Pokkalla, Andrew H. Beck, Sean Grullon
PDF OpenReview
PLUTO: Pathology-Universal Transformer Dinkar Juyal, Harshith Padigela, Chintan Shah, Daniel Shenker, Natalia Harguindeguy, Yi Liu, Blake Martin, Yibo Zhang, Michael Nercessian, Miles Markey, Isaac Finberg, Kelsey Luu, Daniel Borders, Syed Ashar Javed, Emma Krause, Raymond Biju, Aashish Sood, Allen Ma, Jackson Nyman, John Shamshoian, Guillaume Chhor, Darpan Sanghavi, Marc Thibault, Limin Yu, Fedaa Najdawi, Jennifer A. Hipp, Darren Fahy, Benjamin Glass, Eric Walk, John Abel, Harsha Vardhan Pokkalla, Andrew H. Beck, Sean Grullon
PDF OpenReview
PLUTO: Pathology-Universal Transformer Dinkar Juyal, Harshith Padigela, Chintan Shah, Daniel Shenker, Natalia Harguindeguy, Yi Liu, Blake Martin, Yibo Zhang, Michael Nercessian, Miles Markey, Isaac Finberg, Kelsey Luu, Daniel Borders, Syed Ashar Javed, Emma L Krause, Raymond Biju, Aashish Sood, Allen Ma, Jackson Nyman, John Shamshoian, Guillaume Chhor, Darpan Sanghavi, Marc Thibault, Limin Yu, Fedaa Najdawi, Jennifer A. Hipp, Darren Fahy, Benjamin Glass, Eric Walk, John Abel, Harsha Vardhan Pokkalla, Andrew H. Beck, Sean Grullon
PDF OpenReview
Policy Gradient Methods with Adaptive Policy Spaces Gianmarco Tedeschi, Matteo Papini, Marcello Restelli
PDF OpenReview
Policy Gradients for Optimal Parallel Tempering MCMC Daniel Zhao, Natesh S. Pillai
PDF OpenReview
Polynomial Convergence of Bandit No-Regret Dynamics in Congestion Games Leello Tadesse Dadi, Ioannis Panageas, Stratis Skoulakis, Luca Viano, Volkan Cevher
PDF OpenReview
Polynomial Regression as a Task for Understanding In-Context Learning Through Finetuning and Alignment Max Wilcoxson, Morten Svendgård, Ria Doshi, Dylan Davis, Reya Vir, Anant Sahai
PDF OpenReview
Population Transformer: Learning Population-Level Representations of Intracranial Activity Geeling Chau, Christopher Wang, Sabera J Talukder, Vighnesh Subramaniam, Saraswati Soedarmadji, Yisong Yue, Boris Katz, Andrei Barbu
PDF OpenReview
Population-Level Dark Energy Constraints from Strong Gravitational Lensing Using Simulation-Based Inference Sreevani Jarugula, Brian Nord, Abhijith Gandrakota, Aleksandra Ciprijanovic
PDF OpenReview
Position Coupling: Leveraging Task Structure for Improved Length Generalization of Transformers Hanseul Cho, Jaeyoung Cha, Pranjal Awasthi, Srinadh Bhojanapalli, Anupam Gupta, Chulhee Yun
PDF OpenReview
Position Paper: Dual-System Language Models via Next-Action Prediction Zhehang Du, Weijie J Su
PDF OpenReview
POST: A Framework for Privacy of Soft-Prompt Transfer Xun Wang, Jing Xu, Franziska Boenisch, Michael Backes, Adam Dziedzic
PDF OpenReview
POST: A Framework for Privacy of Soft-Prompt Transfer Xun Wang, Jing Xu, Franziska Boenisch, Michael Backes, Adam Dziedzic
PDF OpenReview
Power Mean Estimation in Stochastic Monte-Carlo Tree Search Tuan Quang Dam, Odalric-Ambrym Maillard, Emilie Kaufmann
PDF OpenReview
PQV-Mobile: A Combined Pruning and Quantization Toolkit to Optimize Vision Transformers for Mobile Applications Kshitij Bhardwaj
PDF OpenReview
Pre-Calc: Learning to Use the Calculator Improves Numeracy in Language Models Vishruth Veerendranath, Vishwa Shah, Kshitish Ghate
PDF OpenReview
Pre-Training of Single-Cell Language Models Through Genetic Pathway Learning Xuxi Chen, Zhangyang Wang, Marinka Zitnik, Manolis Kellis, Tianlong Chen
PDF OpenReview
Predicting Dark Matter Halo Masses from Simulated Galaxy Images and Environments Austin J Larson, John F Wu, Craig Jones
PDF OpenReview
Predicting Metal-Protein Interactions Using Cofolding Methods: Status Quo Simon L. Dürr, Ursula Rothlisberger
PDF OpenReview
Predictive Uncertainties Based on Proper Scoring Rules Nikita Kotelevskii, Maxim Panov
PDF OpenReview
Preference Elicitation for Offline Reinforcement Learning Alizée Pace, Bernhard Schölkopf, Gunnar Ratsch, Giorgia Ramponi
PDF OpenReview
Preference Elicitation for Offline Reinforcement Learning Alizée Pace, Bernhard Schölkopf, Gunnar Ratsch, Giorgia Ramponi
PDF OpenReview
Preference Learning Algorithms Do Not Learn Preference Rankings Angelica Chen, Sadhika Malladi, Lily H Zhang, Xinyi Chen, Qiuyi Zhang, Rajesh Ranganath, Kyunghyun Cho
PDF OpenReview
Preference Learning Algorithms Do Not Learn Preference Rankings Angelica Chen, Sadhika Malladi, Lily H Zhang, Xinyi Chen, Qiuyi Zhang, Rajesh Ranganath, Kyunghyun Cho
PDF OpenReview
Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models Siyan Zhao, Daniel Mingyi Israel, Guy Van den Broeck, Aditya Grover
PDF OpenReview
Pretrained Deep Models Outperform GBDTs in Learning-to-Rank Under Label Scarcity Charlie Hou, Kiran Koshy Thekumparampil, Michael Shavlovsky, Giulia Fanti, Sujay Sanghavi
PDF OpenReview
Pretrained Hybrids with MAD Skills Nicholas Roberts, Samuel Guo, Zhiqi Gao, Satya Sai Srinath Namburi Gnvv, Sonia Cromp, Chengjun Wu, Chengyu Duan, Frederic Sala
PDF OpenReview
Pretrained Hybrids with MAD Skills Nicholas Roberts, Samuel Guo, Zhiqi Gao, Satya Sai Srinath Namburi Gnvv, Sonia Cromp, Chengjun Wu, Chengyu Duan, Frederic Sala
PDF OpenReview
Pretrained Hybrids with MAD Skills Nicholas Roberts, Samuel Guo, Zhiqi Gao, Satya Sai Srinath Namburi Gnvv, Sonia Cromp, Chengjun Wu, Chengyu Duan, Frederic Sala
PDF OpenReview
PrimeGuard: Safe and Helpful LLMs Through Tuning-Free Routing Blazej Manczak, Eric Lin, Eliott Zemour, Vaikkunth Mugunthan
PDF OpenReview
Privacy Auditing of Large Language Models Ashwinee Panda, Xinyu Tang, Milad Nasr, Christopher A. Choquette-Choo, Prateek Mittal
PDF OpenReview
Privacy Auditing of Large Language Models Ashwinee Panda, Xinyu Tang, Milad Nasr, Christopher A. Choquette-Choo, Prateek Mittal
PDF OpenReview
Private Attribute Inference from Images with Vision-Language Models Batuhan Tömekçe, Mark Vero, Robin Staab, Martin Vechev
PDF OpenReview
Private Fine-Tuning of Large Language Models with Zeroth-Order Optimization Xinyu Tang, Ashwinee Panda, Milad Nasr, Saeed Mahloujifar, Prateek Mittal
PDF OpenReview
Probabilistic World Modeling with Asymmetric Distance Measure Meng Song
PDF OpenReview
Probability Tools for Sequential Random Projection Yingru Li
PDF OpenReview
Probing the Decision Boundaries of In-Context Learning in Large Language Models Siyan Zhao, Tung Nguyen, Aditya Grover
PDF OpenReview
Probing the Decision Boundaries of In-Context Learning in Large Language Models Siyan Zhao, Tung Nguyen, Aditya Grover
PDF OpenReview
Processing Large-Scale Graphs with G-Signatures Lukas Gruber, Bernhard Schäfl, Johannes Brandstetter, Sepp Hochreiter
PDF OpenReview
ProFeAT: Projected Feature Adversarial Training for Self-Supervised Learning of Robust Representations Sravanti Addepalli, Priyam Dey, Venkatesh Babu Radhakrishnan
PDF OpenReview
Progress Measures for Grokking on Real-World Tasks Satvik Golechha
PDF OpenReview
Progress or Regress? Self-Improvement Reversal in Post-Training Ting Wu, Xuefeng Li, Pengfei Liu
PDF OpenReview
Progressive Distillation Improves Feature Learning via Implicit Curriculum Abhishek Panigrahi, Bingbin Liu, Sadhika Malladi, Andrej Risteski, Surbhi Goel
PDF OpenReview
Progressive Distillation Improves Feature Learning via Implicit Curriculum Abhishek Panigrahi, Bingbin Liu, Sadhika Malladi, Andrej Risteski, Surbhi Goel
PDF OpenReview
Progressive-Hint Prompting Improves Reasoning in Large Language Models Chuanyang Zheng, Zhengying Liu, Enze Xie, Zhenguo Li, Yu Li
PDF OpenReview
Projectable Models: One-Shot Generation of Small Specialized Transformers from Large Ones Andrey Zhmoginov, Jihwan Lee, Mark Sandler
PDF OpenReview
Projected Language Models: A Large Model Pre-Segmented into Smaller Ones David Grangier, Angelos Katharopoulos, Pierre Ablin, Awni Hannun
PDF OpenReview
Projection Killer: Peering Through High Dimensional Posterior Distribution Marco Raveri, Cyrille Doux, Shivam Pandey
PDF OpenReview
Prompt Optimization with EASE? Efficient Ordering-Aware Automated Selection of Exemplars Zhaoxuan Wu, Xiaoqiang Lin, Zhongxiang Dai, Wenyang Hu, Yao Shu, See-Kiong Ng, Patrick Jaillet, Bryan Kian Hsiang Low
PDF OpenReview
Prompt Optimization with Human Feedback Xiaoqiang Lin, Zhongxiang Dai, Arun Verma, See-Kiong Ng, Patrick Jaillet, Bryan Kian Hsiang Low
PDF OpenReview
Prompt-Prompted Adaptive Structured Pruning for Efficient LLM Generation Harry Dong, Beidi Chen, Yuejie Chi
PDF OpenReview
Prot2Token: A Multi-Task Framework for Protein Language Processing Using Autoregressive Language Modeling Mahdi Pourmirzaei, Farzaneh Esmaili, Mohammadreza Pourmirzaei, Duolin Wang, Dong Xu
PDF OpenReview
Protein Language Models Expose Viral Mimicry and Immune Escape Dan Ofer, Michal Linial
PDF OpenReview
Protein Language Models in Directed Evolution Russell Maguire, Kotryna Bloznelyte, Fikayo Adepoju, Matthew Armean-Jones, Shafiat Dewan, Akash Gupta, Frances Patricia Jones, Preet Lalli, Anna Schooneveld, Sean Thompson, Ece Ebrahimi, Stella Fozzard, David Berman, Luca Rossoni, Will Addison, Ian Taylor
PDF OpenReview
ProtMamba: A Homology-Aware but Alignment-Free Protein State Space Model Damiano Sgarbossa, Cyril Malbranke, Anne-Florence Bitbol
PDF OpenReview
Prototype-Based Methods in Explainable AI and Emerging Opportunities in the Geosciences Anushka Narayanan, Karianne Bergen
PDF OpenReview
Provable Benefit of Cutout and CutMix for Feature Learning Junsoo Oh, Chulhee Yun
PDF OpenReview
Provable Partially Observable Reinforcement Learning with Privileged Information Yang Cai, Xiangyu Liu, Argyris Oikonomou, Kaiqing Zhang
PDF OpenReview
Provable Tempered Overfitting of Minimal Nets and Typical Nets Itamar Harel, William M. Hoza, Gal Vardi, Itay Evron, Nathan Srebro, Daniel Soudry
PDF OpenReview
Provably Mitigating Overoptimization in RLHF: Your SFT Loss Is Implicitly an Adversarial Regularizer Zhihan Liu, Miao Lu, Shenao Zhang, Boyi Liu, Hongyi Guo, Yingxiang Yang, Jose Blanchet, Zhaoran Wang
PDF OpenReview
Proving That Cryptic Crossword Clue Answers Are Correct Martin Andrews, Sam Witteveen
PDF OpenReview
ProxyTune: Hyperparameter Tuning Through Iteratively Refined Proxies Agrin Hilmkil, Wenbo Gong, Nick Pawlowski, Cheng Zhang
PDF OpenReview
PutnamBench: A Multilingual Competition-Mathematics Benchmark for Formal Theorem-Proving George Tsoukalas, Jasper Lee, John Jennings, Jimmy Xin, Michelle Ding, Michael Jennings, Amitayush Thakur, Swarat Chaudhuri
PDF OpenReview
QGFN: Controllable Greediness with Action Values Elaine Lau, Stephen Zhewen Lu, Ling Pan, Doina Precup, Emmanuel Bengio
PDF OpenReview
Quality-Diversity for One-Shot Biological Sequence Design Jérémie Dona, Arthur Flajolet, Andrei Marginean, Antoine Cully, Thomas Pierrot
PDF OpenReview
Quantifying Aleatoric and Epistemic Uncertainty: A Credal Approach Paul Hofman, Yusuf Sale, Eyke Hüllermeier
PDF OpenReview
Quantized Representations Prevent Dimensional Collapse in Self-Predictive RL Aidan Scannell, Kalle Kujanpää, Yi Zhao, Mohammadreza Nakhaeinezhadfard, Arno Solin, Joni Pajarinen
PDF OpenReview
Quantum 3D Visual Grounding: A Step Towards Quantum-Inspired AI-Visualization Adib Bazgir, Rama chandra Praneeth Madugula, Yuwen Zhang
PDF OpenReview
Quantum Circuit Synthesis with Diffusion Models Florian Fürruter, Gorka Muñoz-Gil, Hans J Briegel
PDF OpenReview
Quantum-PEFT: Ultra Parameter-Efficient Fine-Tuning Toshiaki Koike-Akino, Francesco Tonin, Yongtao Wu, Leyla Naz Candogan, Volkan Cevher
PDF OpenReview
Query Design for Crowdsourced Clustering: Effect of Cognitive Overload and Contextual Bias Yi Chen, Ramya Korlakai Vinayak
PDF OpenReview
RamanSPy: Augmenting Raman Spectroscopy Data Analysis with AI Dimitar Georgiev, Simon Vilms Pedersen, Ruoxiao Xie, Álvaro Fernández-Galiana, Molly M. Stevens, Mauricio Barahona
PDF OpenReview
RamanSPy: Augmenting Raman Spectroscopy Data Analysis with AI Dimitar Georgiev, Simon Vilms Pedersen, Ruoxiao Xie, Álvaro Fernández-Galiana, Molly M. Stevens, Mauricio Barahona
PDF OpenReview
Random Matrix Theory Analysis of Neural Network Weight Matrices Matthias Thamm, Max Staats, Bernd Rosenow
PDF OpenReview
Randomized Confidence Bounds for Stochastic Partial Monitoring Maxime Heuillet, Ola Ahmad, Audrey Durand
PDF OpenReview
Rank Minimization, Alignment and Weight Decay in Neural Networks David Yunis, Kumar Kshitij Patel, Samuel Wheeler, Pedro Henrique Pamplona Savarese, Gal Vardi, Karen Livescu, Michael Maire, Matthew Walter
PDF OpenReview
Rapid Switching and Multi-Adapter Fusion via Sparse High Rank Adapters Kartikeya Bhardwaj, Nilesh Prasad Pandey, Sweta Priyadarshi, Viswanath Ganapathy, Rafael Esteves, Shreya Kadambi, Shubhankar Borse, Paul Whatmough, Risheek Garrepalli, Mart Van Baalen, Harris Teague, Markus Nagel
PDF OpenReview
Realtime Reinforcement Learning: Towards Rapid Asynchronous Deployment of Large Models Matthew Riemer, Gopeshh Subbaraj, Glen Berseth, Irina Rish
PDF OpenReview
REBEL: Reinforcement Learning via Regressing Relative Rewards Zhaolin Gao, Jonathan Daniel Chang, Wenhao Zhan, Owen Oertell, Gokul Swamy, Kianté Brantley, Thorsten Joachims, J. Andrew Bagnell, Jason D. Lee, Wen Sun
PDF OpenReview
REBEL: Reinforcement Learning via Regressing Relative Rewards Zhaolin Gao, Jonathan Daniel Chang, Wenhao Zhan, Owen Oertell, Gokul Swamy, Kianté Brantley, Thorsten Joachims, J. Andrew Bagnell, Jason D. Lee, Wen Sun
PDF OpenReview
Recommender System Design via Online Feedback Optimization Sanjay Chandrasekaran, Giulia De Pasquale, Giuseppe Belgioioso, Florian Dorfler
PDF OpenReview
Recurrent Natural Policy Gradient for POMDPs Semih Cayci, Atilla Eryilmaz
PDF OpenReview
Recursive Introspection: Teaching Foundation Model Agents How to Self-Improve Yuxiao Qu, Tianjun Zhang, Naman Garg, Aviral Kumar
PDF OpenReview
Recursive Introspection: Teaching LLM Agents How to Self-Improve Yuxiao Qu, Tianjun Zhang, Naman Garg, Aviral Kumar
PDF OpenReview
Recursive Introspection: Teaching LLM Agents How to Self-Improve Yuxiao Qu, Tianjun Zhang, Naman Garg, Aviral Kumar
PDF OpenReview
Recursive Introspection: Teaching LLM Agents How to Self-Improve Yuxiao Qu, Tianjun Zhang, Naman Garg, Aviral Kumar
PDF OpenReview
Reducing Uncertainty Through Mutual Information in Structural and Systems Biology Vincent Zaballa, Elliot E Hui
PDF OpenReview
Refusal in Language Models Is Mediated by a Single Direction Andy Arditi, Oscar Balcells Obeso, Aaquib Syed, Daniel Paleka, Nina Panickssery, Wes Gurnee, Neel Nanda
PDF OpenReview
Regression-Stratified Sampling for Optimized Algorithm Selection in Time-Constrained Tabular AutoML Mehdi Bahrami, So Hasegawa, Lei Liu, Wei-Peng Chen
PDF OpenReview
Regularized Best-of-N Sampling to Mitigate Reward Hacking for Language Model Alignment Yuu Jinnai, Tetsuro Morimura, Kaito Ariu, Kenshi Abe
PDF OpenReview
Regularized Distribution Matching Distillation for One-Step Unpaired Image-to-Image Translation Denis Rakitin, Ivan Shchekotov, Dmitry Vetrov
PDF OpenReview
Regularized KL-Divergence for Well-Defined Function-Space Variational Inference in Bayesian Neural Networks Tristan Cinquin, Robert Bamler
PDF OpenReview
Reinforcement Learning for Efficient Design and Control Co-Optimisation of Energy Systems Marine Cauz, Adrien Bolland, Christophe Ballif, Nicolas Wyrsch
PDF OpenReview
Reinforcement Learning from Bagged Reward Yuting Tang, Xin-Qiang Cai, Yao-Xiang Ding, Qiyu Wu, Guoqing Liu, Masashi Sugiyama
PDF OpenReview
Reinforcement Learning from Human Text Feedback: Learning a Reward Model from Human Text Input Belen Martin Urcelay, Andreas Krause, Giorgia Ramponi
PDF OpenReview
Reinforcement Learning in the Wild with Maximum Likelihood-Based Model Transfer Hannes Eriksson, Tommy Tram, Debabrota Basu, Mina Alibeigi, Christos Dimitrakakis
PDF OpenReview
Reinforcement Learning of Adaptive Acquisition Policies for Inverse Problems Gianluigi Silvestri, Fabio Valerio Massoli, Tribhuvanesh Orekondy, Afshin Abdi, Arash Behboodi
PDF OpenReview
Reinforcement Learning with Lookahead Information Nadav Merlis
PDF OpenReview
Reinforcement Learning with Quasi-Hyperbolic Discounting S R Eshwar, Nibedita Roy, Gugan Thoppe
PDF OpenReview
Relational Composition in Neural Networks: A Survey and Call to Action Martin Wattenberg, Fernanda Viégas
PDF OpenReview
Relatively Rational: Learning Utilities and Rationalities Jointly from Pairwise Preferences Taku Yamagata, Tobias Oberkofler, Timo Kaufmann, Viktor Bengs, Eyke Hüllermeier, Raul Santos-Rodriguez
PDF OpenReview
Relaxed Equivariant Graph Neural Networks Elyssa Hofgard, Rui Wang, Robin Walters, Tess Smidt
PDF OpenReview
Relaxing Graph Transformers for Adversarial Attacks Philipp Foth, Lukas Gosch, Simon Geisler, Leo Schwinn, Stephan Günnemann
PDF OpenReview
Reliability Thresholds for the Bethe Free Energy Approximation Harald Leisenberger, Christian Knoll, Franz Pernkopf
PDF OpenReview
ReLU Characteristic Activation Analysis Wenlin Chen, Hong Ge
PDF OpenReview
ReLU MLPs Can Compute Numerical Integration: Mechanistic Interpretation of a Non-Linear Activation Chun Hei Yip, Rajashree Agrawal, Jason Gross
PDF OpenReview
Repetita Iuvant: Data Repetition Allows SGD to Learn High-Dimensional Multi-Index Functions Luca Arnaboldi, Yatin Dandi, Florent Krzakala, Luca Pesce, Ludovic Stephan
PDF OpenReview
RepoQA: Evaluating Long Context Code Understanding Jiawei Liu, Jia Le Tian, Vijay Daita, Yuxiang Wei, Yifeng Ding, Yuhan Katherine Wang, Jun Yang, Lingming Zhang
PDF OpenReview
Representing Rule-Based Chatbots with Transformers Dan Friedman, Abhishek Panigrahi, Danqi Chen
PDF OpenReview
Resolving Discrepancies in Compute-Optimal Scaling of Language Models Tomer Porian, Mitchell Wortsman, Jenia Jitsev, Ludwig Schmidt, Yair Carmon
PDF OpenReview
Resource-Constrained Neural Architecture Search on Language Models: A Case Study Andreas Paraskeva, Joao Pedro Reis, Suzan Verberne, Jan N. van Rijn
PDF OpenReview
Rethinking Invariance in In-Context Learning Lizhe Fang, Yifei Wang, Khashayar Gatmiry, Lei Fang, Yisen Wang
PDF OpenReview
Rethinking Model-Based, Policy-Based, and Value-Based Reinforcement Learning via the Lens of Representation Complexity Guhao Feng, Han Zhong
PDF OpenReview
Rethinking Molecular Design: Integrating Latent Variable and Auto-Regressive Models for Enhanced Goal Directed Generation Arthur-Louis Heath, Amina Mollaysa, Michael Krauthammer
PDF OpenReview
Retrieval & Fine-Tuning for In-Context Tabular Models Valentin Thomas, Junwei Ma, Rasa Hosseinzadeh, Keyvan Golestan, Guangwei Yu, Maksims Volkovs, Anthony L. Caterini
PDF OpenReview
Retrieve to Explain: Evidence-Driven Predictions with Language Models Ravi Patel, Angus Brayne, Rogier Hintzen, Daniel Jaroslawicz, Georgiana Neculae, Dane S. Corneil
PDF OpenReview
Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual Alignment Zhaofeng Wu, Ananth Balashankar, Yoon Kim, Jacob Eisenstein, Ahmad Beirami
PDF OpenReview
Revealing the Utilized Rank of Subspaces of Learning in Neural Networks Isha Garg, Christian Koguchi, Eshan Verma, Daniel Ulbricht
PDF OpenReview
Reverse Transition Kernel: A Flexible Framework to Accelerate Diffusion Inference Xunpeng Huang, Difan Zou, Hanze Dong, Yi Zhang, Yian Ma, Tong Zhang
PDF OpenReview
Revisiting Cascaded Ensembles for Efficient Inference Steven Kolawole, Don Dennis, Ameet Talwalkar, Virginia Smith
PDF OpenReview
Revisiting Random Walks for Learning on Graphs Jinwoo Kim, Olga Zaghen, Ayhan Suleymanzade, Youngmin Ryou, Seunghoon Hong
PDF OpenReview
Revisiting Score Function Estimators for $k$-Subset Sampling Klas Wijk, Ricardo Vinuesa Motilva, Hossein Azizpour
PDF OpenReview
Revisiting Successor Features for Inverse Reinforcement Learning Arnav Kumar Jain, Harley Wiltzer, Jesse Farebrother, Irina Rish, Glen Berseth, Sanjiban Choudhury
PDF OpenReview
Reward Centering Abhishek Naik, Yi Wan, Manan Tomar, Richard S. Sutton
PDF OpenReview
Reweighted Bellman Targets for Continual Reinforcement Learning Ke Sun, Jun Jin, Xi Chen, Wulong Liu, Linglong Kong
PDF OpenReview
RFamLlama: An Efficient Conditional Language Model for RNA Sequence Generation Across Diverse Structural Families Jinyuan Sun, Han Li, Yifan Deng
PDF OpenReview
RGFN: Synthesizable Molecular Generation Using GFlowNets Michał Koziarski, Andrei Rekesh, Dmytro Shevchuk, Almer M. van der Sloot, Piotr Gaiński, Yoshua Bengio, Cheng-Hao Liu, Mike Tyers, Robert A. Batey
PDF OpenReview
RIO-CPD: A Riemannian Geometric Method for Correlation-Aware Online Change Point Detection Chengyuan Deng, Zhengzhang Chen, Xujiang Zhao, Haoyu Wang, Junxiang Wang, Haifeng Chen, Jie Gao
PDF OpenReview
RISE: 3D Perception Makes Real-World Robot Imitation Simple and Effective Chenxi Wang, Hongjie Fang, Hao-Shu Fang, Cewu Lu
PDF OpenReview
Risk-Aware Bandits for Best Crop Management Dorian Baudry, Romain Gautron
PDF OpenReview
RLHF and IIA: Perverse Incentives Wanqiao Xu, Shi Dong, Xiuyuan Lu, Grace Lam, Zheng Wen, Benjamin Van Roy
PDF OpenReview
RLHF from Heterogeneous Feedback via Personalization and Preference Aggregation Chanwoo Park, Mingyang Liu, Dingwen Kong, Kaiqing Zhang, Asuman E. Ozdaglar
PDF OpenReview
RLHF from Heterogeneous Feedback via Personalization and Preference Aggregation Chanwoo Park, Mingyang Liu, Dingwen Kong, Kaiqing Zhang, Asuman E. Ozdaglar
PDF OpenReview
RNA-FrameFlow for De Novo 3D RNA Backbone Design Rishabh Anand, Chaitanya K. Joshi, Alex Morehead, Arian Rokkum Jamasb, Charles Harris, Simon V Mathis, Kieran Didi, Bryan Hooi, Pietro Lio
PDF OpenReview
RNA-FrameFlow for De Novo 3D RNA Backbone Design Rishabh Anand, Chaitanya K. Joshi, Alex Morehead, Arian Rokkum Jamasb, Charles Harris, Simon V Mathis, Kieran Didi, Bryan Hooi, Pietro Lio
PDF OpenReview
RNAInvBench: Benchmark for the RNA Inverse Design Problem Jack Cole, Fan Li, Liwen Wu, Ke Li
PDF OpenReview
RNR: Teaching Large Language Models to Follow Roles and Rules Kuan Wang, Alexander Bukharin, Haoming Jiang, Qingyu Yin, Zhengyang Wang, Tuo Zhao, Jingbo Shang, Chao Zhang, Bing Yin, Xian Li, Jianshu Chen, Shiyang Li
PDF OpenReview
RoboGolf: Mastering Real-World Minigolf with a Reflective Multi-Modality Vision-Language Model Hantao Zhou, Tianying Ji, Lukas Sommerhalder, Michael Görner, Norman Hendrich, Fuchun Sun, Jianwei Dr. Zhang, Huazhe Xu
PDF OpenReview
Robust Best-of-Both-Worlds Gap Estimators Based on Importance-Weighted Sampling Sarah Clusiau, Saeed Masoudian, Yevgeny Seldin
PDF OpenReview
Robust CLIP: Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models Christian Schlarmann, Naman Deep Singh, Francesco Croce, Matthias Hein
PDF OpenReview
Robust Federated Finetuning of Foundation Models via Alternating Minimization of LoRA Shuangyi Chen, Yue Ju, Hardik Dalal, Zhongwen Zhu, Ashish J Khisti
PDF OpenReview
Robust Knowledge Unlearning via Mechanistic Localizations Phillip Huang Guo, Aaquib Syed, Abhay Sheshadri, Aidan Ewart, Gintare Karolina Dziugaite
PDF OpenReview
Robust Learning of Transfer Functions for Single-Cell Transcriptomics Depth Normalization Da Kuang, Junhyong Kim
PDF OpenReview
Robust Unlearning via Mechanistic Localizations Phillip Huang Guo, Aaquib Syed, Abhay Sheshadri, Aidan Ewart, Gintare Karolina Dziugaite
PDF OpenReview
Robustness Analysis of AI Models in Critical Energy Systems Pantelis Dogoulis, Matthieu Jimenez, Maxime Cordy, Salah Ghamizi, Yves Le Traon
PDF OpenReview
Robustness of Explainable Artificial Intelligence in Industrial Process Modelling Benedikt Kantz, Clemens Staudinger, Christoph Feilmayr, Johannes Wachlmayr, Alexander Haberl, Stefan Schuster, Franz Pernkopf
PDF OpenReview
RouteFinder: Towards Foundation Models for Vehicle Routing Problems Federico Berto, Chuanbo Hua, Nayeli Gast Zepeda, André Hottung, Niels Wouda, Leon Lan, Kevin Tierney, Jinkyoo Park
PDF OpenReview
RouterBench: A Benchmark for Multi-LLM Routing System Qitian Jason Hu, Jacob Bieker, Xiuyu Li, Nan Jiang, Benjamin Keigwin, Gaurav Ranganath, Kurt Keutzer, Shriyash Kaustubh Upadhyay
PDF OpenReview
Rule Based Rewards for Fine-Grained LLM Safety Tong Mu, Alec Helyar, Johannes Heidecke, Joshua Achiam, Andrea Vallone, Ian D Kivlichan, Molly Lin, Alex Beutel, John Schulman, Lilian Weng
PDF OpenReview
Rule-Enhanced Graph Learning Ali Khazraee, Abdolreza Mirzaei, Majjid Farhadi, Parmis Nadaff, Kiarash Zahirnia, Mohammad Salameh, Kevin Cannons, Richard Mar, Mingyi Wu, Oliver Schulte
PDF OpenReview
SA-DQAS: Self-Attention Enhanced Differentiable Quantum Architecture Search Yize Sun, Jiarui Liu, Zixin Wu, Zifeng Ding, Yunpu Ma, Thomas Seidl, Volker Tresp
PDF OpenReview
Safe Exploration in Reproducing Kernel Hilbert Spaces Abdullah Tokmak, Kiran G. Krishnan, Thomas B. Schön, Dominik Baumann
PDF OpenReview
Safe Online Nonstochastic Control from Data Sebastian Kerz, Armin Lederer, Marion Leibold, Dirk Wollherr
PDF OpenReview
Safe Reinforcement Learning with Contrastive Risk Prediction Hanping Zhang, Yuhong Guo
PDF OpenReview
Safer Reinforcement Learning by Going Off-Policy: A Benchmark Igor Kuznetsov
PDF OpenReview
SAIL: Self-Improving Efficient Online Alignment of Large Language Models Mucong Ding, Souradip Chakraborty, Vibhu Agrawal, Zora Che, Alec Koppel, Mengdi Wang, Amrit Bedi, Furong Huang
PDF OpenReview
SatDiffMoE: A Mixture of Estimation Method for Satellite Image Super-Resolution with Latent Diffusion Models Zhaoxu Luo, Bowen Song, Liyue Shen
PDF OpenReview
SatDiffMoE: A Mixture of Estimation Method for Satellite Image Super-Resolution with Latent Diffusion Models Bowen Song, Zhaoxu Luo, Liyue Shen
PDF OpenReview
Scalable AI Safety via Doubly-Efficient Debate Jonah Brown-Cohen, Geoffrey Irving, Georgios Piliouras
PDF OpenReview
Scalable Anomaly Detection in Batch Polishing Processes for Inertial Confinement Fusion Shells Shashank Galla, Akash Tiwari, Kshitij Bhardwaj, Sean Michael Hayes, Satish Bukkapatnam, Suhas Bhandarkar
PDF OpenReview
Scalable Approaches for a Theory of Many Minds Maximilian Puelma Touzel, Amin Memarian, Matthew Riemer, Andrei Mircea, Andrew Robert Williams, Elin Ahlstrand, Lucas Lehnert, Rupali Bhati, Guillaume Dumas, Irina Rish
PDF OpenReview
Scalable Local Intrinsic Dimension Estimation with Diffusion Models Hamidreza Kamkari, Brendan Leigh Ross, Rasa Hosseinzadeh, Jesse C. Cresswell, Gabriel Loaiza-Ganem
PDF OpenReview
Scalable Multi-Task Transfer Learning for Molecular Property Prediction Chanhui Lee, Dae-Woong Jeong, Sung Moon Ko, Sumin Lee, Hyunseung Kim, Soorin Yim, Sehui Han, Sungwoong Kim, Sungbin Lim
PDF OpenReview
Scalable Oversight by Accounting for Unreliable Feedback Shivam Singhal, Cassidy Laidlaw, Anca Dragan
PDF OpenReview
Scalable Unsupervised Alignment of Metric and Nonmetric Structures Sanketh Vedula, Valentino Maiorca, Lorenzo Basile, Francesco Locatello, Alexander Bronstein
PDF OpenReview
Scalably Solving Assistance Games Cassidy Laidlaw, Eli Bronstein, Timothy Guo, Dylan Feng, Lukas Berglund, Justin Svegliato, Stuart Russell, Anca Dragan
PDF OpenReview
ScaLES: Scalable Latent Exploration Score for Pre-Trained Generative Networks Omer Ronen, Ahmed Imtiaz Humayun, Randall Balestriero, Richard Baraniuk, Bin Yu
PDF OpenReview
Scalify: Scale Propagation for Efficient Low-Precision LLM Training Paul Balanca, Samuel Hosegood, Carlo Luschi, Andrew W Fitzgibbon
PDF OpenReview
Scaling Automated Quantum Error Correction Discovery with Reinforcement Learning Jan Olle, Remmy Zen, Matteo Puviani, Florian Marquardt
PDF OpenReview
Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations Alexander Hägele, Elie Bakouch, Atli Kosson, Loubna Ben Allal, Leandro Von Werra, Martin Jaggi
PDF OpenReview
Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms Rafael Rafailov, Yaswanth Chittepu, Ryan Park, Harshit Sikchi, Joey Hejna, W. Bradley Knox, Chelsea Finn, Scott Niekum
PDF OpenReview
Scaling the Vocabulary of Non-Autoregressive Models for Efficient Generative Retrieval Ravisri Valluri, Akash Kumar Mohankumar, Kushal S. Dave, Amit S, Jian Jiao, Manik Varma, Gaurav Sinha
PDF OpenReview
Scaling up Diffusion and Flow-Based XGBoost Models Jesse C. Cresswell, Taewoo Kim
PDF OpenReview
Scanning Tunneling Microscopy (STM) Image Segmentation Using Unsupervised and Few-Shot Learning Nikola Kolev, Emily Hofmann, Geoff Thornton, Max Trouton, Filippo Federici, David Gao, Steven Schofield, Taylor Stock, Neil Curson
PDF OpenReview
Scavenging Hyena: Distilling Transformers into Long Convolution Models Tokiniaina Raharison Ralambomihanta, Shahrad Mohammadzadeh, Sami Nur Islam, Wassim Jabbour, Laurence Liang
PDF OpenReview
SCENE-Net V2: Interpretable Multiclass 3D Scene Understanding with Geometric Priors Diogo Mateus Lavado, Claudia Soares, Alessandra Micheletti
PDF OpenReview
Scoreformer: A Surrogate Model for Large-Scale Prediction of Docking Scores Alvaro Ciudad Serrano, Adrian Morales-Pastor, Laura Malo, Isaac Filella-Merce, Victor Guallar, Alexis Molina
PDF OpenReview
scTree: Discovering Cellular Hierarchies in the Presence of Batch Effects in scRNA-Seq Data Moritz Vandenhirtz, Florian Barkmann, Laura Manduchi, Julia E Vogt, Valentina Boeva
PDF OpenReview
scTree: Discovering Cellular Hierarchies in the Presence of Batch Effects in scRNA-Seq Data Moritz Vandenhirtz, Florian Barkmann, Laura Manduchi, Julia E Vogt, Valentina Boeva
PDF OpenReview
SE(3)-Equivariant Diffusion Graph Nets: Synthesizing Flow Fields by Denoising Invariant Latents on Graphs Mario Lino Valencia, Nils Thuerey, Tobias Pfaff
PDF OpenReview
SE(3)-Hyena Operator for Scalable Equivariant Learning Artem Moskalev, Mangal Prakash, Rui Liao, Tommaso Mansi
PDF OpenReview
SE3ET: SE(3)-Equivariant Transformer for Low-Overlap Point Cloud Registration Chien Erh Lin, Minghan Zhu, Maani Ghaffari
PDF OpenReview
Secondary Structure-Guided Novel Protein Sequence Generation with Latent Graph Diffusion Yutong Hu, Yang Tan, Andi Han, Lirong Zheng, Liang Hong, Bingxin Zhou
PDF OpenReview
SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound Rishit Dagli, Shivesh Prakash, Robert Wu, Houman Khosravani
PDF OpenReview
Seeded LoRA: Collaborative Fine-Tuning Through Seed Initialization of Adapters Alejandro R. Salamanca, Ahmet Üstün, Nicki Skafte Detlefsen, Tim Dettmers
PDF OpenReview
Segmentation CNNs Are Denoising Models Luis A. Zavala-Mondragón, Ruud Van Sloun, Peter H.N. de With, Fons van der Sommen
PDF OpenReview
Self-Cognition in Large Language Models: An Exploratory Study Dongping Chen, Jiawen Shi, Neil Zhenqiang Gong, Yao Wan, Pan Zhou, Lichao Sun
PDF OpenReview
Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller Min Cai, Yuchen Zhang, Shichang Zhang, Fan Yin, Difan Zou, Yisong Yue, Ziniu Hu
PDF OpenReview
Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller Min Cai, Yuchen Zhang, Shichang Zhang, Fan Yin, Difan Zou, Yisong Yue, Ziniu Hu
PDF OpenReview
Self-Exploring Language Models: Active Preference Elicitation for Online Alignment Shenao Zhang, Donghan Yu, Hiteshi Sharma, Ziyi Yang, Shuohang Wang, Hany Hassan Awadalla, Zhaoran Wang
PDF OpenReview
Self-Play Preference Optimization for Language Model Alignment Yue Wu, Zhiqing Sun, Huizhuo Yuan, Kaixuan Ji, Yiming Yang, Quanquan Gu
PDF OpenReview
Self-Supervised Detection of Perfect and Partial Input-Dependent Symmetries Alonso Urbano, David W. Romero
PDF OpenReview
Self-Supervised Learning for Crystal Property Prediction via Denoising Alexander New, Nam Q Le, Michael Pekala, Christopher D Stiles
PDF OpenReview
Semantic Entropy Probes: Robust and Cheap Hallucination Detection in LLMs Jiatong Han, Jannik Kossen, Muhammed Razzak, Lisa Schut, Shreshth A Malik, Yarin Gal
PDF OpenReview
SemioLLM: Assessing Large Language Models for Semiological Analysis in Epilepsy Research Meghal Dani, Muthu Jeyanthi Prakash, Zeynep Akata, Stefanie Liebe
PDF OpenReview
Sequential Decision Making with Expert Demonstrations Under Unobserved Heterogeneity Vahid Balazadeh, Keertana Chidambaram, Viet Nguyen, Rahul Krishnan, Vasilis Syrgkanis
PDF OpenReview
Serial Monopoly on Blockchains with Quasi-Patient Users Paolo Penna, Manvir Schneider
PDF OpenReview
Setting the Record Straight on Transformer Oversmoothing Gbetondji Jean-Sebastien Dovonon, Michael M. Bronstein, Matt Kusner
PDF OpenReview
SGD vs GD: Rank Deficiency in Linear Networks Aditya Varre, Margarita Sagitova, Nicolas Flammarion
PDF OpenReview
Shall We Team up: Exploring Spontaneous Cooperation of Competing LLM Agents Zengqing Wu, Brian I. Kwon, Shuyuan Zheng, Qianying Liu, Xu Han, Makoto Onizuka, Shaojie Tang, Run Peng, Chuan Xiao
PDF OpenReview
Sheaf Diffusion Goes Nonlinear: Enhancing GNNs with Adaptive Sheaf Laplacians Olga Zaghen, Antonio Longa, Steve Azzolin, Lev Telyatnikov, Andrea Passerini, Pietro Lio
PDF OpenReview
SheetAgent: A Generalist Agent for Spreadsheet Reasoning and Manipulation via Large Language Models Yibin Chen, Yifu Yuan, Zeyu Zhang, Yan Zheng, Jinyi Liu, Fei Ni, Jianye Hao
PDF OpenReview
Should You Trust DQN? Aditya Gopalan, Gugan Thoppe
PDF OpenReview
SiBBlInGS: Similarity-Driven Building-Block Inference Using Graphs Across States Noga Mudrik, Gal Mishne, Adam Shabti Charles
PDF OpenReview
Simple and Effective Masked Diffusion Language Models Subham Sekhar Sahoo, Marianne Arriola, Aaron Gokaslan, Edgar Mariano Marroquin, Alexander M Rush, Yair Schiff, Justin T Chiu, Volodymyr Kuleshov
PDF OpenReview
Simple and Effective Masked Diffusion Language Models Subham Sekhar Sahoo, Marianne Arriola, Aaron Gokaslan, Edgar Mariano Marroquin, Alexander M Rush, Yair Schiff, Justin T Chiu, Volodymyr Kuleshov
PDF OpenReview
Simple Linear Attention Language Models Balance the Recall-Throughput Tradeoff Simran Arora, Sabri Eyuboglu, Michael Zhang, Aman Timalsina, Silas Alberti, Dylan Zinsley, James Zou, Atri Rudra, Christopher Re
PDF OpenReview
Simple, Unified Analysis of Johnson-Lindenstrauss with Applications Yingru Li
PDF OpenReview
Single Train Multi Deploy on Topology Search Spaces Using Kshot-Hypernet Jingyue Zhuge, Christian Mayr, Anand Subramoney, David Kappel
PDF OpenReview
SINR: Equivariant Neural Vector Fields David Ruhe, Patrick Forré
PDF OpenReview
Skill-Enhanced Reinforcement Learning Acceleration from Demonstrations Hanping Zhang, Yuhong Guo
PDF OpenReview
SkillAct: Using Skill Abstractions Improves LLM Agents Anthony Zhe Liu, Jongwook Choi, Sungryull Sohn, Yao Fu, Jaekyeom Kim, Dong-Ki Kim, Xinhe Wang, Jaewon Yoo, Honglak Lee
PDF OpenReview
Slicedit: Zero-Shot Video Editing with Text-to-Image Diffusion Models Using Spatio-Temporal Slices Nathaniel Cohen, Vladimir Kulikov, Matan Kleiner, Inbar Huberman-Spiegelglas, Tomer Michaeli
PDF OpenReview
Slicedit: Zero-Shot Video Editing with Text-to-Image Diffusion Models Using Spatio-Temporal Slices Nathaniel Cohen, Vladimir Kulikov, Matan Kleiner, Inbar Huberman-Spiegelglas, Tomer Michaeli
PDF OpenReview
Slow Games D Reusche, Christopher Goes, Nicolas Della Penna
PDF OpenReview
Smart Vision-Language Reasoners Denisa Roberts, Lucas Roberts
PDF OpenReview
Smoke and Mirrors in Causal Downstream Tasks Riccardo Cadei, Lukas Lindorfer, Sylvia Cremer, Cordelia Schmid, Francesco Locatello
PDF OpenReview
SMX: Sequential Monte Carlo Planning for Expert Iteration Edan Toledo, Matthew Macfarlane, Donal John Byrne, Siddarth Singh, Paul Duckworth, Alexandre Laterre
PDF OpenReview
Snapshot Reinforcement Learning: Leveraging Prior Trajectories for Efficiency Yanxiao Zhao, Yangge Qian, Tianyi Wang, Jingyang Shan, Xiaolin Qin
PDF OpenReview
SOLMformer - Incorporating Sequence and Observation Level Metadata for Categorical Time Series Modeling Yamini Vibha Ananth, Gregory Benton, Jingxing Fang, Jerry Junyang Cheung, Xu Chu, Cong Yu
PDF OpenReview
Sorting Out Quantum Monte Carlo Jack Richter-Powell, Luca Thiede, Alan Aspuru-Guzik, David Duvenaud
PDF OpenReview
Sparse Autoencoders Match Supervised Features for Model Steering on the IOI Task Aleksandar Makelov
PDF OpenReview
Sparse Network Initialization Using Deterministic Ramanujan Graphs Arindam Biswas, Suryam Arnav Kalra, Pabitra Mitra, Biswajit Basu
PDF OpenReview
Spatio-Spectral Graph Neural Networks Simon Geisler, Arthur Kosmala, Daniel Herbst, Stephan Günnemann
PDF OpenReview
SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengths Kaixuan Huang, Xudong Guo, Mengdi Wang
PDF OpenReview
Specify What? a Case-Study Using GPT-4 and Formal Methods for Specification Synthesis George Granberry, Wolfgang Ahrendt, Moa Johansson
PDF OpenReview
Spectral State Space Models Naman Agarwal, Daniel Suo, Xinyi Chen, Elad Hazan
PDF OpenReview
Spectrum-Informed Multistage Neural Network: Multiscale Function Approximator of Machine Precision Jakin Ng, Yongji Wang, Ching-Yao Lai
PDF OpenReview
Split, Unlearn, Merge: Leveraging Data Attributes for More Effective Unlearning in LLMs Swanand Kadhe, Farhan Ahmed, Dennis Wei, Nathalie Baracaldo, Inkit Padhi
PDF OpenReview
Stability Analysis of Equivariant Convolutional Representations Through the Lens of Equivariant Multi-Layered CKNs Soutrik Roy Chowdhury
PDF OpenReview
Stabilizing the Training of Consistency Models with Score Guidance Jeongjun Lee, Jonggeon Park, Jongmin Yoon, Juho Lee
PDF OpenReview
Stable Differentiable Causal Discovery Achille Nazaret, Justin Hong, Elham Azizi, David Blei
PDF OpenReview
State Space Models Are Comparable to Transformers in Estimating Functions with Dynamic Smoothness Naoki Nishikawa, Taiji Suzuki
PDF OpenReview
Steering Language Models with Game-Theoretic Solvers Ian Gemp, Roma Patel, Yoram Bachrach, Marc Lanctot, Vibhavari Dasagi, Luke Marris, Georgios Piliouras, Siqi Liu, Karl Tuyls
PDF OpenReview
Stein Variational Newton Neural Network Ensembles Klemens Flöge, Muhammad Abdul Moeed, Vincent Fortuin
PDF OpenReview
Step-on-Feet Tuning: Scaling Self-Alignment of LLMs via Bootstrapping Haoyu Wang, Guozheng Ma, Ziqiao Meng, Zeyu Qin, Li Shen, Zhong Zhang, Bingzhe Wu, Liu Liu, Yatao Bian, Tingyang Xu, Xueqian Wang, Peilin Zhao
PDF OpenReview
Stitching Manifolds: Leveraging Interaction to Compose Object Representations into Scenes. Hamza Keurti, Bernhard Schölkopf, Pau Vilimelis Aceituno, Benjamin F Grewe
PDF OpenReview
Stochastic Concept Bottleneck Models Moritz Vandenhirtz, Sonia Laguna, Ričards Marcinkevičs, Julia E Vogt
PDF OpenReview
Stochastic Concept Bottleneck Models Moritz Vandenhirtz, Sonia Laguna, Ričards Marcinkevičs, Julia E Vogt
PDF OpenReview
Strategist: Learning Strategic Skills by LLMs via Bi-Level Tree Search Jonathan Light, Min Cai, Weiqin Chen, Guanzhi Wang, Xiusi Chen, Wei Cheng, Yisong Yue, Ziniu Hu
PDF OpenReview
STREAM: Embodied Reasoning Through Code Generation Daniil Cherniavskii, Phillip Lippe, Andrii Zadaianchuk, Efstratios Gavves
PDF OpenReview
Stress-Testing Long-Context Language Models with Lifelong ICL and Task Haystack Xiaoyue Xu, Qinyuan Ye, Xiang Ren
PDF OpenReview
STRIDE: A Tool-Assisted LLM Agent Framework for Strategic and Interactive Decision-Making Chuanhao Li, Runhan Yang, Tiankai Li, Milad Bafarassat, Kourosh Sharifi, Dirk Bergemann, Zhuoran Yang
PDF OpenReview
STRIDE: A Tool-Assisted LLM Agent Framework for Strategic and Interactive Decision-Making Chuanhao Li, Runhan Yang, Tiankai Li, Milad Bafarassat, Kourosh Sharifi, Dirk Bergemann, Zhuoran Yang
PDF OpenReview
Strong Copyright Protection for Language Models via Adaptive Model Fusion Javier Abad, Konstantin Donhauser, Francesco Pinto, Fanny Yang
PDF OpenReview
Strongly Isomorphic Neural Optimal Transport Across Incomparable Spaces Athina Sotiropoulou, David Alvarez-Melis
PDF OpenReview
Structural Activity Prediction Models Recover Known Binding Modes (Poster Abstract) Michael Backenköhler, Joschka Groß, Paula Linh Kramer, Verena Wolf, Andrea Volkamer
PDF OpenReview
Structure- and Function-Aware Substitution Matrices via Differentiable Graph Matching Paolo Pellizzoni, Carlos Oliver, Karsten Borgwardt
PDF OpenReview
Structure-Based Drug Design Benchmark: Do 3D Methods Really Dominate? Kangyu Zheng, Yingzhou Lu, Zaixi Zhang, Zhongwei Wan, Yao Ma, Marinka Zitnik, Tianfan Fu
PDF OpenReview
Structured Generations: Using Hierarchical Clusters to Guide Diffusion Models Jorge da Silva Gonçalves, Laura Manduchi, Moritz Vandenhirtz, Julia E Vogt
PDF OpenReview
Sum-Max Submodular Bandits Stephen Pasteris, Alberto Rumi, Fabio Vitale, Nicolò Cesa-Bianchi
PDF OpenReview
Survival of the Fittest Representation: A Case Study with Modular Addition Xiaoman Delores Ding, Zifan Carl Guo, Eric J Michaud, Ziming Liu, Max Tegmark
PDF OpenReview
Survive on Planet Pandora: Robust Cross-Domain RL Under Distinct State-Action Representations Kuan-Chen Pan, MingHong Chen, Xi Liu, Ping-Chun Hsieh
PDF OpenReview
SVFT: Parameter-Efficient Fine-Tuning with Singular Vectors Vijay Lingam, Atula Tejaswi Neerkaje, Aditya Vavre, Aneesh Shetty, Gautham Krishna Gudur, Joydeep Ghosh, Eunsol Choi, Alex Dimakis, Aleksandar Bojchevski, Sujay Sanghavi
PDF OpenReview
SVFT: Parameter-Efficient Fine-Tuning with Singular Vectors Vijay Lingam, Atula Tejaswi Neerkaje, Aditya Vavre, Aneesh Shetty, Gautham Krishna Gudur, Joydeep Ghosh, Alex Dimakis, Eunsol Choi, Aleksandar Bojchevski, Sujay Sanghavi
PDF OpenReview
Swallowing the Bitter Pill: Simplified Scalable Conformer Generation Yuyang Wang, Ahmed A. A. Elhag, Navdeep Jaitly, Joshua M. Susskind, Miguel Ángel Bautista
PDF OpenReview
SWUS: Active Learning with Structure Weighted Uncertainty Score Andrea Karlova, Brooks Paige
PDF OpenReview
Symbolic Autoencoding for Self-Supervised Sequence Learning Mohammad Hossein Amani, Nicolas Baldwin, Amin Mansouri, Martin Josifoski, Maxime Peyrard, Robert West
PDF OpenReview
Symbolic Regression with a Learned Concept Library Arya Grayeli, Atharva Sehgal, Omar Costilla Reyes, Miles Cranmer, Swarat Chaudhuri
PDF OpenReview
Synthetic Data-Driven Prediction of Height for Childhood Malnutrition David Berthiaume, Yuan Tang, Chau Nguyen, Siyu Gai, Emilia Mazzolenis, Weiwei Pan
PDF OpenReview
TabMDA: Tabular Manifold Data Augmentation for Any Classifier Using Transformers with In-Context Subsetting Andrei Margeloiu, Adrián Bazaga, Nikola Simidjievski, Pietro Lio, Mateja Jamnik
PDF OpenReview
Tackling Polysemanticity with Neuron Embeddings Alex Foote
PDF OpenReview
TAGMol: Target-Aware Gradient-Guided Molecule Generation Vineeth Dorna, D. Subhalingam, Keshav Kolluru, Shreshth Tuli, Mrityunjay Singh, Saurabh Singal, N M Anoop Krishnan, Sayan Ranu
PDF OpenReview
Tail Extrapolation in Target-Aware Conditional Molecule Generation Weichi Yao, Cameron Gruich, Bryan Goldsmith, Yixin Wang
PDF OpenReview
Talking Nonsense: Probing Large Language Models' Understanding of Adversarial Gibberish Inputs Valeriia Cherepanova, James Zou
PDF OpenReview
TarDis: Achieving Robust and Structured Disentanglement of Multiple Covariates Kemal Inecik, Aleyna Kara, Antony Rose, Muzlifah Haniffa, Fabian J Theis
PDF OpenReview
Task Addition and Weight Disentanglement in Closed-Vocabulary Models Adam Hazimeh, Alessandro Favero, Pascal Frossard
PDF OpenReview
Task Addition in Multi-Task Learning by Geometrical Alignment Soorin Yim, Dae-Woong Jeong, Sung Moon Ko, Sumin Lee, Hyunseung Kim, Chanhui Lee, Sehui Han
PDF OpenReview
Task Descriptors Help Transformers Learn Linear Models In-Context Ruomin Huang, Rong Ge
PDF OpenReview
Teaching Dark Matter Simulations to Speak the Halo Language Shivam Pandey, Francois Lanusse, Chirag Modi, Benjamin Dan Wandelt
PDF OpenReview
Teaching Large Language Models to Reason with Reinforcement Learning Alexander Havrilla, Yuqing Du, Sharath Chandra Raparthy, Christoforos Nalmpantis, Jane Dwivedi-Yu, Eric Hambro, Sainbayar Sukhbaatar, Roberta Raileanu
PDF OpenReview
Teaching Transformers Causal Reasoning Through Axiomatic Training Aniket Vashishtha, Abhinav Kumar, Abbavaram Gowtham Reddy, Vineeth N. Balasubramanian, Amit Sharma
PDF OpenReview
Technical Report for ICML 2024 Automated Math Reasoning Challenge: Solving Optimization Problems with Open Source Large Language Model Duc M. Nguyen, Sungahn Ko
PDF OpenReview
Temporal Graph Rewiring with Expander Graphs Katarina Petrović, Shenyang Huang, Farimah Poursafaei, Petar Veličković
PDF OpenReview
Test-Time Adaptation with State-Space Models Mona Schirmer, Dan Zhang, Eric Nalisnick
PDF OpenReview
Test-Time Prototype Evolution for Generalizable Vision-Language Models Ce Zhang, Simon Stepputtis, Katia P. Sycara, Yaqi Xie
PDF OpenReview
Text Serialization and Their Relationship with the Conventional Paradigms of Tabular Machine Learning Simon Austin Lee, Kyoka Ono
PDF OpenReview
The Butterfly Effect: Tiny Perturbations Cause Neural Network Training to Diverge Gül Sena Altıntaş, Devin Kwok, David Rolnick
PDF OpenReview
The Concept Percolation Hypothesis: Analyzing the Emergence of Capabilities in Neural Networks Trained on Formal Grammars Ekdeep Singh Lubana, Kyogo Kawaguchi, Robert P. Dick, Hidenori Tanaka
PDF OpenReview
The Consensus Game: Language Model Generation via Equilibrium Search Athul Paul Jacob, Yikang Shen, Gabriele Farina, Jacob Andreas
PDF OpenReview
The Convolution-Closed Hurdle Motif with an Application to Tensor Decomposition John Hood, Aaron Schein
PDF OpenReview
The Convolution-Closed Hurdle Motif with an Application to Tensor Decomposition John Hood, Aaron Schein
PDF OpenReview
The Effect of Data Corruption on Multimodal Long Form Responses Daniel Z Kaplan, Alexis Roger, Mohamed Osman, Irina Rish
PDF OpenReview
The Efficacy of Pre-Training in Chemical Graph Out-of-Distribution Generalization Qi Liu, Rosa H. M. Chan, Rose Yu
PDF OpenReview
The Embodied World Model Based on LLM with Visual Information and Prediction-Oriented Prompts Wakana Haijima, Kou Nakakubo, Masahiro Suzuki, Yutaka Matsuo
PDF OpenReview
The Empirical Impact of Neural Parameter Symmetries, or Lack Thereof Derek Lim, Theo Putterman, Robin Walters, Haggai Maron, Stefanie Jegelka
PDF OpenReview
The GAN Is Dead; Long Live the GAN! a Modern Baseline GAN Nick Huang, Aaron Gokaslan, Volodymyr Kuleshov, James Tompkin
PDF OpenReview
The Geometry of Categorical and Hierarchical Concepts in Large Language Models Kiho Park, Yo Joong Choe, Yibo Jiang, Victor Veitch
PDF OpenReview
The Geometry of Categorical and Hierarchical Concepts in Large Language Models Kiho Park, Yo Joong Choe, Yibo Jiang, Victor Veitch
PDF OpenReview
The Geometry of Diffusion Models: Tubular Neighbourhoods and Singularities Kotaro Sakamoto, Ryosuke Sakamoto, Masato Tanabe, Masatomo Akagawa, Yusuke Hayashi, Manato Yaguchi, Masahiro Suzuki, Yutaka Matsuo
PDF OpenReview
The Hidden Pitfalls of the Cosine Similarity Loss Andrew Draganov, Sharvaree Vadgama, Erik J Bekkers
PDF OpenReview
The Implicit Bias of Adam on Separable Data Chenyang Zhang, Difan Zou, Yuan Cao
PDF OpenReview
The Importance of Online Data: Understanding Preference Fine-Tuning via Coverage Yuda Song, Gokul Swamy, Aarti Singh, Drew Bagnell, Wen Sun
PDF OpenReview
The Mamba in the Llama: Distilling and Accelerating Hybrid Models Junxiong Wang, Daniele Paliotta, Avner May, Alexander M Rush, Tri Dao
PDF OpenReview
The Minimax Regret of Sequential Probability Assignment, Contextual Shtarkov Sums, and Contextual Normalized Maximum Likelihood Ziyi Liu, Idan Attias, Daniel M. Roy
PDF OpenReview
The Missing Curve Detectors of InceptionV1: Applying Sparse Autoencoders to InceptionV1 Early Vision Liv Gorton
PDF OpenReview
The NGT200 Dataset - Geometric Multi-View Isolated Sign Recognition Oline Ranum, David Wessels, Gomèr Otterspeer, Erik J Bekkers, Floris Roelofsen, Jari I. Andersen
PDF OpenReview
The Optimization Landscape of Spectral Neural Network Chenghui Li, Rishi Sonthalia, Nicolas Garcia Trillos
PDF OpenReview
The Price of Freedom: Exploring Tradeoffs Between Expressivity and Computational Efficiency in Equivariant Tensor Products YuQing Xie, Ameya Daigavane, Mit Kotak, Tess Smidt
PDF OpenReview
The Pupil Becomes the Master: Eye-Tracking Feedback for Tuning LLMs Samuel Kiegeland, David Robert Reich, Ryan Cotterell, Lena Ann Jäger, Ethan Wilcox
PDF OpenReview
The Remarkable Robustness of LLMs: Stages of Inference? Vedang Lad, Wes Gurnee, Max Tegmark
PDF OpenReview
The Scaling Law in Astronomical Time Series Data Jia-Shu Pan, Yuan-Sen Ting, Jie Yu, Yang Huang, Ji-Feng Liu
PDF OpenReview
The Value of Reward Lookahead in Reinforcement Learning Nadav Merlis, Dorian Baudry, Vianney Perchet
PDF OpenReview
Theoretical Analyses of Hyperparameter Selection in Graph-Based Semi-Supervised Learning Ally Yalei Du, Eric Huang, Dravyansh Sharma
PDF OpenReview
Think Big, Generate Quick: LLM-to-SLM for Fast Autoregressive Decoding Benjamin Bergner, Andrii Skliar, Amelie Royer, Tijmen Blankevoort, Yuki M Asano, Babak Ehteshami Bejnordi
PDF OpenReview
Thinking Out-of-the-Box: A Comparative Investigation of Human and LLMs in Creative Problem-Solving Yufei Tian, Abhilasha Ravichander, Lianhui Qin, Ronan Le Bras, Raja Marjieh, Nanyun Peng, Yejin Choi, Thomas L. Griffiths, Faeze Brahman
PDF OpenReview
Three Mechanisms of Feature Learning in an Analytically Solvable Model Yizhou Xu, Liu Ziyin
PDF OpenReview
Tight Bounds for Online Convex Optimization with Adversarial Constraints Abhishek Sinha, Rahul Vaze
PDF OpenReview
TimeDiT: General-Purpose Diffusion Transformers for Time Series Foundation Model Defu Cao, Wen Ye, Yan Liu
PDF OpenReview
TinyAgent: Quantization-Aware Model Compression and Adaptation for On-Device LLM Agent Deployment Jason Kong, Lanxiang Hu, Flavio Ponzina, Tajana Rosing
PDF OpenReview
TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones Zhengqing Yuan, Zhaoxu Li, Weiran Huang, Yanfang Ye, Lichao Sun
PDF OpenReview
To Compete or to Collude: Builder Incentives in MEV-Boost Auctions Fei Wu, Thomas Thiery, Stefanos Leonardos, Carmine Ventre
PDF OpenReview
Tokenized SAEs: Disentangling SAE Reconstructions Thomas Dooms, Daniel Wilhelm
PDF OpenReview
Topological and Dynamical Representations for Radio Frequency Signal Classification Tegan Emerson, Timothy Doster, Colin C Olson, Audun Myers
PDF OpenReview
Topological Neural Networks Go Persistent, Equivariant and Continuous Yogesh Verma, Amauri H Souza, Vikas Garg
PDF OpenReview
Topology-Informed Graph Transformer Yun Young Choi, Sun Woo Park, Minho Lee, Youngho Woo
PDF OpenReview
Toward Global Convergence of Gradient EM for Over-Parameterized Gaussian Mixture Models Weihang Xu, Maryam Fazel, Simon Shaolei Du
PDF OpenReview
Towards Adaptive Attacks on Constrained Tabular Machine Learning Thibault Simonetto, Salah Ghamizi, Maxime Cordy
PDF OpenReview
Towards Adversarially Robust Vision-Language Models: Insights from Design Choices and Prompt Formatting Techniques Rishika Bhagwatkar, Shravan Nayak, Reza Bayat, Alexis Roger, Daniel Z Kaplan, Pouya Bashivan, Irina Rish
PDF OpenReview
Towards Adversarially Robust Vision-Language Models: Insights from Design Choices and Prompt Formatting Techniques Rishika Bhagwatkar, Shravan Nayak, Reza Bayat, Alexis Roger, Daniel Z Kaplan, Pouya Bashivan, Irina Rish
PDF OpenReview
Towards Aligning Language Models with Textual Feedback Saüc Abadal Lloret, Shehzaad Dhuliawala, Keerthiram Murugesan, Mrinmaya Sachan
PDF OpenReview
Towards Bridging Classical and Neural Computation Through a Read-Eval-Print Loop David W. Zhang, Michaël Defferrard, Corrado Rainone, Roland Memisevic
PDF OpenReview
Towards Detailed and Interpretable Hybrid Modeling of Continental-Scale Bird Migration Fiona Lippert, Bart Kranstauber, Patrick Forré, Emiel van Loon
PDF OpenReview
Towards Dynamic Feature Acquisition on Medical Time Series by Maximizing Conditional Mutual Information Fedor Sergeev, Paola Malsot, Gunnar Ratsch, Vincent Fortuin
PDF OpenReview
Towards Efficient and Scalable Training of Differentially Private Deep Learning Sebastian Rodriguez Beltran, Marlon Tobaben, Niki Andreas Loppi, Antti Honkela
PDF OpenReview
Towards Efficient Large-Scale Language-3D Representation Learning Shentong Mo, Xiaogang Xu, Tongzhou Wang, Antonio Torralba, Shuang Li
PDF OpenReview
Towards Empowerment Gain Through Causal Structure Learning in Model-Based RL Hongye Cao, Fan Feng, Meng Fang, Shaokang Dong, Jing Huo, Yang Gao
PDF OpenReview
Towards Enforcing Hard Physics Constraints in Operator Learning Frameworks Valentin Duruisseaux, Miguel Liu-Schiaffini, Julius Berner, Anima Anandkumar
PDF OpenReview
Towards General Geometries for Embedding Knowledge Graphs Samuel G. Fadel, Tino Paulsen, Sebastian Mair
PDF OpenReview
Towards Generalizable Particle Picking in Cryo-EM Images by Leveraging Masked AutoEncoder Andreas Zamanos, Panagiotis Koromilas, Giorgos Bouritsas, Panagiotis L. Kastritis, Yannis Panagakis
PDF OpenReview
Towards Human-AI Collaboration in Healthcare: Guided Deferral Systems with Large Language Models Joshua Strong, Qianhui Men, Alison Noble
PDF OpenReview
Towards Linking Graph Topology to Model Performance for Biomedical Knowledge Graph Completion Alberto Cattaneo, Thomas Martynec, Stephen Bonner, Carlo Luschi, Daniel Justus
PDF OpenReview
Towards Reliable Uncertainty Estimates for Drug Discovery: A Large-Scale Temporal Study of Probability Calibration Hannah Rosa Friesacher, Emma Svensson, Adam Arany, Lewis Mervin, Ola Engkvist
PDF OpenReview
Towards Safe Large Language Models for Medicine Tessa Han, Aounon Kumar, Chirag Agarwal, Himabindu Lakkaraju
PDF OpenReview
Towards Safe Large Language Models for Medicine Tessa Han, Aounon Kumar, Chirag Agarwal, Himabindu Lakkaraju
PDF OpenReview
Towards Safe Large Language Models for Medicine Tessa Han, Aounon Kumar, Chirag Agarwal, Himabindu Lakkaraju
PDF OpenReview
Towards Smaller Language Models via Layer Looping Sabri Eyuboglu, Dylan Zinsley, Jon Saad-Falcon, Simran Arora, Atri Rudra, James Zou, Christopher Re
PDF OpenReview
Towards the Transferability of Rewards Recovered via Regularized Inverse Reinforcement Learning Andreas Schlaginhaufen, Maryam Kamgarpour
PDF OpenReview
Towards Zero-Shot Generalization in Offline Reinforcement Learning Zhiyong Wang, Chen Yang, John C.S. Lui, Dongruo Zhou
PDF OpenReview
Trace Is the New AutoDiff — Unlocking Efficient Optimization of Computational Workflows Ching-An Cheng, Allen Nie, Adith Swaminathan
PDF OpenReview
TracrBench: Generating Interpretability Testbeds with Large Language Models Hannes Thurnherr, Jérémy Scheurer
PDF OpenReview
Train Your Cake and Eat It Too! Repurposing Collaborative Training to Tailor LLMs to Private Data Without Sharing Boris Radovič, Mohammed Aljahdali, Marco Canini, Veljko Pejović, Zuhair Khayyat
PDF OpenReview
Training Compute-Optimal Protein Language Models Xingyi Cheng, Bo Chen, Pan Li, Jing Gong, Jie Tang, Le Song
PDF OpenReview
Training Compute-Optimal Protein Language Models Xingyi Cheng, Bo Chen, Pan Li, Jing Gong, Jie Tang, Le Song
PDF OpenReview
Training Energy-Efficient Large Language Models Leveraging Equilibrium Driven Bio-Plausible Neural Dynamics Malyaban Bal, Abhronil Sengupta
PDF OpenReview
Training-Free Acceleration of ViTs with Delayed Spatial Merging Jung Hwan Heo, Seyedarmin Azizi, Arash Fayyazi, Massoud Pedram
PDF OpenReview
Training-Free Design of Augmentations with Data-Centric Principles Jieke Wu, Wei Huang, Mingyuan Bai, Xiaoling Hu, Yi Duan, Wuyang Chen
PDF OpenReview
Transcoders Find Interpretable LLM Feature Circuits Jacob Dunefsky, Philippe Chlenski, Neel Nanda
PDF OpenReview
Transductive Active Learning with Application to Safe Bayesian Optimization Jonas Hübotter, Bhavya Sukhija, Lenart Treven, Yarden As, Andreas Krause
PDF OpenReview
Transfer Learning in Multi-Fidelity Surrogate Modeling: A Wind Farm Case Dichang Zhang, Zexia Zhang, Christian Santoni, Ali Khosronejad, Dimitris Samaras
PDF OpenReview
Transferability for Graph Convolutional Networks Christian Koke, Abhishek Saroha, Yuesong Shen, Marvin Eisenberger, Michael M. Bronstein, Daniel Cremers
PDF OpenReview
Transferable Reinforcement Learning via Generalized Occupancy Models Chuning Zhu, Xinqi Wang, Tyler Han, Simon Shaolei Du, Abhishek Gupta
PDF OpenReview
Transferable Reinforcement Learning via Generalized Occupancy Models Chuning Zhu, Xinqi Wang, Tyler Han, Simon Shaolei Du, Abhishek Gupta
PDF OpenReview
Transformer Conformal Prediction for Time Series Junghwan Lee, Chen Xu, Yao Xie
PDF OpenReview
Transformer Designs for In-Context Learning in Foundation Models for Time Series Forecasting with Covariates Afrin Dange, Vaibhav Raj, Praneeth Netrapalli, Sunita Sarawagi
PDF OpenReview
Transformer Efficiently Learns Low-Dimensional Target Functions In-Context Yujin Song, Denny Wu, Kazusato Oko, Taiji Suzuki
PDF OpenReview
Transformer Neural Autoregressive Flows Massimiliano Patacchiola, Aliaksandra Shysheya, Katja Hofmann, Richard E. Turner
PDF OpenReview
Transformers Are Minimax Optimal Nonparametric In-Context Learners Juno Kim, Tai Nakamaki, Taiji Suzuki
PDF OpenReview
Transformers Are Minimax Optimal Nonparametric In-Context Learners Juno Kim, Tai Nakamaki, Taiji Suzuki
PDF OpenReview
Transformers as Stochastic Optimizers Ryuichiro Hataya, Masaaki Imaizumi
PDF OpenReview
Transformers Can Do Arithmetic with the Right Embeddings Sean Michael McLeish, Arpit Bansal, Alex Stein, Neel Jain, John Kirchenbauer, Brian R. Bartoldson, Bhavya Kailkhura, Abhinav Bhatele, Jonas Geiping, Avi Schwarzschild, Tom Goldstein
PDF OpenReview
Transformers Can Perform Distributionally-Robust Optimisation Through In-Context Learning Taeyoung Kim, Hongseok Yang
PDF OpenReview
Transformers Learn Temporal Difference Methods for In-Context Reinforcement Learning Jiuqi Wang, Ethan H Blaser, Hadi Daneshmand, Shangtong Zhang
PDF OpenReview
Transformers Need Glasses! Information Over-Squashing in Language Tasks Federico Barbero, Andrea Banino, Steven Kapturowski, Dharshan Kumaran, João Guilherme Madeira Araújo, Alex Vitvitskyi, Razvan Pascanu, Petar Veličković
PDF OpenReview
Transformers on Markov Data: Constant Depth Suffices Nived Rajaraman, Marco Bondaschi, Ashok Vardhan Makkuva, Kannan Ramchandran, Michael Gastpar
PDF OpenReview
Transformers with Stochastic Competition for Tabular Data Modelling Andreas Voskou, Charalambos Christoforou, Sotirios Chatzis
PDF OpenReview
Transforming a Non-Differentiable Rasterizer into a Differentiable One with Stochastic Gradient Estimation Thomas Deliot, Eric Heitz, Laurent Belcour
PDF OpenReview
Tree of Attacks: Jailbreaking Black-Box LLMs Automatically Anay Mehrotra, Manolis Zampetakis, Paul Kassianik, Blaine Nelson, Hyrum S Anderson, Yaron Singer, Amin Karbasi
PDF OpenReview
TriageAgent: Towards Better Multi-Agents Collaborations for Large Language Model-Based Clinical Triage Meng Lu, Ho Brandon, Ren Dennis, Xuan Wang
PDF OpenReview
TriLM vs FloatLM: Ternary LLMs Are More Performant than Quantized FP16 LLMs Ayush Kaushal, Tejas Vaidhya, Tejas Pandey, Aaryan Bhagat, Irina Rish
PDF OpenReview
Truly No-Regret Learning in Constrained MDPs Adrian Müller, Pragnya Alatur, Volkan Cevher, Giorgia Ramponi, Niao He
PDF OpenReview
TrustAgent: Towards Safe and Trustworthy LLM-Based Agents Through Agent Constitution Wenyue Hua, Xianjun Yang, Mingyu Jin, Zelong Li, Wei Cheng, Ruixiang Tang, Yongfeng Zhang
PDF OpenReview
Truthful Aggregation of LLMs\\ with an Application to Online Advertising Ermis Soumalias, Michael Curry, Sven Seuken
PDF OpenReview
Tuning-Free Alignment of Diffusion Models with Direct Noise Optimization Zhiwei Tang, Jiangweizhi Peng, Jiasheng Tang, Mingyi Hong, Fan Wang, Tsung-Hui Chang
PDF OpenReview
Two-Level Test-Time Adaptation in Multimodal Learning Jixiang Lei, Franz Pernkopf
PDF OpenReview
U-μP: The Unit-Scaled Maximal Update Parametrization Charlie Blake, Constantin Eichenberg, Josef Dean, Lukas Balles, Luke Yuri Prince, Björn Deiseroth, Andres Felipe Cruz-Salinas, Carlo Luschi, Samuel Weinbach, Douglas Orr
PDF OpenReview
U-μP: The Unit-Scaled Maximal Update Parametrization Charlie Blake, Constantin Eichenberg, Josef Dean, Lukas Balles, Luke Yuri Prince, Björn Deiseroth, Andres Felipe Cruz-Salinas, Carlo Luschi, Samuel Weinbach, Douglas Orr
PDF OpenReview
UHCone: Universal Hyperbolic Cone for Implicit Hierarchical Learning Menglin Yang, Jiahong Liu, Irwin King, Rex Ying
PDF OpenReview
Unavoidable Learning Constraints Alter the Foundations of Direct Preference Optimization David Wipf
PDF OpenReview
Uncertainty-Aware Preference Alignment in Reinforcement Learning from Human Feedback Sheng Xu, Bo Yue, Hongyuan Zha, Guiliang Liu
PDF OpenReview
Uncertainty-Aware Surrogate Models for Airfoil Flow Simulations with Denoising Diffusion Probabilistic Models Qiang Liu, Nils Thuerey
PDF OpenReview
Uncovering a Culture of AI Grassroots Experimentation by Boston City Employees: Safety Risks and Mitigation Jude Ha, Audrey Xing-Yun Chang
PDF OpenReview
Uncovering Latent Memories: Assessing Data Leakage and Memorization Patterns in Large Language Models Sunny Duan, Mikail Khona, Abhiram Iyer, Rylan Schaeffer, Ila R Fiete
PDF OpenReview
Uncovering Latent Memories: Assessing Data Leakage and Memorization Patterns in Large Language Models Sunny Duan, Mikail Khona, Abhiram Iyer, Rylan Schaeffer, Ila R Fiete
PDF OpenReview
Understanding Adversarially Robust Generalization via Weight-Curvature Index Yuelin Xu, Xiao Zhang
PDF OpenReview
Understanding and Minimising Outlier Features in Neural Network Training Bobby He, Lorenzo Noci, Daniele Paliotta, Imanol Schlag, Thomas Hofmann
PDF OpenReview
Understanding and Minimising Outlier Features in Neural Network Training Bobby He, Lorenzo Noci, Daniele Paliotta, Imanol Schlag, Thomas Hofmann
PDF OpenReview
Understanding and Mitigating Tokenization Bias in Language Models Buu Phan, Marton Havasi, Matthew J. Muckley, Karen Ullrich
PDF OpenReview
Understanding Counting in Small Transformers: The Interplay Between Attention and Feed-Forward Layers Freya Behrens, Luca Biggio, Lenka Zdeborova
PDF OpenReview
Understanding Hallucinations in Diffusion Models Through Mode Interpolation Sumukh K Aithal, Pratyush Maini, Zachary Chase Lipton
PDF OpenReview
Understanding Inhibition Through Maximally Tense Images Christopher J Hamblin, Srijani Saha, Talia Konkle, George A. Alvarez
PDF OpenReview
Understanding Nonlinear Implicit Bias via Region Counts in Input Space Jingwei Li, Jing Xu, Zifan Wang, Huishuai Zhang, Jingzhao Zhang
PDF OpenReview
Understanding the Cognitive Complexity in Language Elicited by Product Images Yan-Ying Chen, Shabnam Hakimi, Monica P Van, Francine Chen, Matthew K Hong, Matthew Klenk, Charlene C. Wu
PDF OpenReview
Understanding the Role of Equivariance in Self-Supervised Learning Yifei Wang, Kaiwen Hu, Sharut Gupta, Ziyu Ye, Yisen Wang, Stefanie Jegelka
PDF OpenReview
Understanding the Role of Functional Diversity in Weight-Ensembling with Ingredient Selection and Multidimensional Scaling Alex Rojas, David Alvarez-Melis
PDF OpenReview
Unfamiliar Finetuning Examples Control How Language Models Hallucinate Katie Kang, Eric Wallace, Claire Tomlin, Aviral Kumar, Sergey Levine
PDF OpenReview
Unfamiliar Finetuning Examples Control How Language Models Hallucinate Katie Kang, Eric Wallace, Claire Tomlin, Aviral Kumar, Sergey Levine
PDF OpenReview
Unfamiliar Finetuning Examples Control How Language Models Hallucinate Katie Kang, Eric Wallace, Claire Tomlin, Aviral Kumar, Sergey Levine
PDF OpenReview
Unfolding Time: Generative Modeling for Turbulent Flows in 4D Abdullah Saydemir, Marten Lienen, Stephan Günnemann
PDF OpenReview
Unified Taxonomy in AI Safety: Watermarks, Adversarial Defenses, and Transferable Attacks Grzegorz Gluch, Sai Ganesh Nagarajan, Berkant Turan
PDF OpenReview
Uniform Last-Iterate Guarantee for Bandits and Reinforcement Learning Junyan Liu, Yunfan Li, Ruosong Wang, Lin Yang
PDF OpenReview
Universal Self-Consistency for Large Language Models Xinyun Chen, Renat Aksitov, Uri Alon, Jie Ren, Kefan Xiao, Pengcheng Yin, Sushant Prakash, Charles Sutton, Xuezhi Wang, Denny Zhou
PDF OpenReview
Unlocking the Global Synergies in Low-Rank Adapters Zixi Zhang, Cheng Zhang, Xitong Gao, Robert D. Mullins, George Anthony Constantinides, Yiren Zhao
PDF OpenReview
Unlocking Tokens as Data Points for Generalization Bounds on Larger Language Models Sanae Lotfi, Yilun Kuang, Marc Anton Finzi, Brandon Amos, Micah Goldblum, Andrew Gordon Wilson
PDF OpenReview
Unmixing Noise from Hawkes Process to Model Learned Physiological Events Guillaume Staerman, Virginie Loison, Thomas Moreau
PDF OpenReview
Unsupervised Feature Extraction from a Foundation Model Zoo for Cell Similarity Search in Oncological Microscopy Across Devices Gabriel Kalweit, Anusha Klett, Mehdi Naouar, Jens Rahnfeld, Yannick Vogt, Diana Laura Infante Ramirez, Rebecca Berger, Jesus Duque Afonso, Tanja Nicole Hartmann, Marie Follo, Michael Luebbert, Roland Mertelsmann, Evelyn Ullrich, Joschka Boedecker, Maria Kalweit
PDF OpenReview
Unsupervised Ground Metric Learning with Tree Wasserstein Distance Kira Michaela Düsterwald, Makoto Yamada
PDF OpenReview
Unveiling CLIP Dynamics: Linear Mode Connectivity and Generalization Alireza Abdollahpourrostam, Amartya Sanyal, Seyed-Mohsen Moosavi-Dezfooli
PDF OpenReview
Unveiling Induction Heads: Provable Training Dynamics and Feature Learning in Transformers Siyu Chen, Heejune Sheen, Tianhao Wang, Zhuoran Yang
PDF OpenReview
Upper Error Bounds for Score-Based Inverse Problem Solving in Imaging Irina Dobrianski, Dominik Narnhofer, Thomas Pock
PDF OpenReview
UPS: Efficiently Building Foundation Models for PDE Solving via Cross-Modal Adaptation Junhong Shen, Tanya Marwah, Ameet Talwalkar
PDF OpenReview
USCILab3D: A Large-Scale, Long-Term, Semantically Annotated Outdoor Dataset Kiran Lekkala, Henghui Bao, Peixu Cai, Wei Zer Lim, Chen Liu, Laurent Itti
PDF OpenReview
Using Degeneracy in the Loss Landscape for Mechanistic Interpretability Lucius Bushnaq, Jake Mendel, Stefan Heimersheim, Dan Braun, Nicholas Goldowsky-Dill, Kaarel Hänni, Cindy Wu, Marius Hobbhahn
PDF OpenReview
Using Gradients to Check Sensitivity of MCMC-Based Analyses to Removing Data Tin D. Nguyen, Ryan James Giordano, Rachael Meager, Tamara Broderick
PDF OpenReview
Using Large Language Models for Humanitarian Frontline Negotiation: Opportunities and Considerations Zilin Ma, Susannah Cheng Su, Nathan Zhao, Linn Bieske, Blake Bullwinkel, Yanyi Zhang, Jinglun Gao, Gekai Liao, Siyao Li, Ziqing Luo, Boxiang Wang, Zihan Wen, Yanrui Yang, Claude Bruderlein, Weiwei Pan
PDF OpenReview
VACoDe: Visual Augmented Contrastive Decoding Sihyeon Kim, Boryeong Cho, Sangmin Bae, Sumyeong Ahn, Se-Young Yun
PDF OpenReview
Variable Star Light Curves in Koopman Space Mario Pasquato, Gaia Carenini, Nicolas Mekhaël, Vittorio F. Braga, Piero Trevisan, Giuseppe Bono, Yashar Hezaveh
PDF OpenReview
Variance Reduction of Diffusion Model's Gradients with Taylor Approximation-Based Control Variate Paul Jeha, Will Sussman Grathwohl, Michael Riis Andersen, Carl Henrik Ek, Jes Frellsen
PDF OpenReview
Variance-Dependent Regret Bounds for Nonstationary Linear Bandits Zhiyong Wang, Jize Xie, Yi Chen, John C.S. Lui, Dongruo Zhou
PDF OpenReview
Variational and Explanatory Neural Networks for Encoding Cancer Profiles and Predicting Drug Responses Tianshu Feng, Rohan Gnanaolivu, Abolfazl Safikhani, Yuanhang Liu, Jun Jiang, Nicholas Chia, Alexander Partin, Priyanka Vasanthakumari, Yitan Zhu, Chen Wang
PDF OpenReview
Variational Inference Failures Under Model Symmetries: Permutation Invariant Posteriors for Bayesian Neural Networks Yoav Gelberg, Tycho F. A. van der Ouderaa, Mark van der Wilk, Yarin Gal
PDF OpenReview
Variational Inference with Censored Gaussian Process Regressors Andrea Karlova, Rishabh Kabra, Daniel Augusto de Souza, Brooks Paige
PDF OpenReview
Variational Stochastic Gradient Descent for Deep Neural Networks Haotian Chen, Anna Kuzina, Babak Esmaeili, Jakub M. Tomczak
PDF OpenReview
Verbalized Machine Learning: Revisiting Machine Learning with Language Models Tim Z. Xiao, Robert Bamler, Bernhard Schölkopf, Weiyang Liu
PDF OpenReview
Verbalized Machine Learning: Revisiting Machine Learning with Language Models Tim Z. Xiao, Robert Bamler, Bernhard Schölkopf, Weiyang Liu
PDF OpenReview
VerityMath: Advancing Mathematical Reasoning by Self-Verification Through Unit Consistency Vernon Toh Yan Han, Ratish Puduppully, Nancy F. Chen
PDF OpenReview
VFA: Vision Frequency Analysis of Foundation Models and Human Mohammad Javad Darvishi Bayazi, Md Rifat Arefin, Jocelyn Faubert, Irina Rish
PDF OpenReview
VICtoR: Learning Hierarchical Vision-Instruction Correlation Rewards for Long-Horizon Manipulation Kuo-Han Hung, Pang-Chi Lo, Jia-Fong Yeh, Han-Yuan Hsu, Yi-Ting Chen, Winston H. Hsu
PDF OpenReview
Vid3D: Synthesis of Dynamic 3D Scenes Using 2D Video Diffusion Rishab Parthasarathy, Zachary Ankner, Aaron Gokaslan
PDF OpenReview
Video Watermarking: Safeguarding Your Video from (Unauthorized) Annotations by Video-Based LLMs Jinmin Li, Kuofeng Gao, Yang Bai, Jingyun Zhang, Shu-Tao Xia
PDF OpenReview
Vision-Language Models Provide Promptable Representations for Reinforcement Learning William Chen, Oier Mees, Aviral Kumar, Sergey Levine
PDF OpenReview
Vision-Language Models Provide Promptable Representations for Reinforcement Learning William Chen, Oier Mees, Aviral Kumar, Sergey Levine
PDF OpenReview
Vision-Language Models Provide Promptable Representations for Reinforcement Learning William Chen, Oier Mees, Aviral Kumar, Sergey Levine
PDF OpenReview
Vision-LSTM: xLSTM as Generic Vision Backbone Benedikt Alkin, Maximilian Beck, Korbinian Pöppel, Sepp Hochreiter, Johannes Brandstetter
PDF OpenReview
Visualizing Neural Network Imagination Nevan Wichers, Victor Tao, Riccardo Volpato, Fazl Barez
PDF OpenReview
vMF-Exp: Von Mises-Fisher Exploration of Large Action Sets with Hyperspherical Embeddings Walid Bendada, Guillaume Salha-Galvan, Romain Hennequin, Théo Bontempelli, Thomas Bouabça, Tristan Cazenave
PDF OpenReview
Von Mises Quasi-Processes for Bayesian Circular Regression Yarden Cohen, Alexandre Khae Wu Navarro, Jes Frellsen, Richard E. Turner, Raziel Riemer, Ari Pakman
PDF OpenReview
Wasserstein Modality Alignment Makes Your Multimodal Transformer More Robust Zhuo Zhi, Ziquan Liu, Qiangqiang Wu, Miguel R. D. Rodrigues
PDF OpenReview
Waterfall: Framework for Robust and Scalable Text Watermarking Gregory Kang Ruey Lau, Xinyuan Niu, Hieu Dao, Jiangwei Chen, Chuan-Sheng Foo, Bryan Kian Hsiang Low
PDF OpenReview
Weak-to-Strong Extrapolation Expedites Alignment Chujie Zheng, Ziqi Wang, Heng Ji, Minlie Huang, Nanyun Peng
PDF OpenReview
Weak-to-Strong Jailbreaking on Large Language Models Xuandong Zhao, Xianjun Yang, Tianyu Pang, Chao Du, Lei Li, Yu-Xiang Wang, William Yang Wang
PDF OpenReview
WebCanvas: Benchmarking Web Agents in Online Environments Yichen Pan, Dehan Kong, Sida Zhou, Cheng Cui, Yifei Leng, Bing Jiang, Hangyu Liu, Yanyi Shang, Shuyan Zhou, Tongshuang Wu, Zhengyang Wu
PDF OpenReview
WebCanvas: Benchmarking Web Agents in Online Environments Yichen Pan, Dehan Kong, Sida Zhou, Cheng Cui, Yifei Leng, Bing Jiang, Hangyu Liu, Yanyi Shang, Shuyan Zhou, Tongshuang Wu, Zhengyang Wu
PDF OpenReview
Weight-Based Decomposition: A Case for Bilinear MLPs Michael T Pearce, Thomas Dooms, Alice Rigg
PDF OpenReview
What Can VLMs Do for Zero-Shot Embodied Task Planning? Xian Fu, Min Zhang, Jianye Hao, Peilong Han, Hao Zhang, Lei Shi, Hongyao Tang
PDF OpenReview
What Can VLMs Do for Zero-Shot Embodied Task Planning? Xian Fu, Min Zhang, Jianye Hao, Peilong Han, Hao Zhang, Lei Shi, Hongyao Tang
PDF OpenReview
What Makes a Machine Learning Task a Good Candidate for an Equivariant Network? Scott Mahan, Davis Brown, Timothy Doster, Henry Kvinge
PDF OpenReview
What Makes and Breaks Safety Fine-Tuning? a Mechanistic Study Samyak Jain, Ekdeep Singh Lubana, Kemal Oksuz, Tom Joy, Philip Torr, Amartya Sanyal, Puneet K. Dokania
PDF OpenReview
When Are Bias-Free ReLU Networks like Linear Networks? Yedi Zhang, Andrew M Saxe, Peter E. Latham
PDF OpenReview
When Do Language Models Need to Be Large? Zhixun Chen, Yali Du, David Henry Mguni
PDF OpenReview
When Is Mean-Field Reinforcement Learning Tractable and Relevant? Batuhan Yardim, Artur Goldman, Niao He
PDF OpenReview
When to Sense and Control? a Time-Adaptive Approach for Continuous-Time RL Lenart Treven, Bhavya Sukhija, Yarden As, Florian Dorfler, Andreas Krause
PDF OpenReview
Where Do Large Learning Rates Lead Us? a Feature Learning Perspective Ildus Sadrtdinov, Maxim Kodryan, Eduard Pokonechny, Ekaterina Lobacheva, Dmitry Vetrov
PDF OpenReview
Why Do Recurrent Neural Networks Suddenly Learn? Bifurcation Mechanisms in Neuro-Inspired Short-Term Memory Tasks Udith Haputhanthri, Liam Storan, Yiqi Jiang, Adam Shai, Hakki Orhun Akengin, Mark Schnitzer, Fatih Dinc, Hidenori Tanaka
PDF OpenReview
Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive? Rylan Schaeffer, Hailey Schoelkopf, Brando Miranda, Gabriel Mukobi, Varun Madan, Adam Ibrahim, Herbie Bradley, Stella Biderman, Sanmi Koyejo
PDF OpenReview
Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive? Rylan Schaeffer, Hailey Schoelkopf, Brando Miranda, Gabriel Mukobi, Varun Madan, Adam Ibrahim, Herbie Bradley, Stella Biderman, Sanmi Koyejo
PDF OpenReview
Why Pruning and Conditional Computation Work: A High-Dimensional Perspective Erdem Koyuncu
PDF OpenReview
Why Transformers Need Adam: A Hessian Perspective Yushun Zhang, Congliang Chen, Tian Ding, Ziniu Li, Ruoyu Sun, Zhi-Quan Luo
PDF OpenReview
WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models Liwei Jiang, Kavel Rao, Seungju Han, Allyson Ettinger, Faeze Brahman, Sachin Kumar, Niloofar Mireshghallah, Ximing Lu, Maarten Sap, Nouha Dziri, Yejin Choi
PDF OpenReview
Wind Farm Control with Cooperative Multi-Agent Reinforcement Learning Claire Bizon Monroc, Ana Busic, Jiamin Zhu, Donatien Dubuc
PDF OpenReview
XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX Alexander Nikulin, Vladislav Kurenkov, Ilya Zisman, Artem Sergeevich Agarkov, Viacheslav Sinii, Sergey Kolesnikov
PDF OpenReview
xLSTM: Extended Long Short-Term Memory Maximilian Beck, Korbinian Pöppel, Markus Spanring, Andreas Auer, Oleksandra Prudnikova, Michael K Kopp, Günter Klambauer, Johannes Brandstetter, Sepp Hochreiter
PDF OpenReview
xLSTM: Extended Long Short-Term Memory Korbinian Pöppel, Maximilian Beck, Markus Spanring, Andreas Auer, Oleksandra Prudnikova, Michael K Kopp, Günter Klambauer, Johannes Brandstetter, Sepp Hochreiter
PDF OpenReview
xMINT: A Multimodal Integration Transformer for Xenium Gene Imputation Xiaohui Jiang, Yuxia Xie, Jichun Xie
PDF OpenReview
You Shall Pass: Dealing with the Zero-Gradient Problem in Predict and Optimize for Convex Optimization Grigorii Veviurko, Wendelin Boehmer, Mathijs de Weerdt
PDF OpenReview
Zero-Shot Generalization of GNNs over Distinct Attribute Domains Yangyi Shen, Beatrice Bevilacqua, Joshua Robinson, Charilaos Kanatsoulis, Jure Leskovec, Bruno Ribeiro
PDF OpenReview
Zero-Shot Generalization of GNNs over Distinct Attribute Domains Yangyi Shen, Beatrice Bevilacqua, Joshua Robinson, Charilaos Kanatsoulis, Jure Leskovec, Bruno Ribeiro
PDF OpenReview
Zero-Shot Unsupervised and Text-Based Audio Editing Using DDPM Inversion Hila Manor, Tomer Michaeli
PDF OpenReview
Zeroth-Order Fine-Tuning of LLMs with Extreme Sparsity Wentao Guo, Jikai Long, Yimeng Zeng, Zirui Liu, Xinyu Yang, Yide Ran, Jacob R. Gardner, Osbert Bastani, Christopher De Sa, Xiaodong Yu, Beidi Chen, Zhaozhuo Xu
PDF OpenReview
Zeroth-Order Fine-Tuning of LLMs with Extreme Sparsity Wentao Guo, Jikai Long, Yimeng Zeng, Zirui Liu, Xinyu Yang, Yide Ran, Jacob R. Gardner, Osbert Bastani, Christopher De Sa, Xiaodong Yu, Beidi Chen, Zhaozhuo Xu
PDF OpenReview
ZigMa: A DiT-Style Zigzag Mamba Diffusion Model Vincent Tao Hu, Stefan Andreas Baumann, Ming Gui, Olga Grebenkova, Pingchuan Ma, Johannes Schusterbauer, Björn Ommer
PDF OpenReview