ICMLW 2024

1500 papers

"You Just Can’t Go Around Killing People'' Explaining Agent Behavior to a Human Terminator Uri Menkes, Ofra Amir, Assaf Hallak

PDF OpenReview

(Almost) Smooth Sailing: Towards Numerical Stability of Neural Networks Through Differentiable Regularization of the Condition Number Rossen Nenov, Daniel Haider, Peter Balazs

PDF OpenReview

(Deep) Generative Geodesics Beomsu Kim, Michael Anthony Puthawala, Jong Chul Ye, Emanuele Sansone

PDF OpenReview

$\alpha$-Fair Contextual Bandits Siddhant Chaudhary, Abhishek Sinha

PDF OpenReview

$\bf{\Phi}_\textrm{Flow}$: Differentiable Simulations for Machine Learning Philipp Holl, Nils Thuerey

PDF OpenReview

$\mathbb{X}$-Sample Contrastive Loss: Improving Contrastive Learning with Sample Similarity Graphs Vlad Sobal, Mark Ibrahim, Randall Balestriero, Vivien Cabannes, Diane Bouchacourt, Pietro Astolfi, Kyunghyun Cho, Yann LeCun

PDF OpenReview

$\nabla \tau$: Gradient-Based and Task-Agnostic Machine Unlearning Daniel Trippa, Cesare Campagnano, Maria Sofia Bucarelli, Gabriele Tolomei, Fabrizio Silvestri

PDF OpenReview

2Bits of Protein: Efficient Protein Language Models at the Scale of 2-Bits Oliver M. Turnbull, Mohamed Baioumy, Charlotte Deane

PDF OpenReview

3D Reconstruction of Dark Matter Fields with Diffusion Models: Towards Application to Galaxy Surveys Core Francisco Park, Nayantara Mudur, Carolina Cuesta-Lazaro, Yueying Ni, Victoria Ono, Douglas Finkbeiner

PDF OpenReview

3D Shape Completion with Test-Time Training Michael Schopf-Kuester, Zorah Lähner, Michael Moeller

PDF OpenReview

A Bayesian Approach to Adversarially Robust Life Testing Dorina Weichert, Alexander Kister, Sebastian Houben, Gunar Ernis, Tim Wirtz

PDF OpenReview

A Best-of-Both-Worlds Algorithm for Bandits with Delayed Feedback with Robustness to Excessive Delays Saeed Masoudian, Julian Zimmert, Yevgeny Seldin

PDF OpenReview

A Case for Validation Buffer in Pessimistic Actor-Critic Michal Nauman, Mateusz Ostaszewski, Marek Cygan

PDF OpenReview

A Case-Based Reasoning Approach to Dynamic Few-Shot Prompting for Code Generation Dustin Dannenhauer, Zohreh Dannenhauer, Despina Christou, Kostas Hatalis

PDF OpenReview

A Classifier-Based Approach to Multi-Class Anomaly Detection Applied to Astronomical Time-Series Daniel Muthukrishna, Rithwik Gupta

PDF OpenReview

A Coding-Theoretic Analysis of Hyperspherical Prototypical Learning Geometry Martin Lindström, Borja Rodríguez Gálvez, Ragnar Thobaben, Mikael Skoglund

PDF OpenReview

A Critical Look at Tokenwise Reward-Guided Text Generation Ahmad Rashid, Ruotian Wu, Julia Grosse, Agustinus Kristiadi, Pascal Poupart

PDF OpenReview

A Deeper Look at Depth Pruning of LLMs Shoaib Ahmed Siddiqui, Xin Dong, Greg Heinrich, Thomas Breuel, Jan Kautz, David Krueger, Pavlo Molchanov

PDF OpenReview

A Differentiable Approach to Multi-Scale Brain Modeling Chaoming Wang, Muyang Lyu, Tianqiu Zhang, Sichao He, Si Wu

PDF OpenReview

A Differentiable Topological Notion of Local Maxima for Keypoint Detection Giovanni Barbarani, Francesco Vaccarino, Gabriele Trivigno, Marco Guerra, Gabriele Berton, Carlo Masone

PDF OpenReview

A Fast Learning-Based Surrogate of Electrical Machines Using a Reduced Basis Alejandro Ribes, Nawfal Benchekroun, Théo Delagnes

PDF OpenReview

A Framework for Differentiable Supervised Graph Prediction Paul Krzakala, Junjie Yang, Rémi Flamary, Florence d'Alché-Buc, Charlotte Laclau, Matthieu Labeau

PDF OpenReview

A Generative Foundation Model for Antibody Sequence Understanding Justin Barton, Aretas Gaspariunas, David A Yadin, Jorge Dias, Francesca L Nice, Danielle H Minns, Olivia Snudden, Chelsea Povall, Sara Valle Tomas, Harry Dobson, James H R Farmery, Jinwoo Leem, Jacob D Galson

PDF OpenReview

A Geometric Framework for Understanding Memorization in Generative Models Brendan Leigh Ross, Hamidreza Kamkari, Zhaoyan Liu, Tongzi Wu, George Stein, Gabriel Loaiza-Ganem, Jesse C. Cresswell

PDF OpenReview

A Geometric Framework for Understanding Memorization in Generative Models Brendan Leigh Ross, Hamidreza Kamkari, Zhaoyan Liu, Tongzi Wu, George Stein, Gabriel Loaiza-Ganem, Jesse C. Cresswell

PDF OpenReview

A Geometric View of Data Complexity: Efficient Local Intrinsic Dimension Estimation with Diffusion Models Hamidreza Kamkari, Brendan Leigh Ross, Rasa Hosseinzadeh, Jesse C. Cresswell, Gabriel Loaiza-Ganem

PDF OpenReview

A Hessian-Aware Stochastic Differential Equation for Modelling SGD Xiang Li, Zebang Shen, Liang Zhang, Niao He

PDF OpenReview

A Human-like Reasoning Framework for Multi-Phases Planning Task with Large Language Models Chengxing Xie, Difan Zou

PDF OpenReview

A Multi-View Mixture-of-Experts Based on Language and Graphs for Molecular Properties Prediction Victor Yukio Shirasuna, Eduardo Soares, Emilio Vital Brazil, Karen Fiorella Aquino Gutierrez, Renato Cerqueira, Seiji Takeda, Akihiro Kishimoto

PDF OpenReview

A Neural Material Point Method for Particle-Based Simulations Omer Rochman Sharabi, Sacha Lewin, Gilles Louppe

PDF OpenReview

A Peek into Token Bias: Large Language Models Are Not yet Genuine Reasoners Bowen Jiang, Yangxinyu Xie, Zhuoqun Hao, Xiaomeng Wang, Tanwi Mallick, Weijie J Su, Camillo Jose Taylor, Dan Roth

PDF OpenReview

A Phase Transition Between Positional and Semantic Learning in a Solvable Model of Dot-Product Attention Hugo Cui, Freya Behrens, Florent Krzakala, Lenka Zdeborova

PDF OpenReview

A Policy Optimization Approach to the Solution of Unregularized Mean Field Games Sihan Zeng, Sujay Bhatt, Alec Koppel, Sumitra Ganesh

PDF OpenReview

A Pontryagin Perspective on Reinforcement Learning Onno Eberhard, Claire Vernade, Michael Muehlebach

PDF OpenReview

A Practical Diffusion Path for Sampling Omar Chehab, Anna Korba

PDF OpenReview

A Random Matrix Analysis of Learning with Noisy Labels Aymane El Firdoussi, Mohamed El Amine Seddik

PDF OpenReview

A Recipe for Charge Density Prediction Xiang Fu, Andrew Scott Rosen, Kyle Bystrom, Rui Wang, Albert Musaelian, Boris Kozinsky, Tess Smidt, Tommi Jaakkola

PDF OpenReview

A Safe Exploration Approach to Constrained Markov Decision Processes Tingting Ni, Maryam Kamgarpour

PDF OpenReview

A Sim2Real Approach for Identifying Task-Relevant Properties in Interpretable Machine Learning Eura Nofshin, Esther Brown, Brian Lim, Weiwei Pan, Finale Doshi-Velez

PDF OpenReview

A Simple and Adaptive Learning Rate for FTRL in Online Learning with Minimax Regret of $\Theta(T^{2/3})$ and Its Application to Best-of-Both-Worlds Taira Tsuchiya, Shinji Ito

PDF OpenReview

A Simple and Expressive Graph Neural Network Based Method for Structural Link Representation Veronica Lachi, Francesco Ferrini, Antonio Longa, Bruno Lepri, Andrea Passerini

PDF OpenReview

A Statistical Framework for Weak-to-Strong Generalization Seamus Somerstep, Felipe Maia Polo, Moulinath Banerjee, Yaacov Ritov, Mikhail Yurochkin, Yuekai Sun

PDF OpenReview

A Systematic Comparison of fMRI-to-Video Reconstruction Techniques Camilo Luciano Fosco, Ben Lahner, Alex J Andonian, Bowen Pan, Aude Oliva

PDF OpenReview

A Theoretical Formulation of Many-Body Message Passing Neural Networks Jiatong Han

PDF OpenReview

A Theoretical Framework for Partially Observed Reward-States in RLHF Chinmaya Kausik, Mirco Mutti, Aldo Pacchiano, Ambuj Tewari

PDF OpenReview

A Theoretical Framework for Partially-Observed Reward States in RLHF Chinmaya Kausik, Mirco Mutti, Aldo Pacchiano, Ambuj Tewari

PDF OpenReview

A Theoretical Understanding of Self-Correction Through In-Context Alignment Yifei Wang, Yuyang Wu, Zeming Wei, Stefanie Jegelka, Yisen Wang

PDF OpenReview

A Theoretical Understanding of Self-Correction Through In-Context Alignment Yifei Wang, Yuyang Wu, Zeming Wei, Stefanie Jegelka, Yisen Wang

PDF OpenReview

A Tractable Inference Perspective of Offline RL Xuejie Liu, Anji Liu, Guy Van den Broeck, Yitao Liang

PDF OpenReview

A Unified Approach to Feature Learning in Bayesian Neural Networks Noa Rubin, Zohar Ringel, Inbar Seroussi, Moritz Helias

PDF OpenReview

A Unified Confidence Sequence for Generalized Linear Models, with Applications to Bandits Junghyun Lee, Se-Young Yun, Kwang-Sung Jun

PDF OpenReview

A Universal Class of Sharpness-Aware Minimization Algorithms Behrooz Tahmasebi, Ashkan Soleymani, Dara Bahri, Stefanie Jegelka, Patrick Jaillet

PDF OpenReview

A Variational Formulation of Reinforcement Learning in Infinite-Horizon Markov Decision Processes Tim G. J. Rudner

PDF OpenReview

AbFlex: Predicting the Conformational Flexibility of Antibody CDRs Fabian C Spoendlin, Wing Ki Wong, Guy Georges, Alexander Bujotzek, Charlotte Deane

PDF OpenReview

ABodyBuilder3: Improved and Scalable Antibody Structure Predictions Henry Kenlay, Frederic A Dreyer, Daniel Cutting, Daniel Allen Nissley, Charlotte Deane

PDF OpenReview

Abstract Understanding of Core-Knowledge Concepts: Humans vs. LLMs Alessandro B. Palmarini, Melanie Mitchell

PDF OpenReview

Accelerated Online Reinforcement Learning Using Auxiliary Start State Distributions Aman Mehra, Alexandre Capone, Jeff Schneider

PDF OpenReview

Accelerating Best-of-N via Speculative Rejection Ruiqi Zhang, Momin Haider, Ming Yin, Jiahao Qiu, Mengdi Wang, Peter Bartlett, Andrea Zanette

PDF OpenReview

Accelerating Best-of-N via Speculative Rejection Ruiqi Zhang, Momin Haider, Ming Yin, Jiahao Qiu, Mengdi Wang, Peter Bartlett, Andrea Zanette

PDF OpenReview

Accelerating Best-of-N via Speculative Rejection Ruiqi Zhang, Momin Haider, Ming Yin, Jiahao Qiu, Mengdi Wang, Peter Bartlett, Andrea Zanette

PDF OpenReview

Accelerating Electron Dynamics Simulations Through Machine Learned Time Propagators Karan Shah, Attila Cangi

PDF OpenReview

Accelerating NCE Convergence with Adaptive Normalizing Constant Computation Anish Sevekari, Rishal Aggarwal, Maria Chikina, David Koes

PDF OpenReview

Accelerating Simulation of Two-Phase Flows with Neural PDE Surrogates Yoeri Poels, Koen Minartz, Harshit Bansal, Vlado Menkovski

PDF OpenReview

Accelerating Statistical Inferences in Astrophysics with Neural Networks and Hamiltonian Monte Carlo Diego Gonzalez-Hernandez, Molly Wolfson, Joseph F. Hennawi

PDF OpenReview

Accelerating Statistical Inferences in Astrophysics with Neural Networks and Hamiltonian Monte Carlo Diego Gonzalez-Hernandez, Molly Wolfson, Joseph Hennawi

PDF OpenReview

Accelerating the Inference of String Generation-Based Chemical Reaction Models for Industrial Applications Mikhail Andronov, Natalia Andronova, Michael Wand, Djork-Arné Clevert, Jürgen Schmidhuber

PDF OpenReview

Accounting for Selection Effects in Supernova Cosmology with Simulation-Based Inference and Hierarchical Bayesian Modelling Benjamin M. Boyd, Matthew Grayling, Kaisey S. Mandel

PDF OpenReview

Accuracy on the Wrong Line: On the Pitfalls of Noisy Data for OOD Generalisation Amartya Sanyal, Yaxi Hu, Yaodong Yu, Yian Ma, Yixin Wang, Bernhard Schölkopf

PDF OpenReview

Acquiring Diverse Skills Using Curriculum Reinforcement Learning with Mixture of Experts Onur Celik, Aleksandar Taranovic, Gerhard Neumann

PDF OpenReview

Active Preference Optimization for Sample Efficient RLHF Nirjhar Das, Souradip Chakraborty, Aldo Pacchiano, Sayak Ray Chowdhury

PDF OpenReview

Active Propulsion Noise Shaping for Multi-Rotor Aircraft Localization Tamir Shor, Gabriele Serussi, Tom Hirshberg, Chaim Baskin, Alex M. Bronstein

PDF OpenReview

AdaInf: Adaptive Inference for Resource-Constrained Foundation Models Zhuoyan Xu, Khoi Duc Nguyen, Preeti Mukherjee, Somali Chaterji, Yingyu Liang, Yin Li

PDF OpenReview

Adam Exploits $\ell_\infty$-Geometry of Loss Landscape via Coordinate-Wise Adaptivity Shuo Xie, Mohamad Amin Mohamadi, Zhiyuan Li

PDF OpenReview

Adam-Mini: Use Fewer Learning Rates to Gain More Yushun Zhang, Congliang Chen, Ziniu Li, Tian Ding, Chenwei Wu, Yinyu Ye, Zhi-Quan Luo, Ruoyu Sun

PDF OpenReview

AdaMeM: Memory Efficient Momentum for Adafactor Nikhil Vyas, Depen Morwani, Sham M. Kakade

PDF OpenReview

AdaNF: Quantization Group Adaptive NormalFloat for Low Bit Fine-Tuning of LLMs Yeojoon Youn, Sehoon Kim, Suhong Moon, Sang Keun Choe, Ce Zhang

PDF OpenReview

Adapting LLM Agents with Universal Feedback in Communication Kuan Wang, Yadong Lu, Michael Santacroce, Yeyun Gong, Chao Zhang, Yelong Shen

PDF OpenReview

Adaptive $q$-Network: On-the-Fly Target Selection for Deep Reinforcement Learning Théo Vincent, Fabian Wahren, Jan Peters, Boris Belousov, Carlo D'Eramo

PDF OpenReview

Adaptive Concept Bottleneck for Foundation Models Jihye Choi, Jayaram Raghuram, Yixuan Li, Suman Banerjee, Somesh Jha

PDF OpenReview

Adaptive Experimental Design for Policy Learning: Contextual Best Arm Identification Masahiro Kato, Kyohei Okumura, Takuya Ishihara, Toru Kitagawa

PDF OpenReview

Adaptive Foundation Models for Online Decisions: HyperAgent with Fast Incremental Uncertainty Estimation Yingru Li, Jiawei Xu, Zhi-Quan Luo

PDF OpenReview

Adaptive Model Pruning in Federated Learning Through Loss Exploration Christian Internò, Elena Raponi, Niki van Stein, Thomas Bäck, Markus Olhofer, Yaochu Jin, Barbara Hammer

PDF OpenReview

Adaptive Sampling for Continuous Group Equivariant Neural Networks Berfin Inal, Gabriele Cesa

PDF OpenReview

Adaptive Two-Level Quasi-Monte Carlo for Soft Actor-Critic Du Ouyang, Zhenpeng Shi, Aodong Guo, Huaze Tang, Hejin Wang, Chao Wang, Wenbo Ding

PDF OpenReview

AdaptiveBackdoor: Backdoored Language Model Agents That Detect Human Overseers Heng Wang, Ruiqi Zhong, Jiaxin Wen, Jacob Steinhardt

PDF OpenReview

AdaptiveBackdoor: Backdoored Language Model Agents That Detect Human Overseers Heng Wang, Ruiqi Zhong, Jiaxin Wen, Jacob Steinhardt

PDF OpenReview

AdsorbDiff: Adsorbate Placement via Conditional Denoising Diffusion Adeesh Kolluru, John R. Kitchin

PDF OpenReview

Advancing LLM Reasoning Generalists with Preference Trees Lifan Yuan, Ganqu Cui, Hanbin Wang, Ning Ding, Xingyao Wang, Jia Deng, Boji Shan, Huimin Chen, Ruobing Xie, Yankai Lin, Zhenghao Liu, Bowen Zhou, Hao Peng, Zhiyuan Liu, Maosong Sun

PDF OpenReview

Advantage Alignment Algorithms Juan Agustin Duque, Milad Aghajohari, Tim Cooijmans, Tianyu Zhang, Aaron Courville

PDF OpenReview

Adversarial Circuit Evaluation Niels uit de Bos, Adrià Garriga-Alonso

PDF OpenReview

Adversarial Multi-Dueling Bandits Pratik Gajane

PDF OpenReview

Adversarial Robustness Limits via Scaling-Law and Human-Alignment Studies Brian R. Bartoldson, James Diffenderfer, Konstantinos Parasyris, Bhavya Kailkhura

PDF OpenReview

Adversarial Robustness Limits via Scaling-Law and Human-Alignment Studies Brian R. Bartoldson, James Diffenderfer, Konstantinos Parasyris, Bhavya Kailkhura

PDF OpenReview

Adversarial Training with Synthesized Data: A Path to Robust and Generalizable Neural Networks Reza Bayat, Irina Rish

PDF OpenReview

Adversarially Robust CLIP Models Induce Better (Robust) Perceptual Metrics Francesco Croce, Christian Schlarmann, Naman Deep Singh, Matthias Hein

PDF OpenReview

AI Agents with Formal Security Guarantees Mislav Balunovic, Luca Beurer-Kellner, Marc Fischer, Martin Vechev

PDF OpenReview

AI Alignment with Changing and Influenceable Reward Functions Micah Carroll, Davis Foote, Anand Siththaranjan, Stuart Russell, Anca Dragan

PDF OpenReview

AI Alignment with Changing and Influenceable Reward Functions Micah Carroll, Davis Foote, Anand Siththaranjan, Stuart Russell, Anca Dragan

PDF OpenReview

AI for an Inverse Problem: Physical Model Solving Quantum Gravity Koji Hashimoto, Koshiro Matsuo, Masaki Murata, Gakuto Ogiwara, Daichi Takeda

PDF OpenReview

Aligned Diffusion Models for Retrosynthesis Najwa Laabid, Severi Rissanen, Markus Heinonen, Arno Solin, Vikas Garg

PDF OpenReview

Aligned Diffusion Models for Retrosynthesis Najwa Laabid, Severi Rissanen, Markus Heinonen, Arno Solin, Vikas Garg

PDF OpenReview

Aligning Crowd Feedback via Distributional Preference Reward Modeling Dexun Li, Cong Zhang, Kuicai Dong, Derrick Goh Xin Deik, Ruiming Tang, Yong Liu

PDF OpenReview

Aligning Large Language Models with Representation Editing: A Control Perspective Lingkai Kong, Haorui Wang, Wenhao Mu, Yuanqi Du, Yuchen Zhuang, Yifei Zhou, Yue Song, Rongzhi Zhang, Kai Wang, Chao Zhang

PDF OpenReview

Alignment Calibration: Machine Unlearning for Contrastive Learning Under Auditing Yihan Wang, Yiwei Lu, Guojun Zhang, Franziska Boenisch, Adam Dziedzic, Yaoliang Yu, Xiao-Shan Gao

PDF OpenReview

Alignment Is All You Need: A Training-Free Augmentation Strategy for Pose-Guided Video Generation XiaoyuJin, Zunnan Xu, Mingwen Ou, Wenming Yang

PDF OpenReview

Alignment of MPNNs and Graph Transformers Bao Nguyen, Anjana Yodaiken, Petar Veličković

PDF OpenReview

All Roads Lead to Rome? Exploring Representational Similarities Between Latent Spaces of Generative Image Models Charumathi Badrinath, Usha Bhalla, Alex Oesterling, Suraj Srinivas, Himabindu Lakkaraju

PDF OpenReview

Altared Environments: The Role of Normative Infrastructure in AI Alignment Rakshit Trivedi, Nikhil Chandak, Carter Blair, Atrisha Sarkar, Tehilla Weltman, Dylan Hadfield-Menell, Gillian K Hadfield

PDF OpenReview

AMBER: An Entropy Maximizing Environment Design Algorithm for Inverse Reinforcement Learning Paul Nitschke, Lars Lien Ankile, Eura Nofshin, Siddharth Swaroop, Finale Doshi-Velez, Weiwei Pan

PDF OpenReview

Amortized Active Causal Induction with Deep Reinforcement Learning Yashas Annadani, Panagiotis Tigas, Stefan Bauer, Adam Foster

PDF OpenReview

Amortized Probabilistic Detection of Communities in Graphs Yueqi Wang, Yoonho Lee, Pallab Basu, Juho Lee, Yee Whye Teh, Liam Paninski, Ari Pakman

PDF OpenReview

An Advanced Physics-Informed Neural Operator for Comprehensive Design Optimization of Highly-Nonlinear Systems: An Aerospace Composites Processing Case Study Milad Ramezankhani, Anirudh Deodhar, Rishi Yash Parekh, Dagnachew Birru

PDF OpenReview

An Adversarial Example for Direct Logit Attribution: Memory Management in GELU-4L Jett Janiak, Can Rager, James Dao, Yeu-Tong Lau

PDF OpenReview

An Analytical Approach to Enhancing DNN Efficiency and Accuracy Using Approximate Multiplication Salar Shakibhamedan, Anice Jahanjoo, Amin Aminifar, Nima Amirafshar, Nima TaheriNejad, Axel Jantsch

PDF OpenReview

An Auditing Test to Detect Behavioral Shift in Language Models Leo Richter, Nitin Agrawal, Xuanli He, Pasquale Minervini, Matt Kusner

PDF OpenReview

An Embodied Generalist Agent in 3D World Jiangyong Huang, Silong Yong, Xiaojian Ma, Xiongkun Linghu, Puhao Li, Yan Wang, Qing Li, Song-Chun Zhu, Baoxiong Jia, Siyuan Huang

PDF OpenReview

An Empirical Study into Clustering of Unseen Datasets with Self-Supervised Foundation Models Scott C. Lowe, Joakim Bruslund Haurum, Sageev Oore, Thomas B. Moeslund, Graham W. Taylor

PDF OpenReview

An Equivariant Flow Matching Framework for Learning Molecular Crystallization Shengchao Liu, Liang Yan, Hongyu Guo, Anima Anandkumar

PDF OpenReview

An Exactly Solvable Model for Emergence and Scaling Laws Yoonsoo Nam, Nayara Fonseca, Seok Hyeong Lee, Chris Mingard, Ard A. Louis

PDF OpenReview

An In-Context Learning Theoretic Analysis of Chain-of-Thought Chenxiao Yang, Zhiyuan Li, David Wipf

PDF OpenReview

An Information-Theoretic Study of Lying in LLMs Ann-Kathrin Dombrowski, Guillaume Corlouer

PDF OpenReview

An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models Yangchen Pan, Junfeng Wen, Chenjun Xiao, Philip Torr

PDF OpenReview

Analysing Feature Learning of Gradient Descent Using Periodic Functions Jaehui Hwang, Taeyoung Kim, Hongseok Yang

PDF OpenReview

Analysis of Atom-Level Pretraining with QM Data for Graph Neural Networks Molecular Property Models Jose Arjona-Medina, Ramil Nugmanov

PDF OpenReview

Analyzing & Eliminating Learning Rate Warmup in GPT Pre-Training Atli Kosson, Bettina Messmer, Martin Jaggi

PDF OpenReview

Analyzing and Improving Surrogate Gradient Training in Binary Neural Networks Using Dynamical Systems Theory Rainer Engelken, Larry Abbott

PDF OpenReview

Analyzing GFlowNets: Stability, Expressiveness, and Assessment Tiago Silva, Eliezer de Souza da Silva, Rodrigo Barreto Alves, Luiz Max Carvalho, Amauri H Souza, Samuel Kaski, Vikas Garg, Diego Mesquita

PDF OpenReview

Analyzing the Generalization and Reliability of Steering Vectors Daniel Chee Hian Tan, David Chanin, Aengus Lynch, Adrià Garriga-Alonso, Dimitrios Kanoulas, Brooks Paige, Robert Kirk

PDF OpenReview

Anthropocentric Bias and the Possibility of Artificial Cognition Raphaël Millière, Charles Rathkopf

PDF OpenReview

Antigen-Specific Antibody Design via Direct Energy-Based Preference Optimization Xiangxin Zhou, Dongyu Xue, Ruizhe Chen, Zaixiang Zheng, Liang Wang, Quanquan Gu

PDF OpenReview

Approximate Natural Gradient in Gaussian Processes with Non-Log-Concave Likelihoods Marcelo Hartmann

PDF OpenReview

Are Large Language Models Chameleons? Mingmeng Geng, Sihong He, Roberto Trotta

PDF OpenReview

Are Protein Language Models Compute Optimal? Yaiza Serrano, Alvaro Ciudad Serrano, Alexis Molina

PDF OpenReview

AROMA: Preserving Spatial Structure for Latent PDE Modeling with Local Neural Fields Louis Serrano, Thomas X Wang, Etienne Le Naour, Jean-Noël Vittaut, Patrick Gallinari

PDF OpenReview

AsEP: Benchmarking Deep Learning Methods for Antibody-Specific Epitope Prediction ChuNan Liu, Lilian Denzler, Yihong Chen, Brooks Paige, Andrew CR Martin

PDF OpenReview

Assessing the Viability of Generative Modeling in Simulated Astronomical Observations Patrick Janulewicz, Laurence Perreault-Levasseur, Tracy Webb

PDF OpenReview

Assessing the Zero-Shot Capabilities of LLMs for Action Evaluation in RL Eduardo Pignatelli, Johan Ferret, Davide Paglieri, Samuel Coward, Tim Rocktäschel, Edward Grefenstette, Laura Toni

PDF OpenReview

AssistanceZero: Scalably Solving Assistance Games Cassidy Laidlaw, Eli Bronstein, Timothy Guo, Dylan Feng, Lukas Berglund, Justin Svegliato, Stuart Russell, Anca Dragan

PDF OpenReview

AstroPT: Scaling Large Observation Models for Astronomy Michael J. Smith, Ryan J. Roberts, Eirini Angeloudi, Marc Huertas-Company

PDF OpenReview

Asymptotic Dynamics for Delayed Feature Learning in a Toy Model Blake Bordelon, Tanishq Kumar, Samuel J. Gershman, Cengiz Pehlevan

PDF OpenReview

Asynchronous Local-SGD Training for Language Modeling Bo Liu, Rachita Chhaparia, Arthur Douillard, Satyen Kale, Andrei Alex Rusu, Jiajun Shen, Arthur Szlam, MarcAurelio Ranzato

PDF OpenReview

Asynchrony Invariance Loss Functions for Graph Neural Networks Pablo Monteagudo-Lago, Arielle Rosinski, Andrew Joseph Dudzik, Petar Veličković

PDF OpenReview

Attacking Large Language Models with Projected Gradient Descent Simon Geisler, Tom Wollschläger, M. H. I. Abdalla, Johannes Gasteiger, Stephan Günnemann

PDF OpenReview

Attention Is All You Need but You Don’t Need All of It for Inference of Large Language Models Georgy Tyukin, Gbetondji Jean-Sebastien Dovonon, Jean Kaddour, Pasquale Minervini

PDF OpenReview

Attention with Markov: A Curious Case of Single-Layer Transformers Ashok Vardhan Makkuva, Marco Bondaschi, Alliot Nagle, Adway Girish, Hyeji Kim, Martin Jaggi, Michael Gastpar

PDF OpenReview

Augmenting Evolutionary Models with Structure-Based Retrieval Yining Huang, Zuobai Zhang, Jian Tang, Debora Susan Marks, Pascal Notin

PDF OpenReview

AutoGuide: Automated Generation and Selection of Context-Aware Guidelines for Large Language Model Agents Yao Fu, Dong-Ki Kim, Jaekyeom Kim, Sungryull Sohn, Lajanugen Logeswaran, Kyunghoon Bae, Honglak Lee

PDF OpenReview

Automatic Domain Adaptation by Transformers in In-Context Learning Ryuichiro Hataya, Kota Matsui, Masaaki Imaizumi

PDF OpenReview

Automatic Jailbreaking of the Text-to-Image Generative AI Systems Minseon Kim, Hyomin Lee, Boqing Gong, Huishuai Zhang, Sung Ju Hwang

PDF OpenReview

Automatic Pseudo-Harmful Prompt Generation for Evaluating False Refusals in Large Language Models Bang An, Sicheng Zhu, Ruiyi Zhang, Michael-Andrei Panaitescu-Liess, Yuancheng Xu, Furong Huang

PDF OpenReview

Automatically Identifying Local and Global Circuits with Linear Computation Graphs Xuyang Ge, Fukang Zhu, Wentao Shu, Junxuan Wang, Zhengfu He, Xipeng Qiu

PDF OpenReview

Baba Is AI: Break the Rules to Beat the Benchmark Nathan Cloos, Meagan Jens, Michelangelo Naim, Yen-Ling Kuo, Ignacio Cases, Andrei Barbu, Christopher J Cueva

PDF OpenReview

Babysit a Language Model from Scratch: Interactive Language Learning by Trials and Demonstrations Ziqiao Ma, Zekun Wang, Joyce Chai

PDF OpenReview

BAM! Just like That: Simple and Efficient Parameter Upcycling for Mixture of Experts Qizhen Zhang, Nikolas Gritsch, Dwaraknath Gnaneshwar, Simon Guo, David Cairuz, Bharat Venkitesh, Jakob Nicolaus Foerster, Phil Blunsom, Sebastian Ruder, Ahmet Üstün, Acyr Locatelli

PDF OpenReview

Bandits with Abstention Under Expert Advice Stephen Pasteris, Alberto Rumi, Maximilian Thiessen, Shota Saito, Atsushi Miyauchi, Fabio Vitale, Mark Herbster

PDF OpenReview

Bandits with Preference Feedback: A Stackelberg Game Perspective Barna Pásztor, Parnian Kassraie, Andreas Krause

PDF OpenReview

Base-Change at Prediction: Inference-Time Update of Fine-Tuned Models Daiki Chijiwa, Taku Hasegawa, Kyosuke Nishida, Kuniko Saito, Susumu Takeuchi

PDF OpenReview

Batch Learning via Log-Sum-Exponential Estimator from Logged Bandit Feedback Armin Behnamnia, Gholamali Aminian, Alireza Aghaei, Chengchun Shi, Vincent Y. F. Tan, Hamid R. Rabiee

PDF OpenReview

Batch-Effect Invariant Graph Neural Networks for Predicting Chemotherapy Response in Triple-Negative Breast Cancer Patients Asif Khan, Giuseppe Torrisi, Luciana Luque, Claudia Owczarek, Maddy Parsons, Chris Sander, Linus Schumacher

PDF OpenReview

Batched Fixed-Confidence Pure Exploration for Bandits with Switching Constraints Newton Mwai, Milad Malekipirbazari, Fredrik D. Johansson

PDF OpenReview

Bayesian Optimization for the Discovery of Redox Active Quinones Giacomo De Gobbi, Reyhan Yagmur, Janine Maier, Stefan Spirk, Robert Peharz

PDF OpenReview

Bayesian Reward Models for LLM Alignment Adam X. Yang, Maxime Robeyns, Thomas Coste, Zhengyan Shi, Jun Wang, Haitham Bou Ammar, Laurence Aitchison

PDF OpenReview

Bayesian-LoRA: LoRA Based Parameter Efficient Fine-Tuning Using Optimal Quantization Levels and Rank Values Trough Differentiable Bayesian Gates Cristian Meo, Ksenia Sycheva, Anirudh Goyal, Justin Dauwels

PDF OpenReview

BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents That Solve Fuzzy Tasks Stephanie Milani, Anssi Kanervisto, Karolis Jucys, Sander V Schulhoff, Brandon Houghton, Rohin Shah

PDF OpenReview

Behavior Generation with Latent Actions Seungjae Lee, Yibin Wang, Haritheja Etukuru, H. Jin Kim, Nur Muhammad Mahi Shafiullah, Lerrel Pinto

PDF OpenReview

Behavioral Bias of Vision-Language Models: A Behavioral Finance View Yuhang Xiao, Yudilin, Ming-Chang Chiu

PDF OpenReview

BELLS: A Framework Towards Future Proof Benchmarks for the Evaluation of LLM Safeguards Diego Dorn, Alexandre Variengien, Charbel-Raphael Segerie, Vincent Corruble

PDF OpenReview

Benchmarking Autoregressive Conditional Diffusion Models for Turbulent Flow Simulation Georg Kohl, Liwei Chen, Nils Thuerey

PDF OpenReview

Benchmarking Mental State Representations in Language Models Matteo Bortoletto, Constantin Ruhdorfer, Lei Shi, Andreas Bulling

PDF OpenReview

Benchmarking Probabilistic Machine Learning in Protein FItness Landscape Predictions Ningning Chen, Wenkai Han, Sai T. Reddy

PDF OpenReview

Benchmarking Robust Self-Supervised Learning Across Diverse Downstream Tasks Antoni Kowalczuk, Jan Dubiński, Atiyeh Ashari Ghomi, Yi Sui, George Stein, Jiapeng Wu, Jesse C. Cresswell, Franziska Boenisch, Adam Dziedzic

PDF OpenReview

Benchmarking Uncertainty Disentanglement: Specialized Uncertainties for Specialized Tasks Bálint Mucsányi, Michael Kirchhof, Seong Joon Oh

PDF OpenReview

BenchMARL: Benchmarking Multi-Agent Reinforcement Learning Matteo Bettini, Amanda Prorok, Vincent Moens

PDF OpenReview

Beyond Model Collapse: Scaling up with Synthesized Data Requires Reinforcement Yunzhen Feng, Elvis Dohmatob, Pu Yang, Francois Charton, Julia Kempe

PDF OpenReview

Beyond Thumbs Up/Down: Untangling Challenges of Fine-Grained Feedback for Text-to-Image Generation Katherine M. Collins, Najoung Kim, Yonatan Bitton, Verena Rieser, Shayegan Omidshafiei, Yushi Hu, Sherol Chen, Senjuti Dutta, Minsuk Chang, Kimin Lee, Youwei Liang, Georgina Evans, Sahil Singla, Gang Li, Adrian Weller, Junfeng He, Deepak Ramachandran, Krishnamurthy Dj Dvijotham

PDF OpenReview

Bias Begets Bias: The Impact of Biased Embeddings on Diffusion Models Sahil Kuchlous, Marvin Li, Jeffrey George Wang

PDF OpenReview

Bias Transmission in Large Language Models: Evidence from Gender-Occupation Bias in GPT-4 Kirsten Morehouse, Weiwei Pan, Juan Manuel Contreras, Mahzarin R. Banaji

PDF OpenReview

Bias-Inducing Geometries: Exactly Solvable Data Model with Fairness Implications Stefano Sarao Mannelli, Federica Gerace, Negar Rostamzadeh, Luca Saglietti

PDF OpenReview

Bidirectional Consistency Models Liangchen Li, Jiajun He

PDF OpenReview

Bigger, Regularized, Optimistic: Scaling for Compute and Sample-Efficient Continuous Control Michal Nauman, Mateusz Ostaszewski, Krzysztof Jankowski, Piotr Miłoś, Marek Cygan

PDF OpenReview

Bilevel Optimization with Lower-Level Contextual MDPs Vinzenz Thoma, Barna Pásztor, Andreas Krause, Giorgia Ramponi, Yifan Hu

PDF OpenReview

Bilingual Adaptation of Monolingual Foundation Models Gurpreet Gosal, Yishi Xu, Gokulakrishnan Ramakrishnan, Rituraj Joshi, Avraham Sheinin, Zhiming Chen, Biswajit Mishra, Sunil Kumar Sahu, Neha Sengupta, Natalia Vassilieva, Joel Hestness, Samujjwal Ghosh, Bokang Jia, Onkar Arun Pandit, Satheesh Katipomu, Samta Kamboj, Rahul Pal, Parvez Mullah, Soundar Balaji Doraiswamy, Karim Chami, Preslav Nakov

PDF OpenReview

BioinformaticsBench: A Collaboratively Built Large Language Model Benchmark for Bioinformatics Reasoning Varuni Sarwal, Seungmo Lee, Rosemary He, Aingela Kattapuram, Xiaoxuan Wang, Eleazar Eskin, Wei Wang, Serghei Mangul

PDF OpenReview

BiPer: Binary Neural Networks Using a Periodic Function Edwin Vargas, Claudia V. Correa, Carlos Hinojosa, Henry Arguello

PDF OpenReview

Bisimulation Metrics Are Optimal Transport Distances, and Can Be Computed Efficiently Sergio Calo, Anders Jonsson, Gergely Neu, Ludovic Schwartz, Javier Segovia-Aguas

PDF OpenReview

Black-Box Detection of Language Model Watermarks Thibaud Gloaguen, Nikola Jovanović, Robin Staab, Martin Vechev

PDF OpenReview

Black-Box Detection of Language Model Watermarks Thibaud Gloaguen, Nikola Jovanović, Robin Staab, Martin Vechev

PDF OpenReview

Block Verification Accelerates Speculative Decoding Ziteng Sun, Uri Mendlovic, Yaniv Leviathan, Asaf Aharoni, Ahmad Beirami, Jae Hun Ro, Ananda Theertha Suresh

PDF OpenReview

BMapEst: Estimation of Brain Tissue Probability Maps Using a Differentiable MRI Simulator Utkarsh Gupta, Emmanouil Nikolakakis, Moritz Zaiss, Razvan Marinescu

PDF OpenReview

BOFormer: Learning to Solve Multi-Objective Bayesian Optimization via Non-Markovian RL Yu Heng Hung, Kai-Jie Lin, Yu-Heng Lin, Chien-Yi Wang, Ping-Chun Hsieh

PDF OpenReview

Boolean Logic for Low-Energy Deep Learning Van Minh Nguyen, Cristian Ocampo, Aymen Askri, Ba-Hien Tran

PDF OpenReview

Boost Your Crystal Model with Denoising Pre-Training Shuaike Shen, Ke Liu, Muzhi Zhu, Hao Chen

PDF OpenReview

Bootstrapping Language Models with DPO Implicit Rewards Changyu Chen, Zichen Liu, Chao Du, Tianyu Pang, Qian Liu, Arunesh Sinha, Pradeep Varakantham, Min Lin

PDF OpenReview

Borrowing Treasures from Neighbors: In-Context Learning for Multimodal Learning with Missing Modalities and Data Scarcity Zhuo Zhi, Ziquan Liu, Moe Elbadawi, Adam Daneshmend, Mine Orlu, Abdul W Basit, Andreas Demosthenous, Miguel R. D. Rodrigues

PDF OpenReview

Boundary Between Noise and Information Applied to Filtering Neural Network Weight Matrices Max Staats, Matthias Thamm, Bernd Rosenow

PDF OpenReview

BPNAS: Bayesian Progressive Neural Architecture Search Hyunwoong Chang, Anirban Samaddar, Sandeep Madireddy

PDF OpenReview

Bridging Distributional and Risk-Sensitive Reinforcement Learning: Balancing Statistical, Computational, and Risk Considerations Hao Liang

PDF OpenReview

Bridging Distributionally Robust Learning and Offline RL: An Approach to Mitigate Distribution Shift and Partial Data Coverage Kishan Panaganti, Zaiyan Xu, Dileep Kalathil, Mohammad Ghavamzadeh

PDF OpenReview

BUILD: Buffer-Free Incremental Learning with OOD Detection for the Wild Srishti Gupta, Daniele Angioni, Lea Schönherr, Ambra Demontis, Battista Biggio

PDF OpenReview

Bundle Neural Networks for Message Diffusion on Graphs Jacob Bamberger, Federico Barbero, Xiaowen Dong, Michael M. Bronstein

PDF OpenReview

CADO: Cost-Aware Diffusion Solvers for Combinatorial Optimization Through RL Fine-Tuning Deunsol Yoon, Hyungseok Song, Kanghoon Lee, Woohyung Lim

PDF OpenReview

Caduceus: Bi-Directional Equivariant Long-Range DNA Sequence Modeling Yair Schiff, Chia Hsiang Kao, Aaron Gokaslan, Tri Dao, Albert Gu, Volodymyr Kuleshov

PDF OpenReview

Caduceus: Bi-Directional Equivariant Long-Range DNA Sequence Modeling Yair Schiff, Chia Hsiang Kao, Aaron Gokaslan, Tri Dao, Albert Gu, Volodymyr Kuleshov

PDF OpenReview

Caduceus: Bi-Directional Equivariant Long-Range DNA Sequence Modeling Yair Schiff, Chia Hsiang Kao, Aaron Gokaslan, Tri Dao, Albert Gu, Volodymyr Kuleshov

PDF OpenReview

Calibrated Self-Rewarding Vision Language Models Yiyang Zhou, Zhiyuan Fan, Dongjie Cheng, Sihan Yang, Zhaorun Chen, Chenhang Cui, Xiyao Wang, Yun Li, Linjun Zhang, Huaxiu Yao

PDF OpenReview

CAMELoT: Towards Large Language Models with Training-Free Consolidated Associative Memory Zexue He, Leonid Karlinsky, Donghyun Kim, Julian McAuley, Dmitry Krotov, Rogerio Feris

PDF OpenReview

Can Editing LLMs Inject Harm? Canyu Chen, Baixiang Huang, Zekun Li, Zhaorun Chen, Shiyang Lai, Xiongxiao Xu, Jia-Chen Gu, Jindong Gu, Huaxiu Yao, Chaowei Xiao, Xifeng Yan, William Yang Wang, Philip Torr, Dawn Song, Kai Shu

PDF OpenReview

Can Go AIs Be Adversarially Robust? Tom Tseng, Euan McLean, Kellin Pelrine, Tony Tong Wang, Adam Gleave

PDF OpenReview

Can Language Models Safeguard Themselves, Instantly and for Free? Dyah Adila, Changho Shin, Yijing Zhang, Frederic Sala

PDF OpenReview

Can Large Language Models Explore In-Context? Akshay Krishnamurthy, Keegan Harris, Dylan J Foster, Cyril Zhang, Aleksandrs Slivkins

PDF OpenReview

Can Learned Optimization Make Reinforcement Learning Less Difficult? Alexander D. Goldie, Chris Lu, Matthew Thomas Jackson, Shimon Whiteson, Jakob Nicolaus Foerster

PDF OpenReview

Can LLMs Enhance Performance Prediction for Deep Learning Models? Karthick Panner Selvam, Phitchaya Mangpo Phothilimthana, Sami Abu-El-Haija, Bryan Perozzi, Mats Brorsson

PDF OpenReview

Can LLMs Predict the Convergence of Stochastic Gradient Descent? Oussama Zekri, Abdelhakim Benechehab, Ievgen Redko

PDF OpenReview

Can Mamba In-Context Learn Task Mixtures? Yingcong Li, Xupeng Wei, Haonan Zhao, Taigao Ma

PDF OpenReview

Can Models Learn Skill Composition from Examples? Haoyu Zhao, Simran Kaur, Dingli Yu, Anirudh Goyal, Sanjeev Arora

PDF OpenReview

Can Transformers Solve Least Squares to High Precision? Jerry Weihong Liu, Jessica Grogan, Owen M Dugan, Simran Arora, Atri Rudra, Christopher Re

PDF OpenReview

Can Transformers Solve Least Squares to High Precision? Jerry Weihong Liu, Jessica Grogan, Owen M Dugan, Simran Arora, Atri Rudra, Christopher Re

PDF OpenReview

Can Watermarking Large Language Models Prevent Copyrighted Text Generation and Hide Training Data? Michael-Andrei Panaitescu-Liess, Zora Che, Bang An, Yuancheng Xu, Pankayaraj Pathmanathan, Souradip Chakraborty, Sicheng Zhu, Tom Goldstein, Furong Huang

PDF OpenReview

CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models Peng Xia, Ze Chen, Juanxi Tian, Gong Yangrui, Ruibo Hou, Yue Xu, Zhenbang Wu, Zhiyuan Fan, Yiyang Zhou, Kangyu Zhu, Wenhao Zheng, Zhaoyang Wang, Xiao Wang, Xuchao Zhang, Chetan Bansal, Marc Niethammer, Junzhou Huang, Hongtu Zhu, Yun Li, Jimeng Sun, Zongyuan Ge, Gang Li, James Zou, Huaxiu Yao

PDF OpenReview

Cascade Reward Sampling for Efficient Decoding-Time Alignment Bolian Li, Yifan Wang, Ananth Grama, Ruqi Zhang

PDF OpenReview

Catastrophic Goodhart: Regularizing RLHF with KL Divergence Does Not Mitigate Heavy-Tailed Reward Misspecification Thomas Kwa, Drake Thomas, Adrià Garriga-Alonso

PDF OpenReview

Causal Bandits: The Pareto Optimal Frontier of Adaptivity, a Reduction to Linear Bandits, and Limitations Around Unknown Marginals Ziyi Liu, Idan Attias, Daniel M. Roy

PDF OpenReview

Causal Discovery over High-Dimensional Structured Hypothesis Spaces with Causal Graph Partitioning Ashka Shah, Adela Frances DePavia, Nathaniel C Hudson, Ian Foster, Rick Stevens

PDF OpenReview

CD-POS: Long Context Generalization in LLMs Through Continuous and Discrete Position Synthesis Zhiyuan Hu, Yuliang Liu, Jinman Zhao, Suyuchen Wang, WangYan, Wei Shen, Chao Yin, Bryan Hooi

PDF OpenReview

Cell Morphology-Guided Small Molecule Generation with GFlowNets Stephen Zhewen Lu, Ziqing Lu, Ehsan Hajiramezanali, Tommaso Biancalani, Yoshua Bengio, Gabriele Scalia, Michał Koziarski

PDF OpenReview

Cell Morphology-Guided Small Molecule Generation with GFlowNets Stephen Zhewen Lu, Ziqing Lu, Ehsan Hajiramezanali, Tommaso Biancalani, Yoshua Bengio, Gabriele Scalia, Michał Koziarski

PDF OpenReview

Cell Morphology-Guided Small Molecule Generation with GFlowNets Stephen Zhewen Lu, Ziqing Lu, Ehsan Hajiramezanali, Tommaso Biancalani, Yoshua Bengio, Gabriele Scalia, Michał Koziarski

PDF OpenReview

CellFlows: Inferring Splicing Kinetics from Latent and Mechanistic Cellular Dynamics Sei Chang, Zaiqian Chen, Bianca Dumitrascu, David A. Knowles

PDF OpenReview

Certifiably Robust RAG Against Retrieval Corruption Chong Xiang, Tong Wu, Zexuan Zhong, David Wagner, Danqi Chen, Prateek Mittal

PDF OpenReview

Certified Robustness in NLP Under Bounded Levenshtein Distance Elias Abad Rocamora, Grigorios Chrysos, Volkan Cevher

PDF OpenReview

Certifying Robustness to Adaptive Data Poisoning Avinandan Bose, Madeleine Udell, Laurent Lessard, Maryam Fazel, Krishnamurthy Dj Dvijotham

PDF OpenReview

CGMTorch: A Framework for Gradient-Based Design of Computational Granular Metamaterials Atoosa Parsa, Corey OHern, Rebecca Kramer-Bottiglio, Josh Bongard

PDF OpenReview

Chain of LoRA: Efficient Fine-Tuning of Language Models via Residual Learning Wenhan Xia, Chengwei Qin, Elad Hazan

PDF OpenReview

Chained Information-Theoretic Bounds and Tight Regret Rate for Linear Bandit Problems Amaury Gouverneur, Borja Rodríguez Gálvez, Tobias Oechtering, Mikael Skoglund

PDF OpenReview

Chained Tuning Leads to Biased Forgetting Megan Ung, Alicia Yi Sun, Samuel Bell, Levent Sagun, Adina Williams

PDF OpenReview

Chained Tuning Leads to Biased Forgetting Megan Ung, Alicia Yi Sun, Samuel Bell, Levent Sagun, Adina Williams

PDF OpenReview

Challenges in Mechanistically Interpreting Model Representations Satvik Golechha, James Dao

PDF OpenReview

Characterizing Prompt Compression Methods for Long Context Inference Siddharth Jha, Lutfi Eren Erdogan, Sehoon Kim, Kurt Keutzer, Amir Gholami

PDF OpenReview

CharED: Character-Wise Ensemble Decoding for Large Language Models Kevin Gu, Eva Tuecke, Dmitriy A Katz, Raya Horesh, David Alvarez-Melis, Mikhail Yurochkin

PDF OpenReview

Chemical Language Modeling with Structured State Spaces Rıza Özçelik, Sarah de Ruiter, Emanuele Criscuolo, Francesca Grisoni

PDF OpenReview

CLAM: Unifying Finetuning, Quantization, and Pruning by Chaining LLM Adapter Modules Neelay Velingker, Jason Liu, Amish Sethi, William Dodds, Zhiqiu Xu, Saikat Dutta, Mayur Naik, Eric Wong

PDF OpenReview

Class-Aware Initialization of Early Exits for Pre-Training Large Language Models Alperen Gormez, Erdem Koyuncu

PDF OpenReview

Classification of Freshwater Snails of the Genus Radomaniola with Multimodal Triplet Networks Dennis Vetter, Muhammad Ahsan, Diana Delicado, Thomas A. Neubauer, Thomas Wilke, Gemma Roig

PDF OpenReview

Closed Form of the Hessian Spectrum for Some Neural Networks Sidak Pal Singh, Thomas Hofmann

PDF OpenReview

Closed-Form Test Functions for Biophysical Sequence Optimization Algorithms Samuel Don Stanton, Robert G Alberstein, Nathan C. Frey, Andrew Martin Watkins, Kyunghyun Cho

PDF OpenReview

Cluster-Norm for Unsupervised Probing of Knowledge Walter Laurito, Sharan Maiya, Grégoire Dhimoïla, Owen Ho Wan Yeung, Kaarel Hänni

PDF OpenReview

CO2: Precise Attention Score Observation for Improving KV Cache Replacement in Large Language Model Meguru Yamazaki, Shivaram Venkataraman

PDF OpenReview

Coarse-to-Fine Semi-Structured Pruning of Graph Convolutional Networks for Skeleton-Based Recognition Hichem Sahbi

PDF OpenReview

Code Agents Are State of the Art Software Testers Niels Mündler, Mark Niklas Mueller, Jingxuan He, Martin Vechev

PDF OpenReview

Code Agents Are State of the Art Software Testers Niels Mündler, Mark Niklas Mueller, Jingxuan He, Martin Vechev

PDF OpenReview

CodonMPNN for Organism Specific and Codon Optimal Inverse Folding Hannes Stark, Umesh Padia, Julia Balla, Cameron Diao

PDF OpenReview

CodonMPNN for Organism Specific and Codon Optimal Inverse Folding Hannes Stark, Umesh Padia, Julia Balla, Cameron Diao

PDF OpenReview

CogErgLLM: Exploring Large Language Model Systems Design Perspective Using Cognitive Ergonomics Azmine Toushik Wasi

PDF OpenReview

Cognitive Assessment of Language Models Daniel McDuff, David Munday, Xin Liu, Isaac Galatzer-Levy

PDF OpenReview

Cognitive Flexibility of Large Language Models Sean M Kennedy, Robert D Nowak

PDF OpenReview

Cognitive Modeling with Scaffolded LLMs: A Case Study of Referential Expression Generation Polina Tsvilodub, Michael Franke, Fausto Carcassi

PDF OpenReview

Collaborative Learning Under Strategic Behavior: Mechanisms for Eliciting Feedback in Principal-Agent Bandit Games Ramakrishnan K, Arpit Agarwal, Lakshminarayanan Subramanian, Maximilian Nickel

PDF OpenReview

Collective Variable Free Transition Path Sampling with Generative Flow Network Kiyoung Seong, Seonghyun Park, Seonghwan Kim, Woo Youn Kim, Sungsoo Ahn

PDF OpenReview

Collusion of Reinforcement Learning-Based Pricing Algorithms in Episodic Markets Paul Friedrich, Barna Pásztor, Giorgia Ramponi

PDF OpenReview

Color Style Transfer with Modulated Flows Maria Larchenko, Alexander Lobashev, Dmitry Guskov, Vladimir Vladimirovich Palyulin

PDF OpenReview

Combining Graph Attention and Recurrent Neural Networks in a Variational Autoencoder for Molecular Representation Learning and Drug Design Alex T. Müller, Kenneth Atz, Michael Reutlinger, Nicolas Zorn

PDF OpenReview

Combining Neural Networks and Symbolic Regression for Analytical Lyapunov Function Discovery Jie Feng, Haohan Zou, Yuanyuan Shi

PDF OpenReview

Combining Pre-Trained LoRA Modules Improves Few-Shot Adaptation of Foundation Models to New Tasks Nader Asadi, Mahdi Beitollahi, Yasser H. Khalil, Yinchuan Li, Guojun Zhang, Xi Chen

PDF OpenReview

Combining Reconstruction and Contrastive Methods for Multimodal Representations in RL Philipp Becker, Sebastian Mossburger, Fabian Otto, Gerhard Neumann

PDF OpenReview

Comgra: A Tool for Analyzing and Debugging Neural Networks Florian Dietz, Sophie Fellenz, Dietrich Klakow, Marius Kloft

PDF OpenReview

Communication Efficient Federated Learning with Differentiated Aggregation Peyman Gholami, Hulya Seferoglu

PDF OpenReview

Commute-Time-Optimised Graphs for GNNs Igor Sterner, Shiye Su, Petar Veličković

PDF OpenReview

Compact Proofs of Model Performance via Mechanistic Interpretability Jason Gross, Rajashree Agrawal, Thomas Kwa, Euan Ong, Chun Hei Yip, Alex Gibson, Soufiane Noubir, Lawrence Chan

PDF OpenReview

Comparing Bad Apples to Good Oranges: Aligning Large Language Models via Joint Preference Optimization Hritik Bansal, Ashima Suvarna, Gantavya Bhatt, Nanyun Peng, Kai-Wei Chang, Aditya Grover

PDF OpenReview

Comparing Comparisons: Informative and Easy Human Feedback with Distinguishability Queries Xuening Feng, Zhaohui Jiang, Timo Kaufmann, Eyke Hüllermeier, Paul Weng, Yifei Zhu

PDF OpenReview

Compatible Gradient Approximations for Actor-Critic Algorithms Baturay Saglam, Dionysis Kalogerias

PDF OpenReview

CompeteAI: Understanding the Competition Dynamics of Large Language Model-Based Agents Qinlin Zhao, Jindong Wang, Yixuan Zhang, Yiqiao Jin, Kaijie Zhu, Hao Chen, Xing Xie

PDF OpenReview

Composable Contracts for Multi-Agent Coordination Christy Chen, Louis Parker

PDF OpenReview

Compositional Communication with LLMs and Reasoning About Chemical Structures Dmitry Zubarev, Sarathkrishna Swaminathan

PDF OpenReview

Compress Then Serve: Serving Thousands of LoRA Adapters with Little Overhead Rickard Brüel Gabrielsson, Jiacheng Zhu, Onkar Bhardwaj, Leshem Choshen, Kristjan Greenewald, Mikhail Yurochkin, Justin Solomon

PDF OpenReview

Compressing the Latent Space of Single-Sequence Protein Predictors for Multimodal Generation Amy X. Lu, Wilson Yan, Vladimir Gligorijevic, Pieter Abbeel, Kevin K Yang, Nathan C. Frey

PDF OpenReview

Concept-Based Interpretable Reinforcement Learning with Limited to No Human Labels Zhuorui Ye, Stephanie Milani, Fei Fang, Geoffrey J. Gordon

PDF OpenReview

Concept-Based Interpretable Reinforcement Learning with Limited to No Human Labels Zhuorui Ye, Stephanie Milani, Fei Fang, Geoff Gordon

PDF OpenReview

Conditional Common Entropy for Instrumental Variable Testing and Partial Identification Ziwei Jiang, Murat Kocaoglu

PDF OpenReview

Conditional Flow Matching for Time Series Modelling Ella Tamir, Najwa Laabid, Markus Heinonen, Vikas Garg, Arno Solin

PDF OpenReview

Conditional Generative Models Are Sufficient to Sample from Any Causal Effect Estimand Md Musfiqur Rahman, Matt Jordan, Murat Kocaoglu

PDF OpenReview

Conditional Meta-Reinforcement Learning with State Representation Yuxuan Sun, Laura Toni, Yiannis Andreopoulos

PDF OpenReview

Confidence Regulation Neurons in Language Models Alessandro Stolfo, Ben Peng Wu, Wes Gurnee, Yonatan Belinkov, Xingyi Song, Mrinmaya Sachan, Neel Nanda

PDF OpenReview

Conformalized Credal Set Predictors Alireza Javanmardi, David Stutz, Eyke Hüllermeier

PDF OpenReview

Consistency Checks for Language Model Forecasters Abhimanyu Pallavi Sudhir, Alejandro Alvarez, Adam Shen, Daniel Paleka

PDF OpenReview

Consistency Checks for Language Model Forecasters Abhimanyu Pallavi Sudhir, Alejandro Alvarez, Adam Shen, Daniel Paleka

PDF OpenReview

Consistency Models with Learned Idempotent Boundary Conditions Gianluigi Silvestri, Luca Ambrogioni

PDF OpenReview

Consistent Validation for Predictive Methods in Spatial Settings David R. Burt, Yunyi Shen, Tamara Broderick

PDF OpenReview

Constructing Artificial Life and Materials Scientists with Accelerated AI Using Deep AndersoNN Saleem Abdul Fattah Ahmed Al Dajani, David Keyes

PDF OpenReview

Constructing Gauge-Invariant Neural Networks for Scientific Applications Manos Theodosis, Demba E. Ba, Nima Dehmamy

PDF OpenReview

Constructing Gauge-Invariant Neural Networks for Scientific Applications Manos Theodosis, Demba E. Ba, Nima Dehmamy

PDF OpenReview

ContextCite: Attributing Model Generation to Context Benjamin Cohen-Wang, Harshay Shah, Kristian Georgiev, Aleksander Madry

PDF OpenReview

ContextCite: Attributing Model Generation to Context Benjamin Cohen-Wang, Harshay Shah, Kristian Georgiev, Aleksander Madry

PDF OpenReview

Contextualized Hybrid Ensemble Q-Learning: Learning Fast with Control Priors Emma Cramer, Bernd Frauenknecht, Ramil Sabirov, Sebastian Trimpe

PDF OpenReview

Continual Deep Learning on the Edge via Stochastic Local Competition Among Subnetworks Theodoros Christophides, Kyriakos Tolias, Sotirios Chatzis

PDF OpenReview

Contrastive Sparse Autoencoders for Interpreting Planning of Chess-Playing Agents Yoann Poupart

PDF OpenReview

Controlling Large Language Model Agents with Entropic Activation Steering Nate Rahn, Pierluca D'Oro, Marc G Bellemare

PDF OpenReview

CoordConformer: Heterogenous EEG Datasets Decoding Using Transformers Sharat Patil, Robin Tibor Schirrmeister, Frank Hutter, Tonio Ball

PDF OpenReview

Coordination Failure in Cooperative Offline MARL Callum Rhys Tilbury, Juan Claude Formanek, Louise Beyers, Jonathan Phillip Shock, Arnu Pretorius

PDF OpenReview

CoSy: Evaluating Textual Explanations of Neurons Laura Kopf, Philine Lou Bommer, Anna Hedström, Sebastian Lapuschkin, Marina MC Höhne, Kirill Bykov

PDF OpenReview

CoSy: Evaluating Textual Explanations of Neurons Laura Kopf, Philine Lou Bommer, Anna Hedström, Sebastian Lapuschkin, Marina MC Höhne, Kirill Bykov

PDF OpenReview

CPeSFA: Empowering SFs for Policy Learning and Transfer in Continuous Action Spaces Yining Li, Tianpei Yang, Wei Guo, Jianye Hao, Yan Zheng

PDF OpenReview

Crafting Large Language Models for Enhanced Interpretability Chung-En Sun, Tuomas Oikarinen, Tsui-Wei Weng

PDF OpenReview

Cramming Protein Language Model Training in 24 GPU Hours Nathan C. Frey, Taylor Joren, Aya Abdelsalam Ismail, Allen Goodman, Richard Bonneau, Kyunghyun Cho, Vladimir Gligorijevic

PDF OpenReview

Cross-Domain Knowledge Transfer for RL via Preference Consistency Ting-Hsuan Huang, Ping-Chun Hsieh

PDF OpenReview

Cross-Lingual QA: A Key to Unlocking In-Context Cross-Lingual Performance Sunkyoung Kim, Dayeon Ki, Yireun Kim, Jinsik Lee

PDF OpenReview

Cross-Modality Matching and Prediction of Perturbation Responses with Labeled Gromov-Wasserstein Optimal Transport Jayoung Ryu, Romain Lopez, Charlotte Bunne, Luca Pinello, Aviv Regev

PDF OpenReview

Cross-Modality Matching and Prediction of Perturbation Responses with Labeled Gromov-Wasserstein Optimal Transport Jayoung Ryu, Romain Lopez, Charlotte Bunne, Luca Pinello, Aviv Regev

PDF OpenReview

DACO: Towards Application-Driven and Comprehensive Data Analysis via Code Generation Xueqing Wu, Rui Zheng, Jingzhen Sha, Te-Lin Wu, Hanyu Zhou, Tang Mohan, Kai-Wei Chang, Nanyun Peng, Haoran Huang

PDF OpenReview

DARE: The Deep Adaptive Regulator for Control of Uncertain Continuous-Time Systems Harrison Waldon, Fayçal Drissi, Yannick Limmer, Uljad Berdica, Jakob Nicolaus Foerster, Alvaro Cartea

PDF OpenReview

DASH: Warm-Starting Neural Network Training Without Loss of Plasticity Under Stationarity Baekrok Shin, Junsoo Oh, Hanseul Cho, Chulhee Yun

PDF OpenReview

Data as a Consumable Resource Dar Gilboa, Siddhartha Jain, Jarrod Ryan McClean

PDF OpenReview

Data Mixture Inference: What Do BPE Tokenizers Reveal About Their Training Data? Jonathan Hayase, Alisa Liu, Yejin Choi, Sewoong Oh, Noah A. Smith

PDF OpenReview

Deciphering the Definition of Adversarial Robustness for Post-Hoc OOD Detectors Peter Lorenz, Mario Ruben Fernandez, Jens Müller, Ullrich Koethe

PDF OpenReview

DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning Jianxiong Li, Jinliang Zheng, Yinan Zheng, Liyuan Mao, Xiao Hu, Sijie Cheng, Haoyi Niu, Jihao Liu, Yu Liu, Jingjing Liu, Ya-Qin Zhang, Xianyuan Zhan

PDF OpenReview

Decoder Ensembling for Learned Latent Geometries Stas Syrota, Pablo Moreno-Muñoz, Søren Hauberg

PDF OpenReview

Decoding Chemical Predictions: Group Contribution Methods for XAI Gabriel Cathoud, Vignesh Ram Somnath, Luis Macedo, Kjell Jorner

PDF OpenReview

Decoding-Time Language Model Alignment with Multiple Objectives Ruizhe Shi, Yifang Chen, Yushi Hu, Alisa Liu, Hannaneh Hajishirzi, Noah A. Smith, Simon Shaolei Du

PDF OpenReview

Decomposed Evaluations of Geographic Disparities in Text-to-Image Models Abhishek Sureddy, Dishant Padalia, Nandhinee Periyakaruppan, Oindrila Saha, Adina Williams, Adriana Romero-Soriano, Megan Richards, Polina Kirichenko, Melissa Hall

PDF OpenReview

Decomposed Linear Dynamical Systems (dLDS) for Identifying the Latent Dynamics Underlying High-Dimensional Time-Series Noga Mudrik, Yenho Chen, Eva Yezerets, Christopher John Rozell, Adam Shabti Charles

PDF OpenReview

Decomposing and Editing Predictions by Modeling Model Computation Harshay Shah, Andrew Ilyas, Aleksander Madry

PDF OpenReview

Decomposing and Interpreting Image Representations via Text in ViTs Beyond CLIP Sriram Balasubramanian, Samyadeep Basu, Soheil Feizi

PDF OpenReview

Decoupled Differentiable Neural Architecture Search: Memory-Efficient Differentiable NAS via Disentangled Search Space Libin Hou

PDF OpenReview

Decoupled Stochastic Gradient Descent for N-Player Games Ali Zindari, Parham Yazdkhasti, Tatjana Chavdarova, Sebastian U Stich

PDF OpenReview

Deep Content Understanding Toward Entity and Aspect Target Sentiment Analysis on Foundation Models Vorakit Vorakitphan, Milos Basic, Guilhaume Leroy Meline

PDF OpenReview

Deep Learning for Protein-Ligand Docking: Are We There yet? Alex Morehead, Nabin Giri, Jian Liu, Jianlin Cheng

PDF OpenReview

Deep Networks Always Grok and Here Is Why Ahmed Imtiaz Humayun, Randall Balestriero, Richard Baraniuk

PDF OpenReview

Deep Reinforcement Learning for Equilibrium Computation in Multi-Stage Auctions and Contests Fabian Raoul Pieroth, Nils Kohring, Martin Bichler

PDF OpenReview

Deep Supramolecular Language Processing for Co-Crystal Prediction Rebecca Birolo, Rıza Özçelik, Andrea Aramini, Michele R. Chierotti, Roberto Gobetto, Francesca Grisoni

PDF OpenReview

DeePC-Hunt: Data-Enabled Predictive Control Hyperparameter Tuning via Differentiable Optimization Michael Cummins, Alberto Padoan, Keith Moffat, John Lygeros, Florian Dorfler

PDF OpenReview

Defending Against Unknown Corrupted Agents: Reinforcement Learning of Adversarially Robust Nash Equilibria Andi Nika, Jonathan Nöther, Adish Singla, Goran Radanovic

PDF OpenReview

Delay Embedding Theory of Neural Sequence Models Mitchell Ostrow, Adam Joseph Eisen, Ila R Fiete

PDF OpenReview

Delayed Adversarial Attacks on Stochastic Multi-Armed Bandits Pierriccardo Olivieri, Matteo Castiglioni, Nicola Gatti

PDF OpenReview

Demonstrations in In-Context Learning for LLMs with Large Label Space Zhan Li, Fanghui Liu, Volkan Cevher, Grigorios Chrysos

PDF OpenReview

Demystifying Amortized Causal Discovery with Transformers Francesco Montagna, Max Cairney-Leeming, Dhanya Sridhar, Francesco Locatello

PDF OpenReview

Denoising Diffusion Variational Inference: Diffusion Models as Expressive Variational Posteriors Wasu Top Piriyakulkij, Yingheng Wang, Volodymyr Kuleshov

PDF OpenReview

Describe-and-Dissect: Interpreting Neurons in Vision Networks with Language Models Nicholas Bai, Rahul Ajay Iyer, Tuomas Oikarinen, Tsui-Wei Weng

PDF OpenReview

DETAIL: Task DEmonsTration Attribution for Interpretable In-Context Learning Zijian Zhou, Xiaoqiang Lin, Xinyi Xu, Alok Prakash, Daniela Rus, Bryan Kian Hsiang Low

PDF OpenReview

Detecting Critical Treatment Effect Bias in Small Subgroups Piersilvio De Bartolomeis, Javier Abad, Konstantin Donhauser, Fanny Yang

PDF OpenReview

Detrimental Memories in Transfer Learning Amal Alnouri, Timothy J Wroge, Bilal Alsallakh

PDF OpenReview

Diagnosing and Fixing Common Problems in Bayesian Optimization for Molecule Design Austin Tripp, José Miguel Hernández-Lobato

PDF OpenReview

Differentiable Approximations of Fair OWA Optimization My H Dinh, James Kotary, Ferdinando Fioretto

PDF OpenReview

Differentiable Cluster Graph Neural Network Yanfei Dong, Mohammed Haroon Dupty, Lambert Deng, Zhuanghua Liu, Yong Liang Goh, Wee Sun Lee

PDF OpenReview

Differentiable Cost-Parameterized Monge mAP Estimators Samuel Howard, George Deligiannidis, Patrick Rebeschini, James Thornton

PDF OpenReview

Differentiable Iterated Function Systems Cory Braker Scott

PDF OpenReview

Differentiable Local Intrinsic Dimension Estimation with Diffusion Models Hamidreza Kamkari, Brendan Leigh Ross, Rasa Hosseinzadeh, Jesse C. Cresswell, Gabriel Loaiza-Ganem

PDF OpenReview

Differentiable Mapper for Topological Optimization of Data Representation Ziyad Oulhaj, Mathieu Carrière, Bertrand Michel

PDF OpenReview

Differentiable Short-Time Fourier Transform: A Time-Frequency Layer with Learnable Parameters Maxime Leiber, Yosra Marnissi, Axel Barrau

PDF OpenReview

Differentiable Soft Min-Max Loss to Restrict Weight Range for Model Quantization Arnav Kundu, Chungkuk Yoo, Minsik Cho, Saurabh Adya

PDF OpenReview

Differentiable Weighted Automata Anand Balakrishnan, Jyotirmoy V. Deshmukh

PDF OpenReview

Differentiable Wireless Simulation with Geometric Transformers Thomas Hehn, Markus Peschl, Tribhuvanesh Orekondy, Arash Behboodi, Johann Brehmer

PDF OpenReview

DiffFit: Differentiable Fitting of Molecule Structures to a Cryo-EM mAP Deng Luo, Zainab Alsuwaykit, Dawar Khan, Ondrej Strnad, Tobias Isenberg, Ivan Viola

PDF OpenReview

Diffusion Domain Expansion: Learning to Coordinate Pre-Trained Diffusion Models Egor Lifar, Semyon Savkin, Timur Garipov, Shangyuan Tong, Tommi Jaakkola

PDF OpenReview

Diffusion Models with Group Equivariance Haoye Lu, Spencer Szabados, Yaoliang Yu

PDF OpenReview

Diffusion-Based Episodes Augmentation for Offline Multi-Agent Reinforcement Learning Jihwan Oh, Sungnyun Kim, Gahee Kim, SeongHwan Kim, Se-Young Yun

PDF OpenReview

DiffusionBlend: Learning 3D Image Prior Through Position-Aware Diffusion Score Blending for 3D Computed Tomography Reconstruction Bowen Song, Jason Hu, Zhaoxu Luo, Jeffrey A Fessler, Liyue Shen

PDF OpenReview

DiffusionGuard: A Robust Defense Against Malicious Diffusion-Based Image Editing June Suk Choi, Kyungmin Lee, Jongheon Jeong, Saining Xie, Jinwoo Shin, Kimin Lee

PDF OpenReview

DiffusionPDE: Generative PDE-Solving Under Partial Observation Jiahe Huang, Guandao Yang, Zichen Wang, Jeong Joon Park

PDF OpenReview

DigiRL: Training In-the-Wild Device-Control Agents with Autonomous Reinforcement Learning Hao Bai, Yifei Zhou, Mert Cemri, Jiayi Pan, Alane Suhr, Sergey Levine, Aviral Kumar

PDF OpenReview

DigiRL: Training In-the-Wild Device-Control Agents with Autonomous Reinforcement Learning Hao Bai, Yifei Zhou, Mert Cemri, Jiayi Pan, Alane Suhr, Sergey Levine, Aviral Kumar

PDF OpenReview

DigiRL: Training In-the-Wild Device-Control Agents with Autonomous Reinforcement Learning Yifei Zhou, Hao Bai, Mert Cemri, Jiayi Pan, Alane Suhr, Sergey Levine, Aviral Kumar

PDF OpenReview

DiLoCo: Distributed Low-Communication Training of Language Models Arthur Douillard, Qixuan Feng, Andrei Alex Rusu, Rachita Chhaparia, Yani Donchev, Adhiguna Kuncoro, MarcAurelio Ranzato, Arthur Szlam, Jiajun Shen

PDF OpenReview

DiMViS: Diffusion-Based Multi-View Synthesis Giuseppe Di Giacomo, Giulio Franzese, Tania Cerquitelli, Carla Fabiana Chiasserini, Pietro Michiardi

PDF OpenReview

Dirac--Bianconi Graph Neural Networks - Enabling Long-Range Graph Predictions Christian Nauck, Rohan Gorantla, Michael Lindner, Konstantin Schürholt, Antonia S J S Mey, Frank Hellmann

PDF OpenReview

Discovering Preference Optimization Algorithms with and for Large Language Models Chris Lu, Samuel Holt, Claudio Fanconi, Alex James Chan, Jakob Nicolaus Foerster, Mihaela van der Schaar, Robert Tjarko Lange

PDF OpenReview

Discrete Diffusion Posterior Sampling for Protein Design Mert Cemri, Ajil Jalal, Kannan Ramchandran

PDF OpenReview

Disentangled Representation Learning Through Geometry Preservation with the Gromov-Monge Gap Théo Uscidda, Luca Eyring, Karsten Roth, Fabian J Theis, Zeynep Akata, Marco Cuturi

PDF OpenReview

Disjoint Processing Mechanisms of Hierarchical and Linear Grammars in Large Language Models Aruna Sankaranarayanan, Dylan Hadfield-Menell, Aaron Mueller

PDF OpenReview

Dissecting Query-Key Interaction in Vision Transformers Xu Pan, Aaron Philip, Ziqian Xie, Odelia Schwartz

PDF OpenReview

DistilDIRE: A Small, Fast, Cheap and Lightweight Diffusion Synthesized Deepfake Detection Yewon Lim, Changyeon Lee, Aerin Kim, Oren Etzioni

PDF OpenReview

Distillation Based Robustness Verification with PAC Guarantees Patrick Indri, Peter Blohm, Anagha Athavale, Ezio Bartocci, Georg Weissenbacher, Matteo Maffei, Dejan Nickovic, Thomas Gärtner, Sagar Malhotra

PDF OpenReview

Distilling LLMs’ Decomposition Abilities into Compact Language Models Denis Tarasov, Kumar Shridhar

PDF OpenReview

Distilling LLMs’ Decomposition Abilities into Compact Language Models Denis Tarasov, Kumar Shridhar

PDF OpenReview

Distributional Monte-Carlo Planning with Thompson Sampling in Stochastic Environments Tuan Quang Dam, Brahim Driss, Odalric-Ambrym Maillard

PDF OpenReview

Distributional Preference Alignment of LLMs via Optimal Transport Igor Melnyk, Youssef Mroueh, Brian Belgodere, Mattia Rigotti, Apoorva Nitsure, Mikhail Yurochkin, Kristjan Greenewald, Jiri Navratil, Jarret Ross

PDF OpenReview

Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal Algorithm Miao Lu, Han Zhong, Tong Zhang, Jose Blanchet

PDF OpenReview

DiveR-CT: Diversity-Enhanced Red Teaming with Relaxing Constraints Andrew Zhao, Quentin Xu, Matthieu Lin, Shenzhi Wang, Yong-jin Liu, Zilong Zheng, Gao Huang

PDF OpenReview

Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation Idan Schwartz, Guy Yariv, Itai Gat, Yossi Adi, Sagie Benaim, Lior Wolf

PDF OpenReview

Do LLM Agents Have Regret? a Case Study in Online Learning and Games Chanwoo Park, Xiangyu Liu, Asuman E. Ozdaglar, Kaiqing Zhang

PDF OpenReview

Do LLM Agents Have Regret? a Case Study in Online Learning and Games Chanwoo Park, Xiangyu Liu, Asuman E. Ozdaglar, Kaiqing Zhang

PDF OpenReview

Do LLMs Dream of Elephants (when Told Not to)? Latent Concept Association and Associative Memory in Transformers Yibo Jiang, Goutham Rajendran, Pradeep Kumar Ravikumar, Bryon Aragam

PDF OpenReview

Do LLMs Dream of Elephants (when Told Not to)? Latent Concept Association and Associative Memory in Transformers Yibo Jiang, Goutham Rajendran, Pradeep Kumar Ravikumar, Bryon Aragam

PDF OpenReview

Do Parameters Reveal More than Loss for Membership Inference? Anshuman Suri, Xiao Zhang, David Evans

PDF OpenReview

DocParseNet: Advanced Semantic Segmentation and OCR Embeddings for Efficient Scanned Document Annotation Ahmad Mohammadshirazi, Ali Nosratifiroozsalari, Mengxi Zhou, Dheeraj Kulshrestha, Rajiv Ramnath

PDF OpenReview

Does Editing Provide Evidence for Localization? Zihao Wang, Victor Veitch

PDF OpenReview

Does SGD Really Happen in Tiny Subspaces? Minhak Song, Kwangjun Ahn, Chulhee Yun

PDF OpenReview

Does Your Data Spark Joy? Performance Gains from Domain Upsampling at the End of Training Cody Blakeney, Mansheej Paul, Brett W. Larsen, Sean Owen, Jonathan Frankle

PDF OpenReview

Domain-Aware Fine-Tuning of Foundation Models Uğur Ali Kaplan, Yumeng Li, Margret Keuper, Anna Khoreva, Dan Zhang

PDF OpenReview

Doob's Lagrangian: A Sample-Efficient Variational Approach to Transition Path Sampling Yuanqi Du, Michael Plainer, Rob Brekelmans, Chenru Duan, Frank Noe, Carla P Gomes, Alan Aspuru-Guzik, Kirill Neklyudov

PDF OpenReview

DPM: Dual Preferences-Based Multi-Agent Reinforcement Learning Sehyeok Kang, Yongsik Lee, Se-Young Yun

PDF OpenReview

DPO Meets PPO: Reinforced Token Optimization for RLHF Han Zhong, Guhao Feng, Wei Xiong, Xinle Cheng, Li Zhao, Di He, Jiang Bian, Liwei Wang

PDF OpenReview

DPO-Finetuned Large Multi-Modal Planner with Retrieval-Augmented Generation @ EgoPlan Challenge ICML 2024 Kwanghyeon Lee, Mina Kang, Hyungho Na, HeeSun Bae, Byeonghu Na, Doyun Kwon, Seungjae Shin, Yeongmin Kim, Kim Taewoo, Seungmin Yun, Il-chul Moon

PDF OpenReview

DrJAX: Scalable and Differentiable MapReduce Primitives in JAX J Keith Rush, Zachary Charles, Zachary Garrett, Sean Augenstein, Nicole Elyse Mitchell

PDF OpenReview

Dual Approximation Policy Optimization Zhihan Xiong, Maryam Fazel, Lin Xiao

PDF OpenReview

Dual Risk Minimization for Robust Fine-Tuning of Zero-Shot Models Kaican Li, Weiyan Xie, Ricardo Silva, Nevin L. Zhang

PDF OpenReview

DualBind: A Dual-Loss Framework for Protein-Ligand Binding Affinity Prediction Meng Liu, Saee Gopal Paliwal

PDF OpenReview

DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning Anthony Liang, Guy Tennenholtz, ChihWei Hsu, Yinlam Chow, Erdem Biyik, Craig Boutilier

PDF OpenReview

E-ProTran: Efficient Probabilistic Transformers for Forecasting Batuhan Koyuncu, Tim Nico Bauerschmidt, Isabel Valera

PDF OpenReview

E(n) Equivariant Message Passing Cellular Networks Veljko Kovac, Erik J Bekkers, Pietro Lio, Floor Eijkelboom

PDF OpenReview

Early Period of Training Impacts Out-of-Distribution Generalization Chen Cecilia Liu, Iryna Gurevych

PDF OpenReview

EBBS: An Ensemble with Bi-Level Beam Search for Zero-Shot Machine Translation Yuqiao Wen, Behzad Shayegh, Chenyang Huang, Yanshuai Cao, Lili Mou

PDF OpenReview

ECO: Efficient Computational Optimization for Exact Machine Unlearning in Deep Neural Networks Yu-Ting Huang, Pei-Yuan Wu, Chuan-Ju Wang

PDF OpenReview

EEG2TEXT: Open Vocabulary EEG-to-Text Decoding with EEG Pre-Training and Multi-View Transformer Hanwen Liu, Daniel Hajialigol, Benny Antony, Aiguo Han, Xuan Wang

PDF OpenReview

Effect of Random Learning Rate: Theoretical Analysis of SGD Dynamics in Non-Convex Optimization via Stationary Distribution Naoki Yoshida, Shogo Nakakita, Masaaki Imaizumi

PDF OpenReview

Effective Bayesian Causal Inference via Structural Marginalisation and Autoregressive Orders Christian Toth, Christian Knoll, Franz Pernkopf, Robert Peharz

PDF OpenReview

Effective Layer Pruning Through Similarity Metric Perspective Ian Pons, Bruno Yamamoto, Anna Helena Reali Costa, Artur Jordao

PDF OpenReview

Effective Sharpness Aware Minimization Requires Layerwise Perturbation Scaling Moritz Haas, Jin Xu, Volkan Cevher, Leena Chennuru Vankadara

PDF OpenReview

Efficiency and Transferability of Inductive Mondrian Conformal Predictors for Drug-Drug Synergy Arushi GK Majha

PDF OpenReview

Efficient 3D Molecular Generation with Flow Matching and Scale Optimal Transport Ross Irwin, Alessandro Tibo, Jon Paul Janet, Simon Olsson

PDF OpenReview

Efficient Adaptive Federated Optimization Su Hyeong Lee, Sidharth Sharma, Manzil Zaheer, Tian Li

PDF OpenReview

Efficient Differentially Private Fine-Tuning of Diffusion Models Jing Liu, Andrew Lowy, Toshiaki Koike-Akino, Kieran Parsons, Ye Wang

PDF OpenReview

Efficient Document Ranking with Learnable Late Interactions Himanshu Jain, Ziwei Ji, Ankit Singh Rawat, Andreas Veit, Sadeep Jayasumana, Sashank J. Reddi, Aditya Krishna Menon, Felix Yu

PDF OpenReview

Efficient Document Ranking with Learnable Late Interactions Himanshu Jain, Ziwei Ji, Sashank J. Reddi, Ankit Singh Rawat, Felix Yu, Aditya Krishna Menon, Sadeep Jayasumana

PDF OpenReview

Efficient Evolutionary Search over Chemical Space with Large Language Models Haorui Wang, Marta Skreta, Yuanqi Du, Wenhao Gao, Lingkai Kong, Cher Tian Ser, Felix Strieth-Kalthoff, Chenru Duan, Yuchen Zhuang, Yue Yu, Yanqiao Zhu, Alan Aspuru-Guzik, Kirill Neklyudov, Chao Zhang

PDF OpenReview

Efficient Inverse Reinforcement Learning Without Compounding Errors Nicolas Espinosa Dice, Gokul Swamy, Sanjiban Choudhury, Wen Sun

PDF OpenReview

Efficient Linear System Solver with Transformers Max Vladymyrov, Johannes von Oswald, Nolan Andrew Miller, Mark Sandler

PDF OpenReview

Efficient LLM Pruning with Global Token-Dependency Awareness and Hardware-Adapted Inference Oshin Dutta, Ritvik Gupta, Sumeet Agarwal

PDF OpenReview

Efficient Multi-Prompt Evaluation of LLMs Felipe Maia Polo, Ronald Xu, Lucas Weber, Mírian Silva, Onkar Bhardwaj, Leshem Choshen, Allysson Flavio Melo de Oliveira, Yuekai Sun, Mikhail Yurochkin

PDF OpenReview

Efficient Offline Learning of Ranking Policies via Top-$k$ Policy Decomposition Ren Kishimoto, Koichi Tanaka, Haruka Kiyohara, Yusuke Narita, Nobuyuki Shimizu, Yasuo Yamamoto, Yuta Saito

PDF OpenReview

Efficient Offline Reinforcement Learning: The Critic Is Critical Adam Jelley, Trevor McInroe, Sam Devlin, Amos Storkey

PDF OpenReview

Efficient Training of Language Models with Compact and Consistent Next Token Distributions Ashutosh Sathe, Sunita Sarawagi

PDF OpenReview

EggNet: An Evolving Graph-Based Graph Attention Network for Particle Track Reconstruction Paolo Calafiura, Jay Chan, Loic Delabrouille, Brandon Wang

PDF OpenReview

EgoSim: Egocentric Exploration in Virtual Worlds with Multi-Modal Conditioning Wei Yu, Songheng Yin, Steve Easterbrook, Animesh Garg

PDF OpenReview

EigenVI: Score-Based Variational Inference with Orthogonal Function Expansions Diana Cai, Chirag Modi, Charles Margossian, Robert M. Gower, David Blei, Lawrence K. Saul

PDF OpenReview

Eliciting Black-Box Representations from LLMs Through Self-Queries Dylan Sam, Marc Anton Finzi

PDF OpenReview

ELITR-Bench: A Meeting Assistant Benchmark for Long-Context Language Models Thibaut Thonet, Jos Rozen, Laurent Besacier

PDF OpenReview

Emergent Representations in Networks Trained with the Forward-Forward Algorithm Niccolo Tosato, Lorenzo Basile, Emanuele Ballarin, Giuseppe De Alteriis, Alberto Cazzaniga, Alessio Ansuini

PDF OpenReview

EMPO: A Clustering-Based On-Policy Algorithm for Offline Reinforcement Learing Jongeui Park, Myungsik Cho, Youngchul Sung

PDF OpenReview

End-to-End Causal Effect Estimation from Unstructured Natural Language Data Nikita Dhawan, Leonardo Cotta, Karen Ullrich, Rahul Krishnan, Chris J. Maddison

PDF OpenReview

End-to-End Differentiable Model of Robot-Terrain Interactions Ruslan Agishev, Vladimír Kubelka, Martin Pecka, Tomas Svoboda, Karel Zimmermann

PDF OpenReview

Energy-Based Hopfield Boosting for Out-of-Distribution Detection Claus Hofmann, Simon Lucas Schmid, Bernhard Lehner, Daniel Klotz, Sepp Hochreiter

PDF OpenReview

Energy-Based Hopfield Boosting for Out-of-Distribution Detection Claus Hofmann, Simon Lucas Schmid, Bernhard Lehner, Daniel Klotz, Sepp Hochreiter

PDF OpenReview

Energy-Free Guidance of Geometric Diffusion Models for 3D Molecule Inverse Design Aksh Garg, Jiaqi Han, Sanjay Nagaraj, Minkai Xu

PDF OpenReview

Energy-Free Guidance of Geometric Diffusion Models for 3D Molecule Inverse Design Jiaqi Han, Aksh Garg, Sanjay Nagaraj, Minkai Xu

PDF OpenReview

Energy-Free Guidance of Geometric Diffusion Models for 3D Molecule Inverse Design Sanjay Nagaraj, Jiaqi Han, Aksh Garg, Minkai Xu

PDF OpenReview

Enhancing Actor-Critic Decision-Making with Afterstate Models for Continuous Control Norio Kosaka

PDF OpenReview

Enhancing Concept-Based Learning with Logic Deepika Vemuri, Gautham Bellamkonda, Vineeth N. Balasubramanian

PDF OpenReview

Enhancing Concept-Based Learning with Logic Deepika Vemuri, Gautham Bellamkonda, Vineeth N. Balasubramanian

PDF OpenReview

Enhancing Fine-Grained Multi-Modal Alignment via Adapters: A Parameter-Efficient Training Framework for Referring Image Segmentation Zunnan Xu, Jiaqi Huang, Ting Liu, Yong Liu, Haonan Han, Kehong Yuan, Xiu Li

PDF OpenReview

Enhancing Intent Understanding for Ambiguous Prompt: A Human-Machine Co-Adaption Strategy Yangfan He, Yuxuan Bai, Tianyu Shi

PDF OpenReview

Enhancing LLM Complex Reasoning Capability Through Hyperbolic Geometry Menglin Yang, Aosong Feng, Bo Xiong, Jiahong Liu, Irwin King, Rex Ying

PDF OpenReview

Enhancing Multi-Tip Artifact Detection in STM Images Using Fourier Transform and Vision Transformers Tommaso Rodani, Alessio Ansuini, Alberto Cazzaniga

PDF OpenReview

Enhancing Peak Assignment in CNMR Spectroscopy: A Novel Approach Using Multimodal Alignment Hao Xu, Zhengyang Zhou, Pengyu Hong

PDF OpenReview

Enhancing Protein Design Robustness Through Noise-Informed Sequence Design Yehlin Cho, Sergey Ovchinnikov, Christopher Frank

PDF OpenReview

Enhancing Single-Cell VAE Latent Space via Semi-Supervision Meichen Gong, Konstantin Ivanov, Merja Heinäniemi, Ville Hautamaki

PDF OpenReview

Enhancing Stability for Large Models Training in Constrained Bandwidth Networks Yun Dai, Tejas Dharamsi, Pin-Lun Hsu, Tao Song, Hamed Firooz

PDF OpenReview

Enhancing the Resilience of LLMs Against Grey-Box Extractions Hanbo Huang, Yihan Li, Bowen Jiang, Bo Jiang, Lin Liu, Zhuotao Liu, Ruoyu Sun, Shiyu Liang

PDF OpenReview

Ensemble Guidance: Towards Generative 3D SBDD in Bioactive Chemical Spaces Charles Harris, Arian Rokkum Jamasb, Pietro Lio, Tom Leon Blundell

PDF OpenReview

EPD: Long-Term Memory Extraction, Context-Aware Planning and Multi-Iteration Decision @ EgoPlan Challenge ICML 2024 Letian Shi, Qi Lv, Xiang Deng, Liqiang Nie

PDF OpenReview

Equation Identification for Fluid Flows via Physics-Informed Neural Networks Alexander New, Marisel Villafañe-Delgado, Charles Shugert

PDF OpenReview

EquiTorch: A Modularized Package for Flexibly Constructing Equivariant GNNs Building upon PyTorch-Geometric Tong Wang, Chuan Chen

PDF OpenReview

Equivariant Flow Matching for Molecular Conformer Generation Majdi Hassan, Nikhil Shenoy, Jungyoon Lee, Hannes Stark, Stephan Thaler, Dominique Beaini

PDF OpenReview

Equivariant Flow Matching for Molecular Conformer Generation Majdi Hassan, Nikhil Shenoy, Jungyoon Lee, Hannes Stark, Stephan Thaler, Dominique Beaini

PDF OpenReview

Equivariant Neural Diffusion for Molecule Generation François R J Cornet, Grigory Bartosh, Mikkel N. Schmidt, Christian A. Naesseth

PDF OpenReview

Equivariant Transformer Forcefields for Molecular Conformer Generation Rui Feng, Binghong Chen, Chao Zhang

PDF OpenReview

Equivariant vs. Invariant Layers: A Comparison of Backbone and Pooling for Point Cloud Classification Abihith Kothapalli, Ashkan Shahbazi, Xinran Liu, Robert Sheng, Soheil Kolouri

PDF OpenReview

Essentially Sharp Estimates on the Entropy Regularization Error in Discounted Markov Decision Processes Johannes Müller, Semih Cayci

PDF OpenReview

Estimating Probability Densities of Tabular Data Using a Transformer Model Combined with Denoising Diffusion Henry W. Leung, Jo Bovy, Joshua S. Speagle

PDF OpenReview

Ethereum AI Agent Coordinator (EAAC): A Framework for AI Agent Activity Coordination Taehoon Kim

PDF OpenReview

Ethical-Lens: Curbing Malicious Usages of Open-Source Text-to-Image Models Yuzhu Cai, Sheng Yin, Yuxi Wei, Chenxin Xu, Weibo Mao, Felix Juefei-Xu, Siheng Chen, Yanfeng Wang

PDF OpenReview

Euler Operators for Mis-Specified Physics-Informed Neural Networks Charlie Cowen-Breen, Yongji Wang, Stephen Bates, Ching-Yao Lai

PDF OpenReview

Evaluating Self-Supervised Foundation Models in Holographic Imaging Silas Dietler, Yanick Zeder, Elias Graf, Kilian Koch, Andreas Schwendimann, Tommaso Bendinelli

PDF OpenReview

Evaluation of RAG Metrics for Question Answering in the Telecom Domain Sujoy Roychowdhury, Sumit Soman, H. G. Ranjani, Neeraj Gunda, Vansh Chhabra, Sai Krishna Bala

PDF OpenReview

EVCL: Elastic Variational Continual Learning with Weight Consolidation Hunar Batra, Ronald Clark

PDF OpenReview

Event-Based Federated Q-Learning Guner Dilsad Er, Michael Muehlebach

PDF OpenReview

EvoSBDD: Latent Evolution for Accurate and Efficient Structure-Based Drug Design Danny Reidenbach

PDF OpenReview

Exact Soft Analytical Side-Channel Attacks Using Tractable Circuits Thomas Wedenig, Rishub Nagpal, Gaëtan Cassiers, Stefan Mangard, Robert Peharz

PDF OpenReview

Explaining the Model, Protecting Your Data: Revealing and Mitigating the Data Privacy Risks of Post-Hoc Model Explanations via Membership Inference Catherine Huang, Martin Pawelczyk, Himabindu Lakkaraju

PDF OpenReview

Exploiting Activation Sparsity with Dense to Dynamic-K Mixture-of-Experts Conversion Filip Szatkowski, Bartosz Wójcik, Mikołaj Piórczyński, Simone Scardapane

PDF OpenReview

Exploiting Approximate Symmetry for Efficient Multi-Agent Reinforcement Learning Batuhan Yardim, Niao He

PDF OpenReview

Exploiting Exogenous Structure for Sample-Efficient Reinforcement Learning Jia Wan, Sean R. Sinclair, Devavrat Shah, Martin J Wainwright

PDF OpenReview

Exploiting LLM Quantization Kazuki Egashira, Mark Vero, Robin Staab, Jingxuan He, Martin Vechev

PDF OpenReview

ExPLoRA: Parameter-Efficient Extended Pre-Training to Adapt Vision Transformers Under Domain Shifts Samar Khanna, Medhanie Irgau, David B. Lobell, Stefano Ermon

PDF OpenReview

Exploration and Application of AI in Space Science Xiang Zhao, You Song

PDF OpenReview

Exploring and Improving Drafts in Blockwise Parallel Decoding Taehyeon Kim, Ananda Theertha Suresh, Kishore A Papineni, Michael Riley, Sanjiv Kumar, Adrian Benton

PDF OpenReview

Exploring Integrality Grip for Mixed-Integer Programming by MCTS Planning Defeng Liu

PDF OpenReview

Exploring Monotonicity in Early-Exiting Language Models Filipe Laitenberger, Max Belitsky, Denys Sheremet

PDF OpenReview

Exploring Neural Scaling Laws in Molecular Pretraining with Synthetic Tasks Rodrigo Hormazabal, Seung Woo Ko, Inwan Yoo, Sehui Han, Paul Bertens

PDF OpenReview

Exploring Scaling Trends in LLM Robustness Nikolaus H. R. Howe, Michał Zając, Ian R. McKenzie, Oskar John Hollinsworth, Pierre-Luc Bacon, Adam Gleave

PDF OpenReview

Exploring Sequence Landscape of Biosynthetic Gene Clusters with Protein Language Models Tatiana Malygina, Olga Kalinina

PDF OpenReview

Exploring the Development of Complexity over Depth and Time in Deep Neural Networks Hannah Pinson, Aurélien Boland, Vincent Ginis, Mykola Pechenizkiy

PDF OpenReview

Exploring the Internal Mechanisms of Music LLMs: A Study of Root and Quality via Probing and Intervention Techniques Wenye Ma, Gus Xia

PDF OpenReview

ExpoMamba: Exploiting Frequency SSM Blocks for Efficient and Effective Image Enhancement Eashan Adhikarla, Kai Zhang, John Nicholson, Brian D. Davison

PDF OpenReview

Exponential Quantum Communication Advantage in Distributed Inference and Learning Hagay Michaeli, Dar Gilboa, Daniel Soudry, Jarrod Ryan McClean

PDF OpenReview

Expressivity of Neural Networks with Fixed Weights and Learned Biases Ezekiel Williams, Avery Hee-Woon Ryoo, Thomas Jiralerspong, Alexandre Payeur, Matthew G Perich, Luca Mazzucato, Guillaume Lajoie

PDF OpenReview

Extracting Finite State Machines from Transformers Rik Adriaensen, Jaron Maene

PDF OpenReview

Extracting Training Data from Document-Based VQA Models Francesco Pinto, Nathalie Rauschmayr, Florian Tramèr, Philip Torr, Federico Tombari

PDF OpenReview

Extrapolative Protein Design Through Triplet-Based Preference Learning Mostafa Karimi, Sharmi Banerjee, Tommi Jaakkola, Bella Dubrov, Shang Shang, Ron Benson

PDF OpenReview

Fairness Through Controlled (Un)Awareness in Node Embeddings Dennis Vetter, Jasper Forth, Gemma Roig, Holger Dell

PDF OpenReview

Fairness Through Partial Awareness: Evaluation of the Addition of Demographic Information for Bias Mitigation Methods Chung Peng Lee, Rachel Hong, Jamie Heather Morgenstern

PDF OpenReview

FairPFN: Transformers Can Do Counterfactual Fairness Jake Robertson, Noah Hollmann, Noor Awad, Frank Hutter

PDF OpenReview

Faithful and Fast Influence Function via Advanced Sampling Jungyeon Koh, Hyeonsu Lyu, Jonggyu Jang, Hyun Jong Yang

PDF OpenReview

Fast Adaptation and Robust Quantization of Multi-Modal Foundation Models from Associative Memory: A Case Study in SpeechLM Shang Wu, Yen-Ju Lu, Haozheng Luo, Jerry Yao-Chieh Hu, Jiayi Wang, Najim Dehak, Jesus Villalba, Han Liu

PDF OpenReview

Fast and Memory-Efficient Multi-Sequence Generation via Structured Masking Daniel Mingyi Israel, Siyan Zhao, Guy Van den Broeck, Aditya Grover

PDF OpenReview

Fast Machine Unlearning via Robust Training Youssef Allouah, Joshua Kazdan, Rachid Guerraoui, Sanmi Koyejo

PDF OpenReview

Fast Training Dataset Attribution via In-Context Learning Milad Fotouhi, Mohammad Taha Bahadori, Seyi Feyisetan, Payman Arabshahi, David Heckerman

PDF OpenReview

Fast yet Safe: Early-Exiting with Risk Control Metod Jazbec, Alexander Timans, Tin Hadži Veljković, Kaspar Sakmann, Dan Zhang, Christian A. Naesseth, Eric Nalisnick

PDF OpenReview

Fast yet Safe: Early-Exiting with Risk Control Metod Jazbec, Alexander Timans, Tin Hadži Veljković, Kaspar Sakmann, Dan Zhang, Christian A. Naesseth, Eric Nalisnick

PDF OpenReview

Fast-Forward FARGO: Accelerating Protoplanetary Disk Simulations with Limited Data Valentina Tardugno Poleo, David W Hogg, Shirley Ho

PDF OpenReview

FastDecode: High-Throughput LLM Serving Through Disaggregating Attention Computation Jiaao He, Kezhao Huang, Jidong Zhai

PDF OpenReview

Feature Learning Dynamics Under Grokking in a Sparse Parity Task Javier Sanguino Bautiste, Gregor Bachmann, Bobby He, Lorenzo Noci, Thomas Hofmann

PDF OpenReview

Federated Fine-Tuning of Vision Foundation Models via Probabilistic Masking Vasileios Tsouvalas, Yuki M Asano, Aaqib Saeed

PDF OpenReview

Fewer Truncations Improve Language Modeling Hantian Ding, Zijian Wang, Giovanni Paolini, Varun Kumar, Anoop Deoras, Dan Roth, Stefano Soatto

PDF OpenReview

Filling in the Gaps: LLM-Based Structured Data Generation from Semi-Structured Scientific Data Hanbum Ko, Hongjun Yang, Sehui Han, Sungwoong Kim, Sungbin Lim, Rodrigo Hormazabal

PDF OpenReview

Filtered Direct Preference Optimization Tetsuro Morimura, Mitsuki Sakamoto, Yuu Jinnai, Kenshi Abe, Kaito Ariu

PDF OpenReview

Finding NeMo: Localizing Neurons Responsible for Memorization in Diffusion Models Lukas Struppek, Dominik Hintersdorf, Kristian Kersting, Adam Dziedzic, Franziska Boenisch

PDF OpenReview

Finding Structure-Property Relationships for Molecular Property Predictions with Globally Explainable AI Jonas Teufel, Pascal Friederich

PDF OpenReview

Finding Visual Task Vectors Alberto Hojel, Yutong Bai, Trevor Darrell, Amir Globerson, Amir Bar

PDF OpenReview

Fine-Grained Analysis of In-Context Linear Estimation Yingcong Li, Ankit Singh Rawat, Samet Oymak

PDF OpenReview

Fine-Grained Analysis of In-Context Linear Estimation: Data, Architecture, and Beyond Yingcong Li, Ankit Singh Rawat, Samet Oymak

PDF OpenReview

Fine-Tuned Network Relies on Generic Representation to Solve Unseen Cognitive Task Dongyan Lin

PDF OpenReview

Fine-Tuning Large Language Models with User-Level Differential Privacy Zachary Charles, Arun Ganesh, Ryan McKenna, Hugh Brendan McMahan, Nicole Elyse Mitchell, Krishna Pillutla, J Keith Rush

PDF OpenReview

Fine-Tuning Medical Language Models for Enhanced Long-Contextual Understanding and Domain Expertise Qimin Yang, Rongshengwang, Chen Jiexin, Runqi Su, Tao Tan

PDF OpenReview

Fine-Tuning the ESM2 Protein Language Model to Understand the Functional Impact of Missense Variants Ali Saadat, Jacques Fellay

PDF OpenReview

Fine-Tuning with Uncertainty-Aware Priors Makes Vision and Language Foundation Models More Reliable Tim G. J. Rudner, Xiang Pan, Yucen Lily Li, Ravid Shwartz-Ziv, Andrew Gordon Wilson

PDF OpenReview

Finite Sample Identification: From Frequency to Time Domain Anastasios Tsiamis, Mohamed Abdalmoaty, Roy S. Smith, John Lygeros

PDF OpenReview

Finite-Time Convergence to an $\epsilon$-Efficient Nash Equilibrium in Potential Games Anna Maria Maddux, Reda Ouhamma, Maryam Kamgarpour

PDF OpenReview

Fisher-Aware Quantization for DETR Detectors with Critical-Category Objectives Huanrui Yang, Yafeng Huang, Zhen Dong, Denis A Gudovskiy, Tomoyuki Okuno, Yohei Nakata, Yuan Du, Kurt Keutzer, Shanghang Zhang

PDF OpenReview

Flexible Docking via Unbalanced Flow Matching Gabriele Corso, Vignesh Ram Somnath, Noah Getz, Regina Barzilay, Tommi Jaakkola, Andreas Krause

PDF OpenReview

Flexible Docking via Unbalanced Flow Matching Gabriele Corso, Vignesh Ram Somnath, Noah Getz, Regina Barzilay, Tommi Jaakkola, Andreas Krause

PDF OpenReview

FlowBack: A Flow-Matching Approach for Generative Backmapping of Macromolecules Michael Jones, Smayan Khanna, Andrew Ferguson

PDF OpenReview

FoMu-SSL: Foundation Model-Guided Multi-Sensor Self-Supervised Learning for Remote Sensing Dabin Seo, Haeji Jung, Jinkyu Kim

PDF OpenReview

Forecasting Smog Clouds with Deep Learning: A Proof-of-Concept Valentijn Oldenburg, Juan Cardenas-Cartagena, Matias Valdenegro-Toro

PDF OpenReview

Fourier Neural Operator Based Surrogates for $\textrm{CO}_2$ Storage in Realistic Geologies Anirban Chandra, Marius Koch, Suraj Pawar, Aniruddha Panda, Kamyar Azizzadenesheli, Jeroen Snippe, Faruk O. Alpak, Farah Hariri, Clement Etienam, Pandu Devarakota, Anima Anandkumar, Detlef Hohl

PDF OpenReview

Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos Jiahe Liu, Youran Qu, Qi Yan, Xiaohui Zeng, Lele Wang, Renjie Liao

PDF OpenReview

Free-Energy Equilibria: Toward a Theory of Interactions Between Boundedly-Rational Agents David Hyland, Tomáš Gavenčiak, Lancelot Da Costa, Conor Heins, Vojtech Kovarik, Julian Gutierrez, Michael J. Wooldridge, Jan Kulveit

PDF OpenReview

From AlexNet to Transformers: Measuring the Non-Linearity of Deep Neural Networks with Affine Optimal Transport Quentin Bouniot, Ievgen Redko, Anton Mallasto, Charlotte Laclau, Oliver Struckmeier, Karol Arndt, Markus Heinonen, Ville Kyrki, Samuel Kaski

PDF OpenReview

From Graph Diffusion to Graph Classification Jia Jun Cheng Xian, Sadegh Mahdavi, Renjie Liao, Oliver Schulte

PDF OpenReview

From Laboratory to Everyday Life: Personalized Stress Prediction via Smartwatches Batuhan Koyuncu, Aleyna Dilan Kıran, Katja Heilmann, Laith Hamid, Anja Buder, Veronika Engert, Martin Walter, Isabel Valera

PDF OpenReview

From Text to Pixel: Advancing Long-Context Understanding in MLLMs Yujie Lu, Xiujun Li, Tsu-Jui Fu, Miguel Eckstein, William Yang Wang

PDF OpenReview

From Words to Worlds: Compositionality for Cognitive Architectures Ruchira Dhar, Anders Søgaard

PDF OpenReview

Function Space Diversity for Uncertainty Prediction via Repulsive Last-Layer Ensembles Sophie Steger, Christian Knoll, Bernhard Klein, Holger Fröning, Franz Pernkopf

PDF OpenReview

Functional Acceleration for Policy Mirror Descent Veronica Chelu, Doina Precup

PDF OpenReview

Fundamental Limits of Prompt Compression: A Rate-Distortion Framework for Black-Box Language Models Adway Girish, Alliot Nagle, Ashok Vardhan Makkuva, Marco Bondaschi, Michael Gastpar, Hyeji Kim

PDF OpenReview

Fundamental Limits of Weak Learnability in High-Dimensional Multi-Index Models Emanuele Troiani, Yatin Dandi, Leonardo Defilippis, Lenka Zdeborova, Bruno Loureiro, Florent Krzakala

PDF OpenReview

FusionDTI: Fine-Grained Binding Discovery with Token-Level Fusion for Drug-Target Interaction Zhaohan Meng, Zaiqiao Meng, Iadh Ounis

PDF OpenReview

FusOn-pLM: A Fusion Oncoprotein-Specific Language Model via Focused Probabilistic Masking Sophia Vincoff, Shrey Goel, Kseniia Kholina, Pranam Chatterjee

PDF OpenReview

Future-Proof Vaccine Design with a Generative Model of Antibody Cross-Reactivity Noor Youssef, Sarah Gurev, Hannah Rivka Pierce-Hoffman, Alexander A Cohen, Luis F Caldera, Pamela J Bjorkman, Debora Susan Marks

PDF OpenReview

Games for AI-Control: Models of Safety Evaluations of AI Deployment Protocols Charlie Griffin, Buck Shlegeris, Alessandro Abate

PDF OpenReview

Gaussian Process-Based Representation Learning via Timeseries Symmetries Petar Bevanda, Max Beier, Armin Lederer, Alexandre Capone, Stefan Georg Sosnowski, Sandra Hirche

PDF OpenReview

Gene Regulatory Network Inference from Pre-Trained Single-Cell Transcriptomics Transformer with Joint Graph Learning Sindhura Kommu, Yizhi Wang, Yue Wang, Xuan Wang

PDF OpenReview

Gene-Centric Evaluation of Causal Variant Prediction for DNA Models Chantriolnt-Andreas Kapourani, Alice Del Vecchio, Agnieszka Dobrowolska, Andrew Anighoro, Edith M. Hessel, Lindsay Edwards, Cristian Regep

PDF OpenReview

Generalization vs. Memorization: Tracing Language Models' Capabilities Back to Pretraining Data Antonis Antoniades, Xinyi Wang, Yanai Elazar, Alfonso Amayuelas, Alon Albalak, Kexun Zhang, William Yang Wang

PDF OpenReview

Generalized Linear Bandits with Limited Adaptivity Ayush Sawarni, Nirjhar Das, Siddharth Barman, Gaurav Sinha

PDF OpenReview

Generalizing Convolution to Point Clouds Davide Bacciu, Francesco Landolfi

PDF OpenReview

Generalizing Offline Alignment Theoretical Paradigm with Diverse Divergence Constraints Haoyuan Sun, Yuxin Zheng, Yifei Zhao, Yongzhe Chang, Xueqian Wang

PDF OpenReview

Generated Audio Detectors Are Not Robust in Real-World Conditions Soumya Shaw, Ben Nassi, Lea Schönherr

PDF OpenReview

Generating Fine-Grained Causality in Climate Time Series Data for Forecasting and Anomaly Detection Dongqi Fu, Yada Zhu, Hanghang Tong, Kommy Weldemariam, Onkar Bhardwaj, Jingrui He

PDF OpenReview

Generating Potent Poisons and Backdoors from Scratch with Guided Diffusion Hossein Souri, Arpit Bansal, Hamid Kazemi, Liam H Fowl, Aniruddha Saha, Jonas Geiping, Andrew Gordon Wilson, Rama Chellappa, Tom Goldstein, Micah Goldblum

PDF OpenReview

Generation and Human-Expert Evaluation of Interesting Research Ideas Using Knowledge Graphs and Large Language Models Xuemei Gu, Mario Krenn

PDF OpenReview

Generation Constraint Scaling Can Mitigate Hallucination Georgios Kollias, Payel Das, Subhajit Chaudhury

PDF OpenReview

Generative Acceleration of Molecular Dynamics Simulations for Solid-State Electrolytes Juno Nam, Sulin Liu, Gavin Winter, Rafael Gomez-Bombarelli

PDF OpenReview

Generative Autoencoding of Dropout Patterns Shunta Maeda

PDF OpenReview

Generative Classifiers Avoid Shortcut Solutions Alexander Cong Li, Ananya Kumar, Deepak Pathak

PDF OpenReview

Generative Design of Decision Tree Policies for Reinforcement Learning Jacob Pettit, Chak Shing Lee, Jiachen Yang, Alex Ho, Daniel Faissol, Brenden K. Petersen, Mikel Landajuela

PDF OpenReview

Generative Fractional Diffusion Models Gabriel Nobis, Maximilian Springenberg, Marco Aversa, Michael Detzel, Rembert Daems, Roderick Murray-Smith, Shinichi Nakajima, Sebastian Lapuschkin, Stefano Ermon, Tolga Birdal, Manfred Opper, Christoph Knochenhauer, Luis Oala, Wojciech Samek

PDF OpenReview

Generative Model for Small Molecules with Latent Space RL Fine-Tuning to Protein Targets Ulrich Armel Mbou Sob, Qiulin Li, Miguel Arbesú, Oliver Bent, Andries Petrus Smit, Arnu Pretorius

PDF OpenReview

Generative Modeling of Molecular Dynamics Trajectories Bowen Jing, Hannes Stark, Tommi Jaakkola, Bonnie Berger

PDF OpenReview

Geometric Algebra Based Encoding for Graph Prompting Sotirios Panagiotis Chytas, Rudrasis Chakraborty, Vikas Singh

PDF OpenReview

Geometric Algebra Transformers for Large 3D Meshes via Cross-Attention Julian Suk, Pim De Haan, Baris Imre, Jelmer M. Wolterink

PDF OpenReview

Geometric Median Matching for Robust Data Pruning Anish Acharya, Inderjit S Dhillon, Sujay Sanghavi

PDF OpenReview

Geometric Self-Supervised Pretraining on 3D Protein Structures Using Subgraphs Michail Chatzianastasis, George Dasoulas, Michalis Vazirgiannis

PDF OpenReview

Geometric Wireless Simulation with Equivariant Transformers Thomas Hehn, Markus Peschl, Tribhuvanesh Orekondy, Arash Behboodi, Johann Brehmer

PDF OpenReview

Geometry Aware Deep Learning for Integrated Closed-Shell and Open-Shell Systems Beom Seok Kang, Vignesh C Bhethanabotla, Mohammadamin Tavakoli, William Goddard, Anima Anandkumar

PDF OpenReview

Geometry Fidelity for Spherical Images Anders Christensen, Nooshin Mojab, Khushman Patel, Karan Ahuja, Zeynep Akata, Ole Winther, Mar Gonzalez-Franco, Andrea Colaco

PDF OpenReview

Geometry-Aware Autoencoders for Metric Learning and Generative Modeling on Data Manifolds Xingzhi Sun, Danqi Liao, Kincaid MacDonald, Yanlei Zhang, Guillaume Huguet, Guy Wolf, Ian Adelstein, Tim G. J. Rudner, Smita Krishnaswamy

PDF OpenReview

Geometry-Informed Neural Networks Arturs Berzins, Andreas Radler, Sebastian Sanokowski, Sepp Hochreiter, Johannes Brandstetter

PDF OpenReview

GeomVerse: A Systematic Evaluation of Large Models for Geometric Reasoning Mehran Kazemi, Hamidreza Alvari, Ankit Anand, Jialin Wu, Xi Chen, Radu Soricut

PDF OpenReview

Get It Cooperating: Enhancing Generative Agent Cooperation with Commitment Devices Feng Yan, Qitian Jason Hu, Nan Jiang, Xinyuan Sun

PDF OpenReview

Get Rich Quick: Exact Solutions Reveal How Unbalanced Initializations Promote Rapid Feature Learning Daniel Kunin, Allan Raventos, Clémentine Carla Juliette Dominé, Feng Chen, David Klindt, Andrew M Saxe, Surya Ganguli

PDF OpenReview

Getting More Juice Out of the SFT Data: Reward Learning from Human Demonstration Improves SFT for LLM Alignment Jiaxiang Li, Siliang Zeng, Hoi To Wai, Chenliang Li, Alfredo Garcia, Mingyi Hong

PDF OpenReview

GLAD: Improving Latent Graph Generative Modeling with Simple Quantization Van Khoa Nguyen, Yoann Boget, Frantzeska Lavda, Alexandros Kalousis

PDF OpenReview

Glauber Generative Model: Discrete Diffusion Models via Binary Classification Harshit Varma, Dheeraj Mysore Nagaraj, Karthikeyan Shanmugam

PDF OpenReview

GLAudio Listens to the Sound of the Graph Aurelio Sulser, Johann Wenckstern, Clara Kümpel

PDF OpenReview

Gone with the Bits: Benchmarking Bias in Facial Phenotype Degradation Under Low-Rate Neural Compression Tian Qiu, Arjun Nichani, Rasta Tadayon, Haewon Jeong

PDF OpenReview

GPT-HyperAgent: Scalable Uncertainty Estimation and Exploration for Foundation Model Decisions Yingru Li, Jiawei Xu, Zhi-Quan Luo

PDF OpenReview

GPTVQ: The Blessing of Dimensionality for LLM Quantization Mart Van Baalen, Andrey Kuzmin, Markus Nagel, Peter Couperus, Artem Bolshakov, Cedric Bastoul, Eric Mahurin, Tijmen Blankevoort, Paul Whatmough

PDF OpenReview

Gradient Descent Induces Alignment Between Weights and the Pre-Activation Tangents for Deep Non-Linear Networks Daniel Beaglehole, Ioannis Mitliagkas, Atish Agarwala

PDF OpenReview

Gradient Descent Robustly Learns the Intrinsic Dimension of Data in Training Convolutional Neural Networks Chenyang Zhang, Gao Peifeng, Difan Zou, Yuan Cao

PDF OpenReview

Gradient Descent with Polyak’s Momentum Finds Flatter Minima via Large Catapults Prin Phunyaphibarn, Junghyun Lee, Bohan Wang, Huishuai Zhang, Chulhee Yun

PDF OpenReview

Gradient Dissent in Language Model Training and Saturation Andrei Mircea, Ekaterina Lobacheva, Irina Rish

PDF OpenReview

Gradient-Based Discrete Sampling with Automatic Cyclical Scheduling Patrick Pynadath, Riddhiman Bhattacharya, Arun Narayanan Hariharan, Ruqi Zhang

PDF OpenReview

Graph Convolutional Networks for Learning Laplace-Beltrami Operators Yingying Wu, Roger Fu, Richard Peng, Qifeng Chen

PDF OpenReview

Graph Multi-Similarity Learning for Molecular Property Prediction Hao Xu, Zhengyang Zhou, Pengyu Hong

PDF OpenReview

Graph-Based Retriever Captures the Long Tail of Biomedical Knowledge Julien Delile, Srayanta Mukherjee, Anton Van Pamel, Leonid Zhukov

PDF OpenReview

Graph2Token: Make LLMs Understand Molecule Graphs Runze Wang, Mingqi Yang, Yanming Shen

PDF OpenReview

GraphBPE: Molecular Graphs Meet Byte-Pair Encoding Yuchen Shen, Barnabas Poczos

PDF OpenReview

GraphKAN: Graph Kolmogorov Arnold Network for Small Molecule-Protein Interaction Predictions Tashin Ahmed, Md Habibur Rahman Sifat

PDF OpenReview

Grappa - A Machine Learned Molecular Mechanics Force Field Leif Seute, Eric Hartmann, Jan Stuehmer, Frauke Gräter

PDF OpenReview

GRASS: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients Aashiq Muhamed, Oscar Li, David Woodruff, Mona T. Diab, Virginia Smith

PDF OpenReview

GROD: Enhancing Generalization of Transformer with Out-of-Distribution Detection Yijin Zhou, Yu Guang Wang

PDF OpenReview

Grokked Transformers Are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization Boshi Wang, Xiang Yue, Yu Su, Huan Sun

PDF OpenReview

Grokking and the Geometry of Circuit Formation Ahmed Imtiaz Humayun, Randall Balestriero, Richard Baraniuk

PDF OpenReview

Grokking, Rank Minimization and Generalization in Deep Learning David Yunis, Kumar Kshitij Patel, Samuel Wheeler, Pedro Henrique Pamplona Savarese, Gal Vardi, Karen Livescu, Michael Maire, Matthew Walter

PDF OpenReview

GROOT-1.5: Learning to Follow Multi-Modal Instructions from Weak Supervision Shaofei Cai, Bowei Zhang, Zihao Wang, Xiaojian Ma, Anji Liu, Yitao Liang

PDF OpenReview

Growing Q-Networks: Solving Continuous Control Tasks with Adaptive Control Resolution Tim Seyde, Peter Werner, Wilko Schwarting, Markus Wulfmeier, Daniela Rus

PDF OpenReview

Hallmarks of Optimization Trajectories in Neural Networks and LLMs: Directional Exploration and Redundancy Sidak Pal Singh, Bobby He, Thomas Hofmann, Bernhard Schölkopf

PDF OpenReview

Handling Delay in Reinforcement Learning Caused by Parallel Computations of Neurons Ivan Anokhin, Rishav Rishav, Stephen Chung, Irina Rish, Samira Ebrahimi Kahou

PDF OpenReview

Hardware-Efficient Quantization for Green Custom Foundation Models Toshiaki Koike-Akino, Chang Meng, Volkan Cevher, Giovanni De Micheli

PDF OpenReview

Have Faith in Faithfulness: Going Beyond Circuit Overlap When Finding Model Mechanisms Michael Hanna, Sandro Pezzelle, Yonatan Belinkov

PDF OpenReview

Heterogeneous Federated Zeroth-Order Optimization Using Gradient Surrogates Yao Shu, Xiaoqiang Lin, Zhongxiang Dai, Bryan Kian Hsiang Low

PDF OpenReview

Hidden Learning Dynamics of Capability Before Behavior in Diffusion Models Core Francisco Park, Maya Okawa, Andrew Lee, Ekdeep Singh Lubana, Hidenori Tanaka

PDF OpenReview

Hierarchical Contrastive Learning for Enzyme Function Prediction Soorin Yim, Doyeong Hwang, Kiyoung Kim, Sehui Han

PDF OpenReview

Hierarchical Reinforcement Learning and Model Predictive Control for Strategic Motion Planning in Autonomous Racing Rudolf Reiter, Jasper Hoffmann, Joschka Boedecker, Moritz Diehl

PDF OpenReview

Hierarchical State Space Models for Continuous Sequence-to-Sequence Modeling Raunaq Bhirangi, Chenyu Wang, Venkatesh Pattabiraman, Carmel Majidi, Abhinav Gupta, Tess Hellebrekers, Lerrel Pinto

PDF OpenReview

High-Resolution in Silico Painting with Generative Models Trang Le

PDF OpenReview

Higher Order and Self-Referential Evolution for Population-Based Methods Samuel Coward, Chris Lu, Alistair Letcher, Minqi Jiang, Jack Parker-Holder, Jakob Nicolaus Foerster

PDF OpenReview

HLSTransform: Energy-Efficient Llama 2 Inference on FPGAs via High Level Synthesis Darren Yan Key, Andy He, Mason Bulling, Andrew Chang, Skyler Shapiro, Everett Lee

PDF OpenReview

How Consensus-Based Optimization Can Be Interpreted as a Stochastic Relaxation of Gradient Descent Konstantin Riedl, Timo Klock, Carina Geldhauser, Massimo Fornasier

PDF OpenReview

How Do Llamas Process Multilingual Text? a Latent Exploration Through Activation Patching Clément Dumas, Veniamin Veselovsky, Giovanni Monea, Robert West, Chris Wendler

PDF OpenReview

How Do Nonlinear Transformers Acquire Generalization-Guaranteed CoT Ability? Hongkang Li, Meng Wang, Songtao Lu, Xiaodong Cui, Pin-Yu Chen

PDF OpenReview

How Do Nonlinear Transformers Acquire Generalization-Guaranteed CoT Ability? Hongkang Li, Meng Wang, Songtao Lu, Xiaodong Cui, Pin-Yu Chen

PDF OpenReview

How Do Transformers "Do" Physics? Investigating the Simple Harmonic Oscillator Subhash Kantamneni, Ziming Liu, Max Tegmark

PDF OpenReview

How Do Transformers Fill in the Blanks? a Case Study on Matrix Completion Pulkit Gopalani, Ekdeep Singh Lubana, Wei Hu

PDF OpenReview

How Do Transformers Fill in the Blanks? a Case Study on Matrix Completion Pulkit Gopalani, Ekdeep Singh Lubana, Wei Hu

PDF OpenReview

How Do Transformers Fill in the Blanks? a Case Study on Matrix Completion Pulkit Gopalani, Ekdeep Singh Lubana, Wei Hu

PDF OpenReview

How Does Return Distribution in Distributional Reinforcement Learning Help Optimization? Ke Sun, Bei Jiang, Linglong Kong

PDF OpenReview

How Transformers Learn Diverse Attention Correlations in Masked Vision Pretraining Yu Huang, Zixin Wen, Yuejie Chi, Yingbin Liang

PDF OpenReview

How Transformers Utilize Multi-Head Attention in In-Context Learning? a Case Study on Sparse Linear Regression Xingwu Chen, Lei Zhao, Difan Zou

PDF OpenReview

How Truncating Weights Improves Reasoning in Language Models Lei Chen, Joan Bruna, Alberto Bietti

PDF OpenReview

How Truncating Weights Improves Reasoning in Language Models Lei Chen, Joan Bruna, Alberto Bietti

PDF OpenReview

Humans Linguistically Align to Their Conversational Partners, and Language Models Should Too Rachel Ostrand, Sara E Berger

PDF OpenReview

Hummer: Towards Limited Competitive Preference Dataset Li Jiang, Yusen Wu, Junwu Xiong, Jingqing Ruan, Qingpei Guo, Zujie Wen, Jun Zhou, Xiaotie Deng

PDF OpenReview

Hummer: Towards Limited Competitive Preference Dataset Li Jiang, Yusen Wu, Junwu Xiong, Jingqing Ruan, Yichuan Ding, Qingpei Guo, Zujie Wen, Jun Zhou, Xiaotie Deng

PDF OpenReview

Hybrid Recurrent Models Support Emergent Descriptions for Hierarchical Planning and Control Poppy Collis, Ryan Singh, Paul Kinghorn, Christopher Buckley

PDF OpenReview

Hydragen: High-Throughput LLM Inference with Shared Prefixes Jordan Juravsky, Bradley Brown, Ryan Saul Ehrlich, Daniel Y Fu, Christopher Re, Azalia Mirhoseini

PDF OpenReview

Hyperspectral Unmixing for Raman Spectroscopy via Physics-Constrained Autoencoders Dimitar Georgiev, Álvaro Fernández-Galiana, Simon Vilms Pedersen, Georgios Papadopoulos, Ruoxiao Xie, Molly M. Stevens, Mauricio Barahona

PDF OpenReview

Hypothesis Testing the Circuit Hypothesis in LLMs Claudia Shi, Nicolas Beltran-Velez, Achille Nazaret, Carolina Zheng, Adrià Garriga-Alonso, Andrew Jesson, Maggie Makar, David Blei

PDF OpenReview

Identifiable Latent Bandits: Combining Observational Data and Exploration for Personalized Healthcare Ahmet Zahid Balcıoğlu, Emil Carlsson, Fredrik D. Johansson

PDF OpenReview

Identifying Biological Priors and Structure in Single-Cell Foundation Models Flavia Pedrocchi, Stefan Stark, Gunnar Ratsch, Amir Joudaki

PDF OpenReview

Identifying Functionally Important Features with End-to-End Sparse Dictionary Learning Dan Braun, Jordan Taylor, Nicholas Goldowsky-Dill, Lee Sharkey

PDF OpenReview

Identifying Latent State Transition in Non-Linear Dynamical Systems Çağlar Hızlı, Çagatay Yildiz, Matthias Bethge, S. T. John, Pekka Marttinen

PDF OpenReview

Impact4Cast: Forecasting High-Impact Research Topics via Machine Learning on Evolving Knowledge Graphs Xuemei Gu, Mario Krenn

PDF OpenReview

Implementability of Information Elicitation Mechanisms with Pre-Trained Language Models Zachary Robertson, Hannah Cha, Andrew Sheha, Sanmi Koyejo

PDF OpenReview

Implicit Diffusion: Efficient Optimization Through Stochastic Sampling Pierre Marion, Anna Korba, Peter Bartlett, Mathieu Blondel, Valentin De Bortoli, Arnaud Doucet, Felipe Llinares-López, Courtney Paquette, Quentin Berthet

PDF OpenReview

Implicit Optimization Bias of Next-Token Prediction in Linear Models Christos Thrampoulidis

PDF OpenReview

Implicit Optimization Bias of Next-Token Prediction in Linear Models Christos Thrampoulidis

PDF OpenReview

Implicit Regularization of Sharpness-Aware Minimization for Scale-Invariant Problems Bingcong Li, Liang Zhang, Niao He

PDF OpenReview

ImportanceWeighted Multi-Draft Speculative Sampling Ashish J Khisti, Arash Behravesh, Hassan Dbouk, Arash Behboodi, Roland Memisevic, Christos Louizos

PDF OpenReview

Improve Temporal Awareness of LLMs for Domain-General Sequential Recommendation Zhendong Chu, Zichao Wang, Ruiyi Zhang, Yangfeng Ji, Hongning Wang, Tong Sun

PDF OpenReview

Improved Algorithms for Adversarial Bandits with Unbounded Losses Mingyu Chen, Xuezhou Zhang

PDF OpenReview

Improved Algorithms for Contextual Dynamic Pricing Matilde Tullii, Solenne Gaucher, Nadav Merlis, Vianney Perchet

PDF OpenReview

Improved Algorithms for Kernel Matrix-Vector Multiplication Piotr Indyk, Michael Kapralov, Kshiteej Sheth, Tal Wagner

PDF OpenReview

Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses Xiaosen Zheng, Tianyu Pang, Chao Du, Qian Liu, Jing Jiang, Min Lin

PDF OpenReview

Improving AlphaFlow for Efficient Protein Ensembles Generation Shaoning Li, Mingyu Li, Yusong Wang, Xinheng He, Zhang Jian, Nanning Zheng, Pheng-Ann Heng

PDF OpenReview

Improving Consistency Models with Generator-Induced Coupling Thibaut Issenhuth, Ludovic Dos Santos, Jean-Yves Franceschi, Alain Rakotomamonjy

PDF OpenReview

Improving Equivariant Networks with Probabilistic Symmetry Breaking Hannah Lawrence, Vasco Portilheiro, Yan Zhang, Sékou-Oumar Kaba

PDF OpenReview

Improving Flow Matching for Posterior Inference with Physics-Based Controls Benjamin Holzschuh, Nils Thuerey

PDF OpenReview

Improving Fragment-Based Deep Molecular Generative Models Panukorn Taleongpong, Brooks Paige

PDF OpenReview

Improving GFlowNets for Text-to-Image Diffusion Alignment Dinghuai Zhang, Yizhe Zhang, Jiatao Gu, Ruixiang Zhang, Joshua M. Susskind, Navdeep Jaitly, Shuangfei Zhai

PDF OpenReview

Improving GFlowNets for Text-to-Image Diffusion Alignment Dinghuai Zhang, Yizhe Zhang, Jiatao Gu, Ruixiang Zhang, Joshua M. Susskind, Navdeep Jaitly, Shuangfei Zhai

PDF OpenReview

Improving GFlowNets with Monte Carlo Tree Search Nikita Morozov, Daniil Tiapkin, Sergey Samsonov, Alexey Naumov, Dmitry Vetrov

PDF OpenReview

Improving Graph-Language Alignment with Hierarchical Graph Tokenization Yongqiang Chen, Quanming Yao, Juzheng Zhang, James Cheng, Yatao Bian

PDF OpenReview

Improving Molecular Modeling with Geometric GNNs: An Empirical Study Ali Ramlaoui, Théo Saulus, Basile Terver, Victor Schmidt, David Rolnick, Fragkiskos D. Malliaros, Alexandre AGM Duval

PDF OpenReview

Improving Performance Prediction of Electrolyte Formulations with Transformer-Based Molecular Representation Model Indra Priyadarsini, Vidushi Sharma, Seiji Takeda, Akihiro Kishimoto, Lisa Hamada, Hajime Shinohara

PDF OpenReview

Improving Route Development Using Convergent Retrosynthesis Planning Paula Torren-Peraire, Jonas Verhoeven, Dorota Herman, Hugo Ceulemans, Igor V. Tetko, Jörg K. Wegner

PDF OpenReview

Improving Self Consistency in LLMs Through Probabilistic Tokenization Ashutosh Sathe, Divyanshu Aggarwal, Sunayana Sitaram

PDF OpenReview

Improving Sparse Decomposition of Language Model Activations with Gated Sparse Autoencoders Senthooran Rajamanoharan, Arthur Conmy, Lewis Smith, Tom Lieberum, Vikrant Varma, Janos Kramar, Rohin Shah, Neel Nanda

PDF OpenReview

Improving the Accuracy of Coarse-Grained Partial Differential Equations with Grid-Based Reinforcement Learning Jan-Philipp von Bassewitz, Sebastian Kaltenbach, Petros Koumoutsakos

PDF OpenReview

Improving the Efficiency of Self-Supervised Adversarial Training Through Latent Clustering-Based Selection Somrita Ghosh, Yuelin Xu, Xiao Zhang

PDF OpenReview

In Defense of Structural Sparse Adapters for Concurrent LLM Serving Junda Su, Zirui Liu, Zeju Qiu, Weiyang Liu, Zhaozhuo Xu

PDF OpenReview

In Search for Architectures and Loss Functions in Multi-Objective Reinforcement Learning Mikhail Terekhov, Caglar Gulcehre

PDF OpenReview

In Search of Forgotten Domain Generalization Prasanna Mayilvahanan, Roland S. Zimmermann, Thaddäus Wiedemer, Evgenia Rusak, Attila Juhos, Matthias Bethge, Wieland Brendel

PDF OpenReview

In-Context Generalization to New Tasks from Unlabeled Observation Data Anthony Liang, Pavel Czempin, Yutai Zhou, Stephen Tu, Erdem Biyik

PDF OpenReview

In-Context Learning from Training on Unstructured Data: The Role of Co-Occurrence, Positional Information, and Training Data Structure Kevin Christian Wibisono, Yixin Wang

PDF OpenReview

In-Context Learning from Training on Unstructured Data: The Role of Co-Occurrence, Positional Information, and Training Data Structure Kevin Christian Wibisono, Yixin Wang

PDF OpenReview

In-Context Learning Improves Compositional Understanding of Vision-Language Models Matteo Nulli, Anesa Ibrahimi, Avik Pal, Hoshe Lee, Ivona Najdenkoska

PDF OpenReview

In-Context Learning in Presence of Spurious Correlations Hrayr Harutyunyan, Rafayel Darbinyan, Samvel Karapetyan, Hrant Khachatrian

PDF OpenReview

In-Context Learning May Not Elicit Trustworthy Reasoning: A-Not-B Errors in Pretrained Language Models Pengrui Han, Peiyang Song, Haofei Yu, Jiaxuan You

PDF OpenReview

In-Context Learning of Energy Functions Rylan Schaeffer, Mikail Khona, Sanmi Koyejo

PDF OpenReview

In-Context Learning with Long-Context Models: An In-Depth Exploration Amanda Bertsch, Maor Ivgi, Uri Alon, Jonathan Berant, Matthew R. Gormley, Graham Neubig

PDF OpenReview

In-Context Learning with Representations: Contextual Generalization of Trained Transformers Tong Yang, Yu Huang, Yingbin Liang, Yuejie Chi

PDF OpenReview

In-Context Learning with Topological Information for LLM-Based Knowledge Graph Completion Udari Madhushani Sehwag, Kassiani Papasotiriou, Jared Vann, Sumitra Ganesh

PDF OpenReview

In-Context Learning, Can It Break Safety? Sophie Xhonneux, David Dobre, Michael Noukhovitch, Jian Tang, Gauthier Gidel, Dhanya Sridhar

PDF OpenReview

In-Context Principle Learning from Mistakes Tianjun Zhang, Aman Madaan, Luyu Gao, Steven Zhang, Swaroop Mishra, Yiming Yang, Niket Tandon, Uri Alon

PDF OpenReview

In-Context Reinforcement Learning Without Optimal Action Labels Juncheng Dong, Moyang Guo, Ethan X Fang, Zhuoran Yang, Vahid Tarokh

PDF OpenReview

In-Context Symmetries: Self-Supervised Learning Through Contextual World Models Sharut Gupta, Chenyu Wang, Yifei Wang, Tommi Jaakkola, Stefanie Jegelka

PDF OpenReview

Incorporating Stability into Flow Matching Christopher Iliffe Sprague, Arne Elofsson, Hossein Azizpour

PDF OpenReview

Inference Performance Optimization for Large Language Models on CPUs Pujiang He, Shan Zhou, Wenhuan Huang, Changqing Li, Duyi Wang, Bin Guo, Chen Meng, Sheng Gui, Weifei Yu, Yi Xie

PDF OpenReview

Inferring Physiological Properties of Motor Neurons Using Neural Posterior Estimation Pranav Mamidanna, Dario Farina

PDF OpenReview

InfLLM: Training-Free Long-Context Extrapolation for LLMs with an Efficient Context Memory Chaojun Xiao, Pengle Zhang, Xu Han, Guangxuan Xiao, Yankai Lin, Zhengyan Zhang, Zhiyuan Liu, Maosong Sun

PDF OpenReview

InfoNCE: Identifying the Gap Between Theory and Practice Evgenia Rusak, Patrik Reizinger, Attila Juhos, Oliver Bringmann, Roland S. Zimmermann, Wieland Brendel

PDF OpenReview

InfoNCE: Identifying the Gap Between Theory and Practice Evgenia Rusak, Patrik Reizinger, Attila Juhos, Oliver Bringmann, Roland S. Zimmermann, Wieland Brendel

PDF OpenReview

Information Theoretic Guarantees for Policy Alignment in Large Language Models Youssef Mroueh

PDF OpenReview

Information-Theoretic Progress Measures Reveal Grokking Is an Emergent Phase Transition Kenzo Clauw, Daniele Marinazzo, Sebastiano Stramaglia

PDF OpenReview

Informed Meta-Learning Kasia Kobalczyk, Mihaela van der Schaar

PDF OpenReview

Informed Meta-Learning Kasia Kobalczyk, Mihaela van der Schaar

PDF OpenReview

Injecting Hierarchical Biological Priors into Graph Neural Networks for Flow Cytometry Prediction Fatemeh Nassajian Mojarrad, Lorenzo Bini, Thomas Matthes, Stephane Marchand-Maillet

PDF OpenReview

Inpainting Crystal Structure Generations with Score-Based Denoising Xinzhe Dai, Peichen Zhong, Bowen Deng, Yifan Chen, Gerbrand Ceder

PDF OpenReview

Inpainting Galaxy Counts onto N-Body Simulations over Multiple Cosmologies and Astrophysics Antoine Bourdin, Ronan Legin, Matthew Ho, Alexandre Adam, Yashar Hezaveh, Laurence Perreault-Levasseur

PDF OpenReview

InstructBooth: Instruction-Following Personalized Text-to-Image Generation Daewon Chae, Nokyung Park, Jinkyu Kim, Kimin Lee

PDF OpenReview

Instruction Tuning with Loss over Instructions Zhengyan Shi, Adam X. Yang, Bin Wu, Laurence Aitchison, Emine Yilmaz, Aldo Lipani

PDF OpenReview

Instruction-Guided Visual Masking Jinliang Zheng, Jianxiong Li, Sijie Cheng, Yinan Zheng, Jiaming Li, Jihao Liu, Yu Liu, Jingjing Liu, Xianyuan Zhan

PDF OpenReview

Integrating Chemistry Knowledge in Large Language Models via Prompt Engineering Hongxuan Liu, Haoyu Yin, Zhiyao Luo, Xiaonan Wang

PDF OpenReview

Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models Cong Lu, Shengran Hu, Jeff Clune

PDF OpenReview

Interactome-Scale Comparison of Co-Immunoprecipitation and Yeast Two-Hybrid Assays for Protein Interaction Prediction Kapil Devkota, Lenore Cowen, Rohit Singh

PDF OpenReview

InterpBench: Semi-Synthetic Transformers for Evaluating Mechanistic Interpretability Techniques Rohan Gupta, Iván Arcuschin, Thomas Kwa, Adrià Garriga-Alonso

PDF OpenReview

Interpolated-MLPs: Controllable Inductive Bias Sean Wu, Jordan Hong, Keybai, Gregor Bachmann

PDF OpenReview

Interpretability Analysis on a Pathology Foundation Model Reveals Biologically Relevant Embeddings Across Modalities Nhat Le, Ciyue Shen, Chintan Shah, Blake Martin, Daniel Shenker, Harshith Padigela, Jennifer A. Hipp, Sean Grullon, John Abel, Harsha Vardhan Pokkalla, Dinkar Juyal

PDF OpenReview

Interpretability in Action: Exploratory Analysis of VPT, a Minecraft Agent Karolis Jucys, George Adamopoulos, Mehrab Hamidi, Stephanie Milani, Mohammad Reza Samsami, Artem Zholus, Sonia Joseph, Blake Aaron Richards, Irina Rish, Özgür Şimşek

PDF OpenReview

Interpreting Attention Layer Outputs with Sparse Autoencoders Connor Kissane, Robert Krzyzanowski, Joseph Isaac Bloom, Arthur Conmy, Neel Nanda

PDF OpenReview

Inverse Reinforcement Learning from Demonstrations for LLM Alignment Hao Sun, Mihaela van der Schaar

PDF OpenReview

InversionView: A General-Purpose Method for Reading Information from Neural Activations Xinting Huang, Madhur Panwar, Navin Goyal, Michael Hahn

PDF OpenReview

Invertible Temper Modeling Using Normalizing Flows and the Effects of Structure Preserving Loss Tegan Emerson, Henry Kvinge, Keerti Sahithi Kappagantula, Sylvia Howland

PDF OpenReview

Investigating Generalization Behaviours of Generative Flow Networks Lazar Atanackovic, Emmanuel Bengio

PDF OpenReview

Investigating the Indirect Object Identification Circuit in Mamba Danielle Ensign, Adrià Garriga-Alonso

PDF OpenReview

Investigating the Interpretability of Biometric Face Templates Using Gated Sparse Autoencoders and Differentiable Image Parametrizations Peter Rot, Klemen Grm

PDF OpenReview

Is a Good Description Worth a Thousand Pictures? Reducing Multimodal Alignment to Text-Based, Unimodal Alignment Amin Memarian, Touraj Laleh, Irina Rish, Ardavan S. Nobandegani

PDF OpenReview

Is ChatGPT Transforming Academics' Writing Style? Mingmeng Geng, Roberto Trotta

PDF OpenReview

Is Model Collapse Inevitable? Breaking the Curse of Recursion by Accumulating Real and Synthetic Data Matthias Gerstgrasser, Rylan Schaeffer, Apratim Dey, Rafael Rafailov, Tomasz Korbak, Henry Sleight, Rajashree Agrawal, John Hughes, Dhruv Bhandarkar Pai, Andrey Gromov, Dan Roberts, Diyi Yang, David L. Donoho, Sanmi Koyejo

PDF OpenReview

Is My Data Safe? Predicting Instance-Level Membership Inference Success for White-Box and Black-Box Attacks Tobias Leemann, Bardh Prenkaj, Gjergji Kasneci

PDF OpenReview

Is Persona Enough for Personality? Using ChatGPT to Reconstruct an Agent's Latent Personality from Simple Descriptions Yongyi Ji, Zhisheng Tang, Mayank Kejriwal

PDF OpenReview

Is Poisoning a Real Threat to LLM Alignment? Maybe More so than You Think Pankayaraj Pathmanathan, Souradip Chakraborty, Xiangyu Liu, Yongyuan Liang, Furong Huang

PDF OpenReview

Is Self-Knowledge and Action Consistent or Not: Investigating Large Language Model's Personality Yiming Ai, Zhiwei He, Ziyin Zhang, Wenhong Zhu, Hongkun Hao, Kai Yu, Lingjun Chen, Rui Wang

PDF OpenReview

Is Transformer a Stochastic Parrot? a Case Study in Simple Arithmetic Task Peixu Wang, Chen Yu, Yu Ming

PDF OpenReview

Is Value Functions Estimation with Classification Plug-and-Play for Offline Reinforcement Learning? Denis Tarasov, Kirill Brilliantov, Dmitrii Kharlapenko

PDF OpenReview

Is Value Learning Really the Main Bottleneck in Offline RL? Seohong Park, Kevin Frans, Sergey Levine, Aviral Kumar

PDF OpenReview

It Takes Two: On the Seamlessness Between Reward and Policy Model in RLHF TaiMing Lu, Lingfeng Shen, Xinyu Yang, Weiting Tan, Beidi Chen, Huaxiu Yao

PDF OpenReview

Iteration Head: A Mechanistic Study of Chain-of-Thought Vivien Cabannes, Charles Arnal, Wassim Bouaziz, Xingyu Alice Yang, Francois Charton, Julia Kempe

PDF OpenReview

Iterative Sizing Field Prediction for Adaptive Mesh Generation from Expert Demonstrations Niklas Freymuth, Philipp Dahlinger, Tobias Würth, Philipp Becker, Aleksandar Taranovic, Onno Grönheim, Luise Kärger, Gerhard Neumann

PDF OpenReview

Iterative Theory of Mind Assay of Multimodal AI Models Rohini Elora Das, Rajarshi Das, Niharika Maity, Sreerupa Das

PDF OpenReview

iWISDM: Assessing Instruction Following in Multimodal Models at Scale Xiaoxuan Lei, Lucas Gomez, Hao Yuan Bai, Pouya Bashivan

PDF OpenReview

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent Quentin Gallouédec, Edward Emanuel Beeching, Clément Romac, Emmanuel Dellandrea

PDF OpenReview

Jafar: An Open-Source Genie Reimplemention in JAX Timon Willi, Matthew Thomas Jackson, Jakob Nicolaus Foerster

PDF OpenReview

JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models Patrick Chao, Edoardo Debenedetti, Alexander Robey, Maksym Andriushchenko, Francesco Croce, Vikash Sehwag, Edgar Dobriban, Nicolas Flammarion, George J. Pappas, Florian Tramèr, Hamed Hassani, Eric Wong

PDF OpenReview

Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks Maksym Andriushchenko, Francesco Croce, Nicolas Flammarion

PDF OpenReview

Janus: An Efficient and Expressive Subquadratic Architecture for Modeling Biological Sequences Krithik Ramesh, Sameed Muneeb Siddiqui, Michael Mitzenmacher, Pardis Sabeti

PDF OpenReview

Jina CLIP: Your CLIP Model Is Also Your Text Retriever Han Xiao, Georgios Mastrapas, Bo Wang

PDF OpenReview

Jogging the Memory of Unlearned Models Through Targeted Relearning Attacks Shengyuan Hu, Yiwei Fu, Steven Wu, Virginia Smith

PDF OpenReview

Joint Diffusion Processes as an Inductive Bias in Sheaf Neural Networks Ferran Hernandez Caralt, Guillermo Bernardez, Iulia Duta, Eduard Alarcon, Pietro Lio

PDF OpenReview

Just Read Twice: Closing the Recall Gap for Recurrent Language Models Simran Arora, Aman Timalsina, Aaryan Singhal, Sabri Eyuboglu, Xinyi Zhao, Ashish Rao, Atri Rudra, Christopher Re

PDF OpenReview

Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling Jiatao Gu, Ying Shen, Shuangfei Zhai, Yizhe Zhang, Navdeep Jaitly, Joshua M. Susskind

PDF OpenReview

KalMamba: Towards Efficient Probabilistic State Space Models for RL Under Uncertainty Philipp Becker, Niklas Freymuth, Gerhard Neumann

PDF OpenReview

Knowledge Graph Extraction from Total Synthesis Documents Andres M Bran, Zlatko Jončev, Philippe Schwaller

PDF OpenReview

Landscaping Linear Mode Connectivity Sidak Pal Singh, Linara Adilova, Michael Kamp, Asja Fischer, Bernhard Schölkopf, Thomas Hofmann

PDF OpenReview

Language Adaptation on a Tight Academic Compute Budget: Tokenizer Swapping Works and Pure Bfloat16 Is Enough Konstantin Dobler, Gerard de Melo

PDF OpenReview

Language Alignment via Nash-Learning and Adaptive Feedback Ari Azarafrooz, Farshid Faal

PDF OpenReview

Language Model-in-the-Loop: Data Optimal Approach to Recommend Actions in Text Games Arjun V Sudhakar, Prasanna Parthasarathi, Janarthanan Rajendran, Sarath Chandar

PDF OpenReview

Language Models Linearly Represent Sentiment Curt Tigges, Oskar John Hollinsworth, Atticus Geiger, Neel Nanda

PDF OpenReview

Large Language Models Are Bad Game Theoretic Reasoners: Evaluating Performance and Bias in Two-Player Non-Zero-Sum Games Nathan Herr, Fernando Acero, Roberta Raileanu, Maria Perez-Ortiz, Zhibin Li

PDF OpenReview

Large Language Models Are Frame-Level Directors for Zero-Shot Text-to-Video Generation Susung Hong, Junyoung Seo, Heeseong Shin, Sunghwan Hong, Seungryong Kim

PDF OpenReview

Large Language Models Are Not Inverse Thinkers Quite yet Haoran Zhao

PDF OpenReview

Large Language Models as Misleading Assistants in Conversation Betty Li Hou, Kejian Shi, Jason Phang, James Aung, Steven Adler, Rosie Campbell

PDF OpenReview

Large Language Models Can Self-Correct with Minimal Effort Zhenyu Wu, Qingkai Zeng, Zhihan Zhang, Zhaoxuan Tan, Chao Shen, Meng Jiang

PDF OpenReview

Large Language Models for Automated Open-Domain Scientific Hypotheses Discovery Zonglin Yang, Xinya Du, Junxian Li, Jie Zheng, Soujanya Poria, Erik Cambria

PDF OpenReview

Large Language Models Lack Understanding of Character Composition of Words Andrew Shin, Kunitake Kaneko

PDF OpenReview

Large-Scale Discovery of Experimental Designs in Super-Resolution Microscopy with XLuminA Carla Rodríguez, Sören Arlt, Leonhard Möckl, Mario Krenn

PDF OpenReview

Latent Functional Maps Marco Fumero, Marco Pegoraro, Valentino Maiorca, Francesco Locatello, Emanuele Rodolà

PDF OpenReview

Latent Functional Maps Marco Fumero, Marco Pegoraro, Valentino Maiorca, Francesco Locatello, Emanuele Rodolà

PDF OpenReview

Latent-Guided Equivariant Diffusion for Controlled Structure-Based De Novo Ligand Generation Tuan Le, Julian Cremer, Djork-Arné Clevert, Kristof T Schütt

PDF OpenReview

LAuReL: Learned Augmented Residual Layer Gaurav Menghani, Ravi Kumar, Sanjiv Kumar

PDF OpenReview

LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference Qichen Fu, Minsik Cho, Thomas Merth, Sachin Mehta, Mohammad Rastegari, Mahyar Najibi

PDF OpenReview

Lean4trace: Data Augmentation for Neural Theorem Proving in Lean Vasilii Nesterov, Yermek Kapushev, Mikhail Burtsev

PDF OpenReview

Learnability of Parameter-Bounded Bayes Nets Arnab Bhattacharyya, Davin Choo, Sutanu Gayen, Dimitrios Myrisiotis

PDF OpenReview

Learned Best-Effort LLM Serving Siddharth Jha, Coleman Richard Charles Hooper, Xiaoxuan Liu, Sehoon Kim, Kurt Keutzer

PDF OpenReview

Learning and Unlearning of Fabricated Knowledge in Language Models Chen Sun, Nolan Andrew Miller, Andrey Zhmoginov, Max Vladymyrov, Mark Sandler

PDF OpenReview

Learning Cure Kinetics of Frontal Polymerization PDEs Using Differentiable Simulations Pengfei Cai, Qibang Liu, Philippe Geubelle, Rafael Gomez-Bombarelli

PDF OpenReview

Learning Diffeomorphic Lyapunov Functions from Data Samuel Tesfazgi, Leonhard Sprandl, Sandra Hirche

PDF OpenReview

Learning Efficient Recursive Numeral Systems via Reinforcement Learning Jonathan David Thomas, Andrea Silvi, Devdatt Dubhashi, Emil Carlsson, Moa Johansson

PDF OpenReview

Learning Fast and Slow: Representations for In-Context Weight Modulation Andrey Zhmoginov, Jihwan Lee, Max Vladymyrov, Mark Sandler

PDF OpenReview

Learning Generative Population Models from Multiple Clinical Datasets via Probabilistic Programming João Loula, Katherine M. Collins, Ulrich Schaechtle, Joshua B. Tenenbaum, Adrian Weller, Feras Saad, Timothy J. O'Donnell, Vikash Mansinghka

PDF OpenReview

Learning High-Dimensional Mixed Models via Amortized Variational Inference Priscilla Ong, Manuel Haussmann, Harri Lähdesmäki

PDF OpenReview

Learning HJB Viscosity Solutions with PINNs for Continuous-Time Reinforcement Learning Alena Shilova, Thomas Delliaux, Philippe Preux, Bruno Raffin

PDF OpenReview

Learning In-Context Decision Making with Synthetic MDPs Akarsh Kumar, Chris Lu, Louis Kirsch, Phillip Isola

PDF OpenReview

Learning Latent Graph Structures and Their Uncertainty Alessandro Manenti, Daniele Zambon, Cesare Alippi

PDF OpenReview

Learning Long Timescale in Molecular Dynamics by Nano-GPT Yuan Yao, Wenqi Zeng

PDF OpenReview

Learning Multi-Index Models with Neural Networks via Mean-Field Langevin Dynamics Alireza Mousavi-Hosseini, Denny Wu, Murat A Erdogdu

PDF OpenReview

Learning Nash Equilibria in Zero-Sum Markov Games: A Single-Timescale Algorithm Under Weak Reachability Reda Ouhamma, Maryam Kamgarpour

PDF OpenReview

Learning Sequence Models Through Consolidation Eleanor Spens, Neil Burgess

PDF OpenReview

Learning Set Functions with Implicit Differentiation Gözde Özcan, Chengzhi Shi, Stratis Ioannidis

PDF OpenReview

Learning Stable Allocations of Strictly Convex Stochastic Cooperative Games Nam Phuong Tran, The-Anh Ta, Shuqing Shi, Debmalya Mandal, Yali Du, Long Tran-Thanh

PDF OpenReview

Learning Symmetries via Weight-Sharing with Doubly Stochastic Tensors Putri A Van der Linden, Alejandro García Castellanos, Sharvaree Vadgama, Thijs P. Kuipers, Erik J Bekkers

PDF OpenReview

Learning Syntax Without Planting Trees: Understanding When and Why Transformers Generalize Hierarchically Kabir Ahuja, Vidhisha Balachandran, Madhur Panwar, Tianxing He, Noah A. Smith, Navin Goyal, Yulia Tsvetkov

PDF OpenReview

Learning Task Representations from In-Context Learning Baturay Saglam, Zhuoran Yang, Dionysis Kalogerias, Amin Karbasi

PDF OpenReview

Learning the Boundary-to-Domain Mapping Using Lifting Product Fourier Neural Operators for Partial Differential Equations Aditya Kashi, Arka Daw, Muralikrishnan Gopalakrishnan Meena, Hao Lu

PDF OpenReview

Learning the Eye of the Beholder: Statistical Modeling and Estimation for Personalized Color Perception Xuanzhou Chen, Austin Xu, Jingyan Wang, Ashwin Pananjady

PDF OpenReview

Learning to Assist Humans Without Inferring Rewards Vivek Myers, Evan Ellis, Benjamin Eysenbach, Sergey Levine, Anca Dragan

PDF OpenReview

Learning to Design Data-Structures: A Case Study of Nearest Neighbor Search Omar Salemohamed, Vatsal Sharan, Shivam Garg, Laurent Charlin, Gregory Valiant

PDF OpenReview

Learning to Explore with Lagrangians for Bandits Under Unknown Constraints Udvas Das, Debabrota Basu

PDF OpenReview

Learning to Grok: Emergence of In-Context Learning and Skill Composition in Modular Arithmetic Tasks Tianyu He, Darshil Doshi, Aritra Das, Andrey Gromov

PDF OpenReview

Learning to Reason by Failing: Offline RL on Sub-Optimal Rollouts Scales Synthetic Data by 8x Amrith Setlur, Saurabh Garg, Xinyang Geng, Naman Garg, Virginia Smith, Aviral Kumar

PDF OpenReview

Learning to Reduce: Towards Improving Performance of Large Language Models on Structured Data Younghun Lee, Sungchul Kim, Ryan A. Rossi, Tong Yu, Xiang Chen

PDF OpenReview

Learning to Steer Markovian Agents Under Model Uncertainty Jiawei Huang, Vinzenz Thoma, Zebang Shen, Heinrich H. Nax, Niao He

PDF OpenReview

Learning When to Trust the Expert for Guided Exploration in RL Felix Schulz, Jasper Hoffmann, Yuan Zhang, Joschka Boedecker

PDF OpenReview

LEGENT: Open Platform for Embodied Agents Zhili Cheng, Jinyi Hu, Zhitong Wang, Yuge Tu, Shengding Hu, An Liu, Pengkai Li, Lei Shi, Zhiyuan Liu, Maosong Sun

PDF OpenReview

Leveraging Generative Foundation Models for Domain Generalization Sobhan Hemati, Mahdi Beitollahi, Amir Hossein Estiri, Bassel Al Omari, Xi Chen, Guojun Zhang

PDF OpenReview

Leveraging Multi-Color Spaces as a Defense Mechanism Against Model Inversion Attack Sofiane Ouaari, Ali Burak Ünal, Mete Akgün, Nico Pfeifer

PDF OpenReview

Leveraging Topological Guidance for Improved Knowledge Distillation Eun Som Jeon, Rahul Khurana, Aishani Pathak, Pavan K. Turaga

PDF OpenReview

Lift Your Molecules: Molecular Graph Generation in Latent Euclidean Space Mohamed Amine Ketata, Nicholas Gao, Johanna Sommer, Tom Wollschläger, Stephan Günnemann

PDF OpenReview

Lifted Residual Score Estimation Tejas Jayashankar, Jongha Jon Ryu, Xiangxiang Xu, Gregory W. Wornell

PDF OpenReview

LIFTED: Multimodal Mixture-of-Experts for Clinical Trial Outcome Prediction Wenhao Zheng, Dongshen Peng, Hongxia Xu, Yun Li, Hongtu Zhu, Tianfan Fu, Huaxiu Yao

PDF OpenReview

Likelihood-Based Fine-Tuning of Protein Language Models for Few-Shot Fitness Prediction and Design Alex Hawkins-Hooker, Jakub Kmec, Oliver Bent, Paul Duckworth

PDF OpenReview

Likelihood-Based Fine-Tuning of Protein Language Models for Few-Shot Fitness Prediction and Design Alex Hawkins-Hooker, Jakub Kmec, Oliver Bent, Paul Duckworth

PDF OpenReview

Limitations of scRNA-Seq Zero-Imputation Methods for Network Inference Ankit Bhardwaj, Joshua Weiner, Preetha Balasubramanian, Lakshmi Subramanian

PDF OpenReview

Linear Transformers Are Versatile In-Context Learners Max Vladymyrov, Johannes von Oswald, Mark Sandler, Rong Ge

PDF OpenReview

Linear Weight Interpolation Leads to Transient Performance Gains Gaurav Iyer, Gintare Karolina Dziugaite, David Rolnick

PDF OpenReview

Liouna: Biologically Plausible Learning for Efficient Pre-Training of Transferrable Deep Models Fady Rezk, Antreas Antoniou, Henry Gouk, Timothy Hospedales

PDF OpenReview

LLM Circuit Analyses Are Consistent Across Training and Scale Curt Tigges, Michael Hanna, Qinan Yu, Stella Biderman

PDF OpenReview

LLM Processes: Numerical Predictive Distributions Conditioned on Natural Language James Requeima, John F Bronskill, Dami Choi, Richard E. Turner, David Duvenaud

PDF OpenReview

LLM Sample: Part Average and Part Ideal Sarath Sivaprasad, Pramod Kaushik, Sahar Abdelnabi, Mario Fritz

PDF OpenReview

LLM Task Interference: Impact of Task-Switch in Conversational History Akash Gupta, Ivaxi Sheth, Vyas Raina, Mark Gales, Mario Fritz

PDF OpenReview

LLM-Informed Discrete Prompt Optimization Zeeshan Memon, Muhammad Arham, Adnan Ul-Hasan, Faisal Shafait

PDF OpenReview

LLM3: Large Language Model-Based Task and Motion Planning with Motion Failure Reasoning Shu Wang, Muzhi Han, Ziyuan Jiao, Zeyu Zhang, Ying Nian Wu, Song-Chun Zhu, Hangxin Liu

PDF OpenReview

LLMs at the Bargaining Table Yuan Deng, Vahab Mirrokni, Renato Paes Leme, Hanrui Zhang, Song Zuo

PDF OpenReview

LLMs Learn Governing Principles of Dynamical Systems, Revealing an In-Context Neural Scaling Law Toni J.B. Liu, Nicolas Boulle, Raphaël Sarfati, Christopher Earls

PDF OpenReview

Local Lateral Connectivity Is Sufficient for Replicating Cortex-like Topographical Organization in Deep Neural Networks Xinyu Qian, Amirozhan Dehghani, Asa Borzabadifarahani, Pouya Bashivan

PDF OpenReview

Local to Global: Learning Dynamics and Effect of Initialization for Transformers Ashok Vardhan Makkuva, Marco Bondaschi, Chanakya Ekbote, Adway Girish, Alliot Nagle, Hyeji Kim, Michael Gastpar

PDF OpenReview

Localized Zeroth-Order Prompt Optimization Wenyang Hu, Yao Shu, Zongmin Yu, Zhaoxuan Wu, Xiaoqiang Lin, Zhongxiang Dai, See-Kiong Ng, Bryan Kian Hsiang Low

PDF OpenReview

Localizing Auditory Concepts in CNNs Pratyaksh Gautam, Makarand Tapaswi, Vinoo Alluri

PDF OpenReview

Locally Interdependent Multi-Agent MDP: Theoretical Framework for Decentralized Agents with Dynamic Dependencies Alex DeWeese, Guannan Qu

PDF OpenReview

Logical Distillation of Graph Neural Networks Alexander Pluska, Pascal Welke, Thomas Gärtner, Sagar Malhotra

PDF OpenReview

Long Context Understanding Using Self-Generated Synthetic Data Jerry Li, Subhro Das, Aude Oliva, Dmitry Krotov, Leonid Karlinsky, Rogerio Feris

PDF OpenReview

Long-Context Vision Large Language Models: Empirical Insights and a Baseline Yongshuo Zong, Ismail Elezi, Yongxin Yang, Jiankang Deng, Timothy Hospedales

PDF OpenReview

Long-Horizon Planning for Multi-Agent Robots in Partially Observable Environments Siddharth Nayak, Adelmo Morrison Orozco, Marina Ten Have, Jackson Zhang, Vittal Thirumalai, Darren Chen, Aditya Kapoor, Eric Robinson, Karthik Gopalakrishnan, James Harrison, Anuj Mahajan, Brian Ichter, Hamsa Balakrishnan

PDF OpenReview

LongAlign: A Recipe for Long Context Alignment of Large Language Models Yushi Bai, Xin Lv, Jiajie Zhang, Yuze He, Ji Qi, Lei Hou, Jie Tang, Yuxiao Dong, Juanzi Li

PDF OpenReview

Look Before You Leap: A Universal Emergent Decomposition of Retrieval Tasks in Language Models Alexandre Variengien, Eric Winsor

PDF OpenReview

Looking at Deep Learning Phenomena Through a Telescoping Lens Alan Jeffares, Alicia Curth, Mihaela van der Schaar

PDF OpenReview

LoQT: Low Rank Adapters for Quantized Training Sebastian Bugge Loeschcke, Mads Toftrup, Michael Kastoryano, Serge Belongie, Vésteinn Snæbjarnarson

PDF OpenReview

LoRD: Low-Rank Decomposition of Monolingual Code LLMs for One-Shot Compression Ayush Kaushal, Tejas Vaidhya, Irina Rish

PDF OpenReview

Lorentzian Residual Neural Networks Neil He, Menglin Yang, Rex Ying

PDF OpenReview

Loss in the Crowd: Hidden Breakthroughs in Language Model Training Sara Kangaslahti, Elan Rosenfeld, Naomi Saphra

PDF OpenReview

Loss Landscape Geometry Reveals Stagewise Development of Transformers George Wang, Matthew Farrugia-Roberts, Jesse Hoogland, Liam Carroll, Susan Wei, Daniel Murfet

PDF OpenReview

Lost in Translation: The Algorithmic Gap Between LMs and the Brain Tosato Tommaso, Tikeng Notsawo Pascal Junior, Helbling Saskia, Irina Rish, Guillaume Dumas

PDF OpenReview

Lottery Ticket Adaptation: Mitigating Destructive Interference in LLMs Ashwinee Panda, Berivan Isik, Xiangyu Qi, Sanmi Koyejo, Tsachy Weissman, Prateek Mittal

PDF OpenReview

Lottery Ticket Adaptation: Mitigating Destructive Interference in LLMs Ashwinee Panda, Berivan Isik, Xiangyu Qi, Sanmi Koyejo, Tsachy Weissman, Prateek Mittal

PDF OpenReview

Lottery Ticket Adaptation: Mitigating Destructive Interference in LLMs Ashwinee Panda, Berivan Isik, Xiangyu Qi, Sanmi Koyejo, Tsachy Weissman, Prateek Mittal

PDF OpenReview

Low Rank Quantization-Aware Training for LLMs Yelysei Bondarenko, Riccardo Del Chiaro, Markus Nagel

PDF OpenReview

Low-Rank Linearization of Large Language Models Michael Zhang, Aaryan Singhal, Benjamin Frederick Spector, Simran Arora, Christopher Re

PDF OpenReview

Lowering PyTorch's Memory Consumption for Selective Differentiation Samarth Bhatia, Felix Dangel

PDF OpenReview

Machine Learning Nominal Max Oxygen Consumption from Wearable Reflective Pulse Oximetry with Density Functional Theory Saleem Abdul Fattah Ahmed Al Dajani, Frédéric Laquai

PDF OpenReview

MAGNOLIA: Matching Algorithms via GNNs for Online Value-to-Go Approximation Alexandre Hayderi, Amin Saberi, Ellen Vitercik, Anders Wikum

PDF OpenReview

Mamba-PTQ: Outlier Channels in Recurrent Large Language Models Alessandro Pierro, Steven Abreu

PDF OpenReview

Manifold-Constrained Nucleus-Level Denoising Diffusion Model for Structure-Based Drug Design Shengchao Liu, Liang Yan, Weitao Du, Weiyang Liu, Hongyu Guo, Christian Borgs, Jennifer T Chayes, Anima Anandkumar

PDF OpenReview

Manipulating Feature Visualizations with Gradient Slingshots Dilyara Bareeva, Marina MC Höhne, Alexander Warnecke, Lukas Pirch, Klaus Robert Muller, Konrad Rieck, Kirill Bykov

PDF OpenReview

Manipulating Feature Visualizations with Gradient Slingshots Dilyara Bareeva, Marina MC Höhne, Alexander Warnecke, Lukas Pirch, Klaus Robert Muller, Konrad Rieck, Kirill Bykov

PDF OpenReview

Many-Shot In-Context Learning Rishabh Agarwal, Avi Singh, Lei M Zhang, Bernd Bohnet, Luis Rosias, Stephanie C.Y. Chan, Biao Zhang, Ankesh Anand, Zaheer Abbas, Azade Nova, John D Co-Reyes, Eric Chu, Feryal Behbahani, Aleksandra Faust, Hugo Larochelle

PDF OpenReview

Many-Shot In-Context Learning Rishabh Agarwal, Avi Singh, Lei M Zhang, Bernd Bohnet, Luis Rosias, Stephanie C.Y. Chan, Biao Zhang, Aleksandra Faust, Hugo Larochelle

PDF OpenReview

Many-Shot In-Context Learning for Molecular Inverse Design Saeed Moayedpour, Alejandro Corrochano-Navarro, Faryad Sahneh, Alexander Koetter, Jiří Vymětal, Lorenzo Kogler Anele, Pablo Mas, Yasser Jangjoo, Sizhen Li, Michael Bailey, Marc Bianciotto, Hans Matter, Christoph Grebner, Gerhard Hessler, Ziv Bar-Joseph, Sven Jager

PDF OpenReview

Many-Shot In-Context Learning in Multimodal Foundation Models Yixing Jiang, Jeremy Andrew Irvin, Ji Hun Wang, Muhammad Ahmed Chaudhry, Jonathan H Chen, Andrew Y. Ng

PDF OpenReview

Many-to-Many Image Generation with Auto-Regressive Diffusion Models Ying Shen, Yizhe Zhang, Shuangfei Zhai, Lifu Huang, Joshua M. Susskind, Jiatao Gu

PDF OpenReview

MAP-THOR: Benchmarking Long-Horizon Multi-Agent Planning Frameworks in Partially Observable Environments Siddharth Nayak, Adelmo Morrison Orozco, Marina Ten Have, Vittal Thirumalai, Jackson Zhang, Darren Chen, Aditya Kapoor, Eric Robinson, Karthik Gopalakrishnan, Brian Ichter, James Harrison, Anuj Mahajan, Hamsa Balakrishnan

PDF OpenReview

MaPPing Your Model: Assessing the Impact of Adversarial Attacks on LLM-Based Programming Assistants John Heibel, Daniel Lowd

PDF OpenReview

Marginal Fairness Sliced Wasserstein Barycenter Khai Nguyen, Hai Nguyen, Nhat Ho

PDF OpenReview

Markov Persuasion Processes: How to Persuade Multiple Agents from Scratch Francesco Bacchiocchi, Francesco Emanuele Stradi, Matteo Castiglioni, Nicola Gatti, Alberto Marchesi

PDF OpenReview

Marrying Causal Representation Learning with Dynamical Systems for Science Dingling Yao, Caroline Muller, Francesco Locatello

PDF OpenReview

Masking in Molecular Graphs Leveraging Reaction Context Jiannan Yang, Veronika Thost, Tengfei Ma

PDF OpenReview

Matching Domain Experts by Training from Scratch on Domain Knowledge Xiaoliang Luo, Guangzhi Sun, Bradley C. Love

PDF OpenReview

Mathematical Models of Computation in Superposition Kaarel Hänni, Jake Mendel, Dmitry Vaintrob, Lawrence Chan

PDF OpenReview

MaxMin-RLHF: Towards Equitable Alignment of Large Language Models with Diverse Human Preferences Souradip Chakraborty, Jiahao Qiu, Hui Yuan, Alec Koppel, Furong Huang, Dinesh Manocha, Amrit Bedi, Mengdi Wang

PDF OpenReview

Measuring Goal-Directedness Matt MacDermott, James Fox, Francesco Belardinelli, Tom Everitt

PDF OpenReview

Measuring Progress in Dictionary Learning for Language Model Interpretability with Board Game Models Adam Karvonen, Benjamin Wright, Can Rager, Rico Angell, Jannik Brinkmann, Logan Riggs Smith, Claudio Mayrink Verdun, David Bau, Samuel Marks

PDF OpenReview

Mechanism Design for Large Language Models Paul Duetting, Vahab Mirrokni, Renato Paes Leme, Haifeng Xu, Song Zuo

PDF OpenReview

Mechanistic Interpretability of Binary and Ternary Transformer Networks Jason Li

PDF OpenReview

Medical Unlearnable Examples: Securing Medical Data from Unauthorized Training via Sparsity-Aware Local Masking Weixiang Sun, Yixin Liu, Zhiling Yan, Kaidi Xu, Lichao Sun

PDF OpenReview

Memory and Bandwidth Are All You Need for Fully Sharded Data Parallel Jiangtao Wang, Jan Ebert, Oleg Filatov, Stefan Kesselheim

PDF OpenReview

Merging Improves Self-Critique Against Jailbreak Attacks Victor Gallego

PDF OpenReview

Merging Text Transformer Models from Different Initializations Neha Verma, Maha Elbayad

PDF OpenReview

MESS: Modern Electronic Structure Simulations Hatem Helal, Andrew W Fitzgibbon

PDF OpenReview

Message-Passing Monte Carlo: Generating Low-Discrepancy Point Sets via Graph Neural Networks T. Konstantin Rusch, Nathan Kirk, Michael M. Bronstein, Christiane Lemieux, Daniela Rus

PDF OpenReview

Meta Flow Matching: Integrating Vector Fields on the Wasserstein Manifold Lazar Atanackovic, Xi Zhang, Brandon Amos, Mathieu Blanchette, Leo J Lee, Yoshua Bengio, Alexander Tong, Kirill Neklyudov

PDF OpenReview

Meta-Designing Quantum Experiments with Language Models Sören Arlt, Haonan Duan, Felix Li, Sang Michael Xie, Yuhuai Wu, Mario Krenn

PDF OpenReview

Meta-Optimization for Deep Learning via Nonstochastic Control Xinyi Chen, Evan Dogariu, Zhou Lu, Elad Hazan

PDF OpenReview

Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving Aniket Rajiv Didolkar, Anirudh Goyal, Nan Rosemary Ke, Siyuan Guo, Michal Valko, Timothy P Lillicrap, Danilo Jimenez Rezende, Yoshua Bengio, Michael Curtis Mozer, Sanjeev Arora

PDF OpenReview

MetaGFN: Exploring Distant Modes with Adapted Metadynamics for Continuous GFlowNets Dominic Phillips, Flaviu Cipcigan

PDF OpenReview

Metric Learning for Clifford Group Equivariant Neural Networks Riccardo Ali, Paulina Kulytė, Haitz Sáez de Ocáriz Borde, Pietro Lio

PDF OpenReview

Mimicking User Data: On Mitigating Fine-Tuning Risks in Closed Large Language Models Francisco Eiras, Aleksandar Petrov, Philip Torr, M. Pawan Kumar, Adel Bibi

PDF OpenReview

Mind-to-Image: Projecting Visual Mental Imagination of the Brain from fMRI Hugo Caselles-Dupré, Charles Mellerio, Herent, Alizée Lopez-Persem, Benoît Béranger, Pierre Fautrel, Gauthier Vernier, Matthieu Cord

PDF OpenReview

MInference: Accelerating Pre-Filling for Long-Context LLMs via Dynamic Sparse Attention Huiqiang Jiang, Yucheng Li, Chengruidong Zhang, Qianhui Wu, Xufang Luo, Surin Ahn, Zhenhua Han, Amir H. Abdi, Dongsheng Li, Chin-Yew Lin, Yuqing Yang, Lili Qiu

PDF OpenReview

MINI-SEQUENCE TRANSFORMER: Optimizing Intermediate Memory for Long Sequences Training Cheng Luo, Jiawei Zhao, Zhuoming Chen, Beidi Chen, Anima Anandkumar

PDF OpenReview

Minimax Tree of Thoughts: Playing Two-Player Zero-Sum Sequential Games with Large Language Models Wei Guo, Xiaotian Hao, Jianye Hao, Yan Zheng

PDF OpenReview

MiniMol: A Parameter-Efficient Foundation Model for Molecular Learning Kerstin Klaser, Blazej Banaszewski, Samuel Maddrell-Mander, Callum McLean, Luis Müller, Ali Parviz, Shenyang Huang, Andrew W Fitzgibbon

PDF OpenReview

Missed Causes and Ambiguous Effects: Counterfactuals Pose Challenges for Interpreting Neural Networks Aaron Mueller

PDF OpenReview

Mission Impossible: A Statistical Perspective on Jailbreaking LLMs Jingtong Su, Julia Kempe, Karen Ullrich

PDF OpenReview

Misspecified $q$-Learning with Sparse Linear Function Approximation: Tight Bounds on Approximation Error Ally Yalei Du, Lin Yang, Ruosong Wang

PDF OpenReview

Mitigate Position Bias in Large Language Models via Scaling a Single Dimension Yijiong Yu, Huiqiang Jiang, Xufang Luo, Qianhui Wu, Chin-Yew Lin, Dongsheng Li, Yuqing Yang, Yongfeng Huang, Lili Qiu

PDF OpenReview

Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy Cameron Allen, Aaron T. Kirtland, Ruo Yu Tao, Sam Lobel, Daniel Scott, Nicholas Petrocelli, Omer Gottesman, Ronald Parr, Michael Littman, George Konidaris

PDF OpenReview

Mixed-Curvature Decision Trees and Random Forests Philippe Chlenski, Quentin Chu, Itsik Pe'er

PDF OpenReview

MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge? Zhaorun Chen, Yichao Du, Zichen Wen, Yiyang Zhou, Chenhang Cui, Zhenzhen Weng, Haoqin Tu, Chaoqi Wang, Zhengwei Tong, Leria Huang, Canyu Chen, Qinghao Ye, Zhihong Zhu, Yuqing Zhang, Jiawei Zhou, Zhuokai Zhao, Rafael Rafailov, Chelsea Finn, Huaxiu Yao

PDF OpenReview

Mobile and Edge Evaluation of Large Language Models Stefanos Laskaridis, Kleomenis Katevas, Lorenzo Minto, Hamed Haddadi

PDF OpenReview

Model Based Diffusion for Trajectory Optimization Chaoyi Pan, Zeji Yi, Guanya Shi, Guannan Qu

PDF OpenReview

Model Breadcrumbs: Scalable Upcycling of Finetuned Foundation Models via Sparse Task Vectors Merging MohammadReza Davari, Eugene Belilovsky

PDF OpenReview

Model-Agnostic Graph Dataset Compression with the Tree Mover’s Distance Mika Sarkin Jain, Stefanie Jegelka, Ishani Karmarkar, Luana Ruiz, Ellen Vitercik

PDF OpenReview

Modeling Bilingual Disfluencies with Large Language Models Negin Raoof, Yating Wu, Carlos Bonilla, Junyi Jessy Li, Stephanie M Grasso, Alex Dimakis, Zoi Gkalitsiou

PDF OpenReview

Modeling Droplets Dynamics in Emulsions with Graph Neural Networks Giulio Ortali, Federico Toschi, Jan-Willem van de Meent

PDF OpenReview

Modeling the Plurality of Human Preferences via Ideal Points Daiwei Chen, Yi Chen, Aniket Rege, Ramya Korlakai Vinayak

PDF OpenReview

Modeling the Plurality of Human Preferences via Ideal Points Daiwei Chen, Yi Chen, Aniket Rege, Ramya Korlakai Vinayak

PDF OpenReview

Modelling Latent Dynamical Systems with Recognition-Parametrised Models Samo Hromadka, Maneesh Sahani

PDF OpenReview

Models That Prove Their Own Correctness Noga Amit, Shafi Goldwasser, Orr Paradise, Guy N. Rothblum

PDF OpenReview

Models That Prove Their Own Correctness Noga Amit, Shafi Goldwasser, Orr Paradise, Guy N. Rothblum

PDF OpenReview

Models That Prove Their Own Correctness Noga Amit, Shafi Goldwasser, Orr Paradise, Guy N. Rothblum

PDF OpenReview

Models That Prove Their Own Correctness Noga Amit, Shafi Goldwasser, Orr Paradise, Guy N. Rothblum

PDF OpenReview

Modularity in Biologically Inspired Representations Depends on Task Variable Range Independence Will Dorrell, Kyle Hsu, Luke Hollingsworth, Jin Hwa Lee, Jiajun Wu, Chelsea Finn, Peter E. Latham, Timothy Edward John Behrens, James C. R. Whittington

PDF OpenReview

MolEval: An Evaluation Toolkit for Molecular Embeddings via LLMs Shaghayegh Sadeghi, Ali Forooghi, Jianguo Lu, Alioune Ngom

PDF OpenReview

MolGene-E: Inverse Molecular Design to Modulate Single Cell Transcriptomics Rahul Ohlan, Raswanth Murugan, Li Xie, Mohammadsadeq Mottaqi, Shuo Zhang, Lei Xie

PDF OpenReview

MONGOOSE: Path-Wise Smooth Bayesian Optimisation via Meta-Learning Adam X. Yang, Laurence Aitchison, Henry Moss

PDF OpenReview

More Details, Please: Improving Autoformalization with More Detailed Proofs Guillem Tarrach, Albert Q. Jiang, Daniel Raggi, Wenda Li, Mateja Jamnik

PDF OpenReview

MoRe Fine-Tuning with 10x Fewer Parameters Wenxuan Tan, Nicholas Roberts, Tzu-Heng Huang, Jitian Zhao, John Cooper, Samuel Guo, Chengyu Duan, Frederic Sala

PDF OpenReview

MoRe Fine-Tuning with 10x Fewer Parameters Wenxuan Tan, Nicholas Roberts, Tzu-Heng Huang, Jitian Zhao, John Cooper, Samuel Guo, Chengyu Duan, Frederic Sala

PDF OpenReview

MoReDrop: Dropout Without Dropping Li Jiang, Duo Li, Yichuan Ding, Xue Liu, Victor Wai Kin Chan

PDF OpenReview

MSA Pairing Transfomer: Protein Interaction Partner Prediction with Few-Shot Contrastive Learning Alex Hawkins-Hooker, Daniel Burkhardt Cerigo, Umberto Lupo, David Jones, Brooks Paige

PDF OpenReview

MSAGPT: Neural Prompting Protein Structure Prediction via MSA Generative Pre-Training Bo Chen, Zhilei Bei, Xingyi Cheng, Pan Li, Jie Tang, Le Song

PDF OpenReview

MSAMamba: Adapting Subquadratic Models to Long-Context DNA MSA Analysis Vishrut Thoutam, Dina Ellsworth

PDF OpenReview

MSAMamba: Adapting Subquadratic Models to Long-Context DNA MSA Analysis Vishrut Thoutam, Dina Ellsworth

PDF OpenReview

Multi-Agent Imitation Learning: Value Is Easy, Regret Is Hard Jingwu Tang, Gokul Swamy, Fei Fang, Steven Wu

PDF OpenReview

Multi-Agent Imitation Learning: Value Is Easy, Regret Is Hard Jingwu Tang, Gokul Swamy, Fei Fang, Steven Wu

PDF OpenReview

Multi-Frequency Progressive Refinement for Learned Inverse Scattering Owen Melia, Olivia Tsang, Vasileios Charisopoulos, Yuehaw Khoo, Jeremy Hoskins, Rebecca Willett

PDF OpenReview

Multi-Modal and Multi-Agent Systems Meet Rationality: A Survey Bowen Jiang, Yangxinyu Xie, Xiaomeng Wang, Weijie J Su, Camillo Jose Taylor, Tanwi Mallick

PDF OpenReview

Multi-Modal and Multi-Task Transformer for Small Molecule Drug Discovery Sai Krishna Sirumalla, David Stephen Farina Jr, Zhuoran Qiao, Daniele Alessandro Di Cesare, Felipe Costas Farias, Michael Bernard O’Connor, Peter John Bygrave, Feizhi Ding, Thomas Dresselhaus, Marcelo Gomes Pereira de Lacerda, Jason Matthew Swails, Daniel Miles, Matthew Welborn, Fred Manby, Thomas Miller

PDF OpenReview

Multi-Objective Differentiable Neural Architecture Search Rhea Sanjay Sukthanker, Arber Zela, Benedikt Staffler, Samuel Dooley, Josif Grabocka, Frank Hutter

PDF OpenReview

Multi-Objective Guidance via Importance Sampling for Target-Aware Diffusion-Based De Novo Ligand Generation Julian Cremer, Tuan Le, Frank Noe, Djork-Arné Clevert, Kristof T Schütt

PDF OpenReview

Multi-Task Extension of Geometrically Aligned Transfer Encoder Sung Moon Ko, Sumin Lee, Dae-Woong Jeong, Hyunseung Kim, Chanhui Lee, Soorin Yim, Sehui Han

PDF OpenReview

Multi-Task Training Increases Native Sequence Recovery of Antigen-Specific T-Cell Receptor Sequences Dhuvarakesh Karthikeyan, Alex Rubinsteyn

PDF OpenReview

Multilingual Compression Parity: How Efficiently Large Language Models Represent Information Across Languages? Alexander Tsvetkov, Alon Kipnis

PDF OpenReview

Multimodal Foundation World Models for Generalist Embodied Agents Pietro Mazzaglia, Tim Verbelen, Bart Dhoedt, Aaron Courville, Sai Rajeswar

PDF OpenReview

Multiple-Policy Evaluation via Density Estimation Yilei Chen, Aldo Pacchiano, Ioannis Paschalidis

PDF OpenReview

MultiScale Policy Learning for Alignment with Long Term Objectives Richa Rastogi, Yuta Saito, Thorsten Joachims

PDF OpenReview

Multivector Neurons: Better and Faster O(n)-Equivariant Clifford GNNs Cong Liu, David Ruhe, Patrick Forré

PDF OpenReview

Navigating Chemical Space with Latent Flows Guanghao Wei, Yining Huang, Chenru Duan, Yue Song, Yuanqi Du

PDF OpenReview

Navigating Trustworthiness of Deep Learning in ∆∆g Prediction : Addressing Data Bias, Model Evaluation, and Interpretation Ruochi Zhang, Ningning Chen, Fengfeng Zhou, Xin Gao

PDF OpenReview

NCIDiff: Non-Covalent Interaction-Generative Diffusion Model for Improving Reliability of 3D Molecule Generation Inside Protein Pocket Joongwon Lee, Wonho Zhung, Woo Youn Kim

PDF OpenReview

NEBULA: Neural Empirical Bayes Under Latent Representations for Efficient and Controllable Design of Molecular Libraries Ewa Nowara, Pedro O. Pinheiro, Sai Pooja Mahajan, Omar Mahmood, Andrew Martin Watkins, Saeed Saremi, Michael Maser

PDF OpenReview

NEORL: Efficient Exploration for Nonepisodic RL Bhavya Sukhija, Lenart Treven, Florian Dorfler, Stelian Coros, Andreas Krause

PDF OpenReview

Neural Collapse Versus Low-Rank Bias: Is Deep Neural Collapse Really Optimal? Peter Súkeník, Marco Mondelli, Christoph H. Lampert

PDF OpenReview

Neural Dueling Bandits Arun Verma, Zhongxiang Dai, Xiaoqiang Lin, Patrick Jaillet, Bryan Kian Hsiang Low

PDF OpenReview

Neural Incremental Data Assimilation Matthieu Blanke, Ronan Fablet, Marc Lelarge

PDF OpenReview

Neural Interactive Proofs Lewis Hammond, Sam Adam-Day

PDF OpenReview

Neural Network Learns Low-Dimensional Polynomials with SGD near the Information-Theoretic Limit Jason D. Lee, Kazusato Oko, Taiji Suzuki, Denny Wu

PDF OpenReview

Neural Ratio Estimators Meet Distributional Shift and Mode Misspecification: A Cautionary Tale from Strong Gravitational Lensing Andreas Filipp, Yashar Hezaveh, Laurence Perreault-Levasseur

PDF OpenReview

Neural Symmetry Detection for Learning Neural Network Constraints Alex Gabel, Rick Quax, Stratis Gavves

PDF OpenReview

Neural Thermodynamic Integration: Free Energies from Energy-Based Diffusion Models Bálint Máté, François Fleuret, Tristan Bereau

PDF OpenReview

Neuroplasticity and Corruption in Model Mechanisms: A Case Study of Indirect Object Identification Vishnu Kabir Chhabra, Ding Zhu, Mohammad Mahdi Khalili

PDF OpenReview

Neurosymbolic Markov Models Lennert De Smet, Gabriele Venturato, Luc De Raedt, Giuseppe Marra

PDF OpenReview

New Desiderata for Direct Preference Optimization Xiangkun Hu, Tong He, David Wipf

PDF OpenReview

No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO Skander Moalla, Andrea Miele, Razvan Pascanu, Caglar Gulcehre

PDF OpenReview

Non-Differentiable Diffusion Guidance for Improved Molecular Geometry Yuchen Shen, Chenhao Zhang, Chenghui Zhou, Sijie Fu, Newell Washburn, Barnabas Poczos

PDF OpenReview

Non-Ergodicity in Reinforcement Learning: Robustness via Ergodicity Transformations Dominik Baumann, Erfaun Noorani, James Price, Ole Peters, Colm Connaughton, Thomas B. Schön

PDF OpenReview

Non-Linear $H_\infty$ Robustness Guarantees for Neural Network Policies Daniel Urieli

PDF OpenReview

Non-Parameteric Conformal Distributionally Robust Optimization Yash Patel, Guyang Cao, Ambuj Tewari

PDF OpenReview

Nonconvex Meta-Optimization for Deep Learning Xinyi Chen, Evan Dogariu, Zhou Lu, Elad Hazan

PDF OpenReview

Not Just Pretty Pictures: Toward Interventional Data Augmentation Using Text-to-Image Generators Jianhao Yuan, Francesco Pinto, Adam Davies, Philip Torr

PDF OpenReview

NVDSL: Simplifying Tensor Cores with Python-Driven MLIR Metaprogramming Guray Ozen

PDF OpenReview

Off-Policy Evaluation from Logged Human Feedback Aniruddha Bhargava, Lalit K Jain, Branislav Kveton, Ge Liu, Subhojyoti Mukherjee

PDF OpenReview

Offline Reinforcement Learning with Pessimistic Value Priors Filippo Valdettaro, Aldo A. Faisal

PDF OpenReview

Offline RL via Feature-Occupancy Gradient Ascent Gergely Neu, Nneka Okolo

PDF OpenReview

OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents Zihao Wang, Shaofei Cai, Zhancun Mu, Haowei Lin, Ceyao Zhang, Xuejie Liu, Qing Li, Anji Liu, Xiaojian Ma, Yitao Liang

PDF OpenReview

On Conditional Sampling with Joint Flow Matching Amy Xiang Wang

PDF OpenReview

On Fairly Comparing Group Equivariant Networks Lucas Roos, Rodney Stephen Kroon

PDF OpenReview

On Language Models’ Cognitive Biases in Reading Time Prediction Patrick Haller, Lena Sophia Bolliger, Lena Ann Jäger

PDF OpenReview

On PI Controllers for Updating Lagrange Multipliers in Constrained Optimization Motahareh Sohrabi, Juan Ramirez, Tianyue H. Zhang, Simon Lacoste-Julien, Jose Gallego-Posada

PDF OpenReview

On Provable Length and Compositional Generalization Kartik Ahuja, Amin Mansouri

PDF OpenReview

On Robust Reinforcement Learning with Lipschitz-Bounded Policy Networks Nicholas H. Barbara, Ruigang Wang, Ian Manchester

PDF OpenReview

On the Calibration of Conditional-Value-at-Risk Rajeev Verma, Volker Fischer, Eric Nalisnick

PDF OpenReview

On the Difficulty of Faithful Chain-of-Thought Reasoning in Large Language Models Sree Harsha Tanneru, Dan Ley, Chirag Agarwal, Himabindu Lakkaraju

PDF OpenReview

On the Discrepancy and Connection Between Memorization and Generation in Diffusion Models Hanyu Wang, Yujin Han, Difan Zou

PDF OpenReview

On the Effectiveness of Quantum Chemistry Pre-Training for Pharmacological Property Prediction Arun Raja, Hongtao Zhao, Christian Tyrchan, Eva Nittinger, Michael M. Bronstein, Charlotte Deane, Garrett M Morris

PDF OpenReview

On the Expressive Power of Tree-Structured Probabilistic Circuits Lang Yin, Han Zhao

PDF OpenReview

On the Local Geometry of Deep Generative Manifolds Ahmed Imtiaz Humayun, Ibtihel Amara, Candice Schumann, Golnoosh Farnadi, Negar Rostamzadeh, Mohammad Havaei

PDF OpenReview

On the Matter of Embeddings Dispersion on Hyperspheres Evgeniia Tokarchuk, Hua Chang Bakker, Vlad Niculae

PDF OpenReview

On the Metastability of Learning Algorithms in Physics-Informed Neural Networks: A Case Study on Schr\"odinger Operators Alessandro Maria Selvitella

PDF OpenReview

On the Multi-Modal Vulnerability of Diffusion Models Dingcheng Yang, Yang Bai, Xiaojun Jia, Yang Liu, Xiaochun Cao, Wenjian Yu

PDF OpenReview

On the Power of Convolution Augmented Transformer Mingchen Li, Xuechen Zhang, Yixiao Huang, Samet Oymak

PDF OpenReview

On the Privacy Risks of Post-Hoc Explanations of Foundation Models Catherine Huang, Martin Pawelczyk, Himabindu Lakkaraju

PDF OpenReview

On the Robustness of Neural Networks Quantization Against Data Poisoning Attacks Yiwei Lu, Yihan Wang, Guojun Zhang, Yaoliang Yu

PDF OpenReview

On the Theory of Risk-Aware Agents: Bridging Actor-Critic and Economics Michal Nauman, Marek Cygan

PDF OpenReview

On Three-Layer Data Markets Alireza Fallah, Michael Jordan, Ali Makhdoumi, Azarakhsh Malekian

PDF OpenReview

One-Shot Safety Alignment for Large Language Models via Optimal Dualization Xinmeng Huang, Shuo Li, Edgar Dobriban, Osbert Bastani, Hamed Hassani, Dongsheng Ding

PDF OpenReview

One-Shot Safety Alignment for Large Language Models via Optimal Dualization Xinmeng Huang, Shuo Li, Edgar Dobriban, Osbert Bastani, Hamed Hassani, Dongsheng Ding

PDF OpenReview

One-Versus-Others Attention: Scalable Multimodal Integration for Biomedical Data Michal Golovanevsky, Eva Schiller, Akira A Nair, Ritambhara Singh, Carsten Eickhoff

PDF OpenReview

Online Optimization of Closed-Loop Control Systems Hao Ma, Melanie Zeilinger, Michael Muehlebach

PDF OpenReview

Online Performance Optimization of Nonlinear Systems: A Gray-Box Approach Zhiyu He, Michael Muehlebach, Saverio Bolognani, Florian Dorfler

PDF OpenReview

Open LLMs Are Necessary for Private Adaptations and Outperform Their Closed Alternatives Vincent Hanke, Tom Blanchard, Franziska Boenisch, Iyiola Emmanuel Olatunji, Michael Backes, Adam Dziedzic

PDF OpenReview

Open LLMs Are Necessary for Private Adaptations and Outperform Their Closed Alternatives Vincent Hanke, Tom Blanchard, Franziska Boenisch, Iyiola Emmanuel Olatunji, Michael Backes, Adam Dziedzic

PDF OpenReview

OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training Sami Jaghouar, Johannes Hagemann

PDF OpenReview

OpenELM: An Efficient Language Model Family with Open Training and Inference Framework Sachin Mehta, Mohammad Hossein Sekhavat, Qingqing Cao, Maxwell Horton, Yanzi Jin, Chenfan Sun, Seyed Iman Mirzadeh, Mahyar Najibi, Dmitry Belenko, Peter Zatloukal, Mohammad Rastegari

PDF OpenReview

Optimal Design for Human Feedback Subhojyoti Mukherjee, Anusha Lalitha, Kousha Kalantari, Aniket Anand Deshmukh, Ge Liu, Yifei Ma, Branislav Kveton

PDF OpenReview

Optimality of Stationary Policies in Risk-Averse Total-Reward MDPs with EVaR Xihong Su, Marek Petrik, Julien Grand-Clément

PDF OpenReview

Optimised Grouped-Query Attention Mechanism for Transformers Yuang Chen, Cheng Zhang, Xitong Gao, Robert D. Mullins, George Anthony Constantinides, Yiren Zhao

PDF OpenReview

Optimistic Asynchrony Control: Achieving Synchronous Convergence with Asynchronous Throughput for Embedding Model Training Roger Waleffe, Jason Mohoney

PDF OpenReview

Optimistic Information Directed Sampling Gergely Neu, Matteo Papini, Ludovic Schwartz

PDF OpenReview

Optimistic Verifiable Training by Controlling Hardware Nondeterminism Megha Srivastava, Simran Arora, Dan Boneh

PDF OpenReview

Oracle-Efficient Reinforcement Learning for Max Value Ensembles Marcel Hussing, Michael Kearns, Aaron Roth, Sikata Bela Sengupta, Jessica Sorrell

PDF OpenReview

Order-Optimal Instance-Dependent Bounds for Offline Reinforcement Learning with Preference Feedback Zhirui Chen, Vincent Y. F. Tan

PDF OpenReview

ORSO: Accelerating Reward Design via Online Reward Selection and Policy Optimization Chen Bo Calvin Zhang, Zhang-Wei Hong, Aldo Pacchiano, Pulkit Agrawal

PDF OpenReview

OTTER: Effortless Label Distribution Adaptation of Zero-Shot Models Changho Shin, Jitian Zhao, Sonia Cromp, Harit Vishwakarma, Frederic Sala

PDF OpenReview

Out-of-Context Prompting Boosts Fairness and Robustness in Large Language Model Predictions Leonardo Cotta, Chris J. Maddison

PDF OpenReview

Out-of-Distribution Validation for Bioactivity Prediction in Drug Discovery: Lessons from Materials Science Udit Surya Saha, Michele Vendruscolo, Anne E Carpenter, Shantanu Singh, Andreas Bender, Srijit Seal

PDF OpenReview

OutEffHop: A Principled Outlier-Efficient Attention Layer from Dense Associative Memory Models Haozheng Luo, Jerry Yao-Chieh Hu, Pei-Hsuan Chang, Hong-Yu Chen, Weijian Li, Wei-Po Wang, Han Liu

PDF OpenReview

Outliers and Calibration Sets Have Diminishing Effect on Quantization of Modern LLMs Davide Paglieri, Saurabh Dash, Tim Rocktäschel, Jack Parker-Holder

PDF OpenReview

Overcoming Knowledge Barriers: Online Imitation Learning from Observation with Pretrained World Models Xingyuan Zhang, Philip Becker-Ehmck, Patrick van der Smagt, Maximilian Karl

PDF OpenReview

Overconfident Oracles: Limitations of in Silico Sequence Design Benchmarking Shikha Surana, Nathan Grinsztajn, Timothy Atkinson, Paul Duckworth, Thomas D Barrett

PDF OpenReview

OxonFair: A Flexible Toolkit for Algorithmic Fairness Eoin D. Delaney, Zihao Fu, Sandra Wachter, Brent Mittelstadt, Chris Russell

PDF OpenReview

PAIR: Boosting the Predictive Power of Protein Representations with a Corpus of Text Annotations Haonan Duan, Marta Skreta, Leonardo Cotta, Ella Miray Rajaonson, Nikita Dhawan, Alan Aspuru-Guzik, Chris J. Maddison

PDF OpenReview

PanSAM: Zero-Shot, Prompt-Free Pancreas Segmentation in CT Imaging Abolfazl Malekahmadi, Mohammad Taha Teimuri Jervakani, Armin Behnamnia, Zahra Dehghanian, Amir Shamloo, Hamid R. Rabiee

PDF OpenReview

Parallelising Differentiable Algorithms Removes the Scalar Bottleneck: A Case Study Euan Ong, Ferenc Huszár, Pietro Lio, Petar Veličković

PDF OpenReview

Parameter Tuning and Modeling of a Rotary Kiln Using Physics-Informed Neural Networks Janak M. Patel, Vishal Sudam Jadhav, Anirudh Deodhar, Shirish Karande, Venkataramana Runkana

PDF OpenReview

Parameter-Efficient Quantized Mixture-of-Experts Meets Vision-Language Instruction Tuning for Semiconductor Electron Micrograph Analysis Sagar Srinivas Sakhinana, Sannidhi Gowri Naga Krishna Geethan, Chidaksh Ravuru, Venkataramana Runkana

PDF OpenReview

Partial Structure Discovery Is Sufficient for No-Regret Learning in Causal Bandits Muhammad Qasim Elahi, Mahsa Ghasemi, Murat Kocaoglu

PDF OpenReview

Partially Observable Multi-Agent Reinforcement Learning Using Mean Field Control Kai Cui, Sascha H. Hauck, Christian Fabian, Heinz Koeppl

PDF OpenReview

Path Complex Neural Network for Molecular Property Prediction Longlong Li, Xiang Liu, Guanghui Wang, Yu Guang Wang, Kelin Xia

PDF OpenReview

PathoLM: Identifying Pathogenicity from the DNA Sequence Through the Genome Foundation Model Sajib Acharjee Dip

PDF OpenReview

Penzai + Treescope: A Toolkit for Interpreting, Visualizing, and Editing Models as Data Daniel D. Johnson

PDF OpenReview

Performance Control in Early Exiting to Deploy Large Models at the Same Cost of Smaller Ones Mehrnaz Mofakhami, Reza Bayat, Ioannis Mitliagkas, Joao Monteiro, Valentina Zantedeschi

PDF OpenReview

Performative Prediction on Games and Mechanism Design António Góis, Mehrnaz Mofakhami, Fernando P. Santos, Simon Lacoste-Julien, Gauthier Gidel

PDF OpenReview

Permutation Tree Invariant Neural Architectures Johannes Urban, Sebastian Tschiatschek, Nils Morten Kriege

PDF OpenReview

PhaseEvo: Towards Unified Long-Context Prompt Optimization for Large Language Models Wendi Cui, Jiaxin Zhang, Zhuohang Li, Hao Sun, Damien Lopez, Kamalika Das, Bradley A. Malin, Sricharan Kumar

PDF OpenReview

Physical Backdoor Attack Can Jeopardize Driving with Vision-Large-Language Models Zhenyang Ni, Rui Ye, Yuxi Wei, Zhen Xiang, Yanfeng Wang, Siheng Chen

PDF OpenReview

Physics-Informed Neural Networks for Derivative-Constrained PDEs Kentaro Hoshisashi, Carolyn E. Phelan, Paolo Barucca

PDF OpenReview

Physics-Informed Weakly Supervised Learning for Interatomic Potentials Makoto Takamoto, Viktor Zaverkin, Mathias Niepert

PDF OpenReview

PICT: Adaptive GPU Accelerated Differentiable Fluid Simulation for Machine Learning Aleksandra Franz, Nils Thuerey

PDF OpenReview

PIED: Physics-Informed Experimental Design for Inverse Problems Apivich Hemachandra, Gregory Kang Ruey Lau, See-Kiong Ng, Bryan Kian Hsiang Low

PDF OpenReview

Pink Noise LQR: How Does Colored Noise Affect the Optimal Policy in RL? Jakob Hollenstein, Marko Zaric, Samuele Tosatto, Justus Piater

PDF OpenReview

PINNACLE: PINN Adaptive ColLocation and Experimental Points Selection Gregory Kang Ruey Lau, Apivich Hemachandra, See-Kiong Ng, Bryan Kian Hsiang Low

PDF OpenReview

PIPER: Primitive-Informed Preference-Based Hierarchical Reinforcement Learning via Hindsight Relabeling Utsav Singh, Wesley A. Suttle, Brian M. Sadler, Vinay P. Namboodiri, Amrit Singh Bedi

PDF OpenReview

PIPER: Primitive-Informed Preference-Based Hierarchical Reinforcement Learning via Hindsight Relabeling Utsav Singh, Wesley A. Suttle, Brian M. Sadler, Vinay P. Namboodiri, Amrit Bedi

PDF OpenReview

PIXART-Δ: Fast and Controllable Image Generation with Latent Consistency Models Junsong Chen, Simian Luo, Enze Xie

PDF OpenReview

Planning Behavior in a Recurrent Neural Network That Plays Sokoban Adrià Garriga-Alonso, Mohammad Taufeeque, Adam Gleave

PDF OpenReview

Playing Large Games with Oracles and AI Debate Xinyi Chen, Angelica Chen, Dean Foster, Elad Hazan

PDF OpenReview

PLINDER: The Protein-Ligand Interactions Dataset and Evaluation Resource Janani Durairaj, Yusuf Adeshina, Zhonglin Cao, Xuejin Zhang, Vladas Oleinikovas, Thomas Duignan, Zachary McClure, Xavier Robin, Emanuele Rossi, Guoqing Zhou, Srimukh Prasad Veccham, Clemens Isert, Yuxing Peng, Prabindh Sundareson, Mehmet Akdel, Gabriele Corso, Hannes Stark, Zachary Wayne Carpenter, Michael M. Bronstein, Emine Kucukbenli, Torsten Schwede, Luca Naef

PDF OpenReview

PLUTO: Pathology-Universal Transformer Dinkar Juyal, Harshith Padigela, Chintan Shah, Daniel Shenker, Natalia Harguindeguy, Yi Liu, Blake Martin, Yibo Zhang, Michael Nercessian, Miles Markey, Isaac Finberg, Kelsey Luu, Daniel Borders, Syed Ashar Javed, Emma L Krause, Raymond Biju, Aashish Sood, Allen Ma, Jackson Nyman, John Shamshoian, Guillaume Chhor, Darpan Sanghavi, Marc Thibault, Limin Yu, Fedaa Najdawi, Jennifer A. Hipp, Darren Fahy, Benjamin Glass, Eric E. Walk, John Abel, Harsha Vardhan Pokkalla, Andrew H. Beck, Sean Grullon

PDF OpenReview

PLUTO: Pathology-Universal Transformer Dinkar Juyal, Harshith Padigela, Chintan Shah, Daniel Shenker, Natalia Harguindeguy, Yi Liu, Blake Martin, Yibo Zhang, Michael Nercessian, Miles Markey, Isaac Finberg, Kelsey Luu, Daniel Borders, Syed Ashar Javed, Emma Krause, Raymond Biju, Aashish Sood, Allen Ma, Jackson Nyman, John Shamshoian, Guillaume Chhor, Darpan Sanghavi, Marc Thibault, Limin Yu, Fedaa Najdawi, Jennifer A. Hipp, Darren Fahy, Benjamin Glass, Eric Walk, John Abel, Harsha Vardhan Pokkalla, Andrew H. Beck, Sean Grullon

PDF OpenReview

PLUTO: Pathology-Universal Transformer Dinkar Juyal, Harshith Padigela, Chintan Shah, Daniel Shenker, Natalia Harguindeguy, Yi Liu, Blake Martin, Yibo Zhang, Michael Nercessian, Miles Markey, Isaac Finberg, Kelsey Luu, Daniel Borders, Syed Ashar Javed, Emma L Krause, Raymond Biju, Aashish Sood, Allen Ma, Jackson Nyman, John Shamshoian, Guillaume Chhor, Darpan Sanghavi, Marc Thibault, Limin Yu, Fedaa Najdawi, Jennifer A. Hipp, Darren Fahy, Benjamin Glass, Eric Walk, John Abel, Harsha Vardhan Pokkalla, Andrew H. Beck, Sean Grullon

PDF OpenReview

Policy Gradient Methods with Adaptive Policy Spaces Gianmarco Tedeschi, Matteo Papini, Marcello Restelli

PDF OpenReview

Policy Gradients for Optimal Parallel Tempering MCMC Daniel Zhao, Natesh S. Pillai

PDF OpenReview

Polynomial Convergence of Bandit No-Regret Dynamics in Congestion Games Leello Tadesse Dadi, Ioannis Panageas, Stratis Skoulakis, Luca Viano, Volkan Cevher

PDF OpenReview

Polynomial Regression as a Task for Understanding In-Context Learning Through Finetuning and Alignment Max Wilcoxson, Morten Svendgård, Ria Doshi, Dylan Davis, Reya Vir, Anant Sahai

PDF OpenReview

Population Transformer: Learning Population-Level Representations of Intracranial Activity Geeling Chau, Christopher Wang, Sabera J Talukder, Vighnesh Subramaniam, Saraswati Soedarmadji, Yisong Yue, Boris Katz, Andrei Barbu

PDF OpenReview

Population-Level Dark Energy Constraints from Strong Gravitational Lensing Using Simulation-Based Inference Sreevani Jarugula, Brian Nord, Abhijith Gandrakota, Aleksandra Ciprijanovic

PDF OpenReview

Position Coupling: Leveraging Task Structure for Improved Length Generalization of Transformers Hanseul Cho, Jaeyoung Cha, Pranjal Awasthi, Srinadh Bhojanapalli, Anupam Gupta, Chulhee Yun

PDF OpenReview

Position Paper: Dual-System Language Models via Next-Action Prediction Zhehang Du, Weijie J Su

PDF OpenReview

POST: A Framework for Privacy of Soft-Prompt Transfer Xun Wang, Jing Xu, Franziska Boenisch, Michael Backes, Adam Dziedzic

PDF OpenReview

POST: A Framework for Privacy of Soft-Prompt Transfer Xun Wang, Jing Xu, Franziska Boenisch, Michael Backes, Adam Dziedzic

PDF OpenReview

Power Mean Estimation in Stochastic Monte-Carlo Tree Search Tuan Quang Dam, Odalric-Ambrym Maillard, Emilie Kaufmann

PDF OpenReview

PQV-Mobile: A Combined Pruning and Quantization Toolkit to Optimize Vision Transformers for Mobile Applications Kshitij Bhardwaj

PDF OpenReview

Pre-Calc: Learning to Use the Calculator Improves Numeracy in Language Models Vishruth Veerendranath, Vishwa Shah, Kshitish Ghate

PDF OpenReview

Pre-Training of Single-Cell Language Models Through Genetic Pathway Learning Xuxi Chen, Zhangyang Wang, Marinka Zitnik, Manolis Kellis, Tianlong Chen

PDF OpenReview

Predicting Dark Matter Halo Masses from Simulated Galaxy Images and Environments Austin J Larson, John F Wu, Craig Jones

PDF OpenReview

Predicting Metal-Protein Interactions Using Cofolding Methods: Status Quo Simon L. Dürr, Ursula Rothlisberger

PDF OpenReview

Predictive Uncertainties Based on Proper Scoring Rules Nikita Kotelevskii, Maxim Panov

PDF OpenReview

Preference Elicitation for Offline Reinforcement Learning Alizée Pace, Bernhard Schölkopf, Gunnar Ratsch, Giorgia Ramponi

PDF OpenReview

Preference Elicitation for Offline Reinforcement Learning Alizée Pace, Bernhard Schölkopf, Gunnar Ratsch, Giorgia Ramponi

PDF OpenReview

Preference Learning Algorithms Do Not Learn Preference Rankings Angelica Chen, Sadhika Malladi, Lily H Zhang, Xinyi Chen, Qiuyi Zhang, Rajesh Ranganath, Kyunghyun Cho

PDF OpenReview

Preference Learning Algorithms Do Not Learn Preference Rankings Angelica Chen, Sadhika Malladi, Lily H Zhang, Xinyi Chen, Qiuyi Zhang, Rajesh Ranganath, Kyunghyun Cho

PDF OpenReview

Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models Siyan Zhao, Daniel Mingyi Israel, Guy Van den Broeck, Aditya Grover

PDF OpenReview

Pretrained Deep Models Outperform GBDTs in Learning-to-Rank Under Label Scarcity Charlie Hou, Kiran Koshy Thekumparampil, Michael Shavlovsky, Giulia Fanti, Sujay Sanghavi

PDF OpenReview

Pretrained Hybrids with MAD Skills Nicholas Roberts, Samuel Guo, Zhiqi Gao, Satya Sai Srinath Namburi Gnvv, Sonia Cromp, Chengjun Wu, Chengyu Duan, Frederic Sala

PDF OpenReview

Pretrained Hybrids with MAD Skills Nicholas Roberts, Samuel Guo, Zhiqi Gao, Satya Sai Srinath Namburi Gnvv, Sonia Cromp, Chengjun Wu, Chengyu Duan, Frederic Sala

PDF OpenReview

Pretrained Hybrids with MAD Skills Nicholas Roberts, Samuel Guo, Zhiqi Gao, Satya Sai Srinath Namburi Gnvv, Sonia Cromp, Chengjun Wu, Chengyu Duan, Frederic Sala

PDF OpenReview

PrimeGuard: Safe and Helpful LLMs Through Tuning-Free Routing Blazej Manczak, Eric Lin, Eliott Zemour, Vaikkunth Mugunthan

PDF OpenReview

Privacy Auditing of Large Language Models Ashwinee Panda, Xinyu Tang, Milad Nasr, Christopher A. Choquette-Choo, Prateek Mittal

PDF OpenReview

Privacy Auditing of Large Language Models Ashwinee Panda, Xinyu Tang, Milad Nasr, Christopher A. Choquette-Choo, Prateek Mittal

PDF OpenReview

Private Attribute Inference from Images with Vision-Language Models Batuhan Tömekçe, Mark Vero, Robin Staab, Martin Vechev

PDF OpenReview

Private Fine-Tuning of Large Language Models with Zeroth-Order Optimization Xinyu Tang, Ashwinee Panda, Milad Nasr, Saeed Mahloujifar, Prateek Mittal

PDF OpenReview

Probabilistic World Modeling with Asymmetric Distance Measure Meng Song

PDF OpenReview

Probability Tools for Sequential Random Projection Yingru Li

PDF OpenReview

Probing the Decision Boundaries of In-Context Learning in Large Language Models Siyan Zhao, Tung Nguyen, Aditya Grover

PDF OpenReview

Probing the Decision Boundaries of In-Context Learning in Large Language Models Siyan Zhao, Tung Nguyen, Aditya Grover

PDF OpenReview

Processing Large-Scale Graphs with G-Signatures Lukas Gruber, Bernhard Schäfl, Johannes Brandstetter, Sepp Hochreiter

PDF OpenReview

ProFeAT: Projected Feature Adversarial Training for Self-Supervised Learning of Robust Representations Sravanti Addepalli, Priyam Dey, Venkatesh Babu Radhakrishnan

PDF OpenReview

Progress Measures for Grokking on Real-World Tasks Satvik Golechha

PDF OpenReview

Progress or Regress? Self-Improvement Reversal in Post-Training Ting Wu, Xuefeng Li, Pengfei Liu

PDF OpenReview

Progressive Distillation Improves Feature Learning via Implicit Curriculum Abhishek Panigrahi, Bingbin Liu, Sadhika Malladi, Andrej Risteski, Surbhi Goel

PDF OpenReview

Progressive Distillation Improves Feature Learning via Implicit Curriculum Abhishek Panigrahi, Bingbin Liu, Sadhika Malladi, Andrej Risteski, Surbhi Goel

PDF OpenReview

Progressive-Hint Prompting Improves Reasoning in Large Language Models Chuanyang Zheng, Zhengying Liu, Enze Xie, Zhenguo Li, Yu Li

PDF OpenReview

Projectable Models: One-Shot Generation of Small Specialized Transformers from Large Ones Andrey Zhmoginov, Jihwan Lee, Mark Sandler

PDF OpenReview

Projected Language Models: A Large Model Pre-Segmented into Smaller Ones David Grangier, Angelos Katharopoulos, Pierre Ablin, Awni Hannun

PDF OpenReview

Projection Killer: Peering Through High Dimensional Posterior Distribution Marco Raveri, Cyrille Doux, Shivam Pandey

PDF OpenReview

Prompt Optimization with EASE? Efficient Ordering-Aware Automated Selection of Exemplars Zhaoxuan Wu, Xiaoqiang Lin, Zhongxiang Dai, Wenyang Hu, Yao Shu, See-Kiong Ng, Patrick Jaillet, Bryan Kian Hsiang Low

PDF OpenReview

Prompt Optimization with Human Feedback Xiaoqiang Lin, Zhongxiang Dai, Arun Verma, See-Kiong Ng, Patrick Jaillet, Bryan Kian Hsiang Low

PDF OpenReview

Prompt-Prompted Adaptive Structured Pruning for Efficient LLM Generation Harry Dong, Beidi Chen, Yuejie Chi

PDF OpenReview

Prot2Token: A Multi-Task Framework for Protein Language Processing Using Autoregressive Language Modeling Mahdi Pourmirzaei, Farzaneh Esmaili, Mohammadreza Pourmirzaei, Duolin Wang, Dong Xu

PDF OpenReview

Protein Language Models Expose Viral Mimicry and Immune Escape Dan Ofer, Michal Linial

PDF OpenReview

Protein Language Models in Directed Evolution Russell Maguire, Kotryna Bloznelyte, Fikayo Adepoju, Matthew Armean-Jones, Shafiat Dewan, Akash Gupta, Frances Patricia Jones, Preet Lalli, Anna Schooneveld, Sean Thompson, Ece Ebrahimi, Stella Fozzard, David Berman, Luca Rossoni, Will Addison, Ian Taylor

PDF OpenReview

ProtMamba: A Homology-Aware but Alignment-Free Protein State Space Model Damiano Sgarbossa, Cyril Malbranke, Anne-Florence Bitbol

PDF OpenReview

Prototype-Based Methods in Explainable AI and Emerging Opportunities in the Geosciences Anushka Narayanan, Karianne Bergen

PDF OpenReview

Provable Benefit of Cutout and CutMix for Feature Learning Junsoo Oh, Chulhee Yun

PDF OpenReview

Provable Partially Observable Reinforcement Learning with Privileged Information Yang Cai, Xiangyu Liu, Argyris Oikonomou, Kaiqing Zhang

PDF OpenReview

Provable Tempered Overfitting of Minimal Nets and Typical Nets Itamar Harel, William M. Hoza, Gal Vardi, Itay Evron, Nathan Srebro, Daniel Soudry

PDF OpenReview

Provably Mitigating Overoptimization in RLHF: Your SFT Loss Is Implicitly an Adversarial Regularizer Zhihan Liu, Miao Lu, Shenao Zhang, Boyi Liu, Hongyi Guo, Yingxiang Yang, Jose Blanchet, Zhaoran Wang

PDF OpenReview

Proving That Cryptic Crossword Clue Answers Are Correct Martin Andrews, Sam Witteveen

PDF OpenReview

ProxyTune: Hyperparameter Tuning Through Iteratively Refined Proxies Agrin Hilmkil, Wenbo Gong, Nick Pawlowski, Cheng Zhang

PDF OpenReview

PutnamBench: A Multilingual Competition-Mathematics Benchmark for Formal Theorem-Proving George Tsoukalas, Jasper Lee, John Jennings, Jimmy Xin, Michelle Ding, Michael Jennings, Amitayush Thakur, Swarat Chaudhuri

PDF OpenReview

QGFN: Controllable Greediness with Action Values Elaine Lau, Stephen Zhewen Lu, Ling Pan, Doina Precup, Emmanuel Bengio

PDF OpenReview

Quality-Diversity for One-Shot Biological Sequence Design Jérémie Dona, Arthur Flajolet, Andrei Marginean, Antoine Cully, Thomas Pierrot

PDF OpenReview

Quantifying Aleatoric and Epistemic Uncertainty: A Credal Approach Paul Hofman, Yusuf Sale, Eyke Hüllermeier

PDF OpenReview

Quantized Representations Prevent Dimensional Collapse in Self-Predictive RL Aidan Scannell, Kalle Kujanpää, Yi Zhao, Mohammadreza Nakhaeinezhadfard, Arno Solin, Joni Pajarinen

PDF OpenReview

Quantum 3D Visual Grounding: A Step Towards Quantum-Inspired AI-Visualization Adib Bazgir, Rama chandra Praneeth Madugula, Yuwen Zhang

PDF OpenReview

Quantum Circuit Synthesis with Diffusion Models Florian Fürruter, Gorka Muñoz-Gil, Hans J Briegel

PDF OpenReview

Quantum-PEFT: Ultra Parameter-Efficient Fine-Tuning Toshiaki Koike-Akino, Francesco Tonin, Yongtao Wu, Leyla Naz Candogan, Volkan Cevher

PDF OpenReview

Query Design for Crowdsourced Clustering: Effect of Cognitive Overload and Contextual Bias Yi Chen, Ramya Korlakai Vinayak

PDF OpenReview

RamanSPy: Augmenting Raman Spectroscopy Data Analysis with AI Dimitar Georgiev, Simon Vilms Pedersen, Ruoxiao Xie, Álvaro Fernández-Galiana, Molly M. Stevens, Mauricio Barahona

PDF OpenReview

RamanSPy: Augmenting Raman Spectroscopy Data Analysis with AI Dimitar Georgiev, Simon Vilms Pedersen, Ruoxiao Xie, Álvaro Fernández-Galiana, Molly M. Stevens, Mauricio Barahona

PDF OpenReview

Random Matrix Theory Analysis of Neural Network Weight Matrices Matthias Thamm, Max Staats, Bernd Rosenow

PDF OpenReview

Randomized Confidence Bounds for Stochastic Partial Monitoring Maxime Heuillet, Ola Ahmad, Audrey Durand

PDF OpenReview

Rank Minimization, Alignment and Weight Decay in Neural Networks David Yunis, Kumar Kshitij Patel, Samuel Wheeler, Pedro Henrique Pamplona Savarese, Gal Vardi, Karen Livescu, Michael Maire, Matthew Walter

PDF OpenReview

Rapid Switching and Multi-Adapter Fusion via Sparse High Rank Adapters Kartikeya Bhardwaj, Nilesh Prasad Pandey, Sweta Priyadarshi, Viswanath Ganapathy, Rafael Esteves, Shreya Kadambi, Shubhankar Borse, Paul Whatmough, Risheek Garrepalli, Mart Van Baalen, Harris Teague, Markus Nagel

PDF OpenReview

Realtime Reinforcement Learning: Towards Rapid Asynchronous Deployment of Large Models Matthew Riemer, Gopeshh Subbaraj, Glen Berseth, Irina Rish

PDF OpenReview

REBEL: Reinforcement Learning via Regressing Relative Rewards Zhaolin Gao, Jonathan Daniel Chang, Wenhao Zhan, Owen Oertell, Gokul Swamy, Kianté Brantley, Thorsten Joachims, J. Andrew Bagnell, Jason D. Lee, Wen Sun

PDF OpenReview

Recommender System Design via Online Feedback Optimization Sanjay Chandrasekaran, Giulia De Pasquale, Giuseppe Belgioioso, Florian Dorfler

PDF OpenReview

Recurrent Natural Policy Gradient for POMDPs Semih Cayci, Atilla Eryilmaz

PDF OpenReview

Recursive Introspection: Teaching Foundation Model Agents How to Self-Improve Yuxiao Qu, Tianjun Zhang, Naman Garg, Aviral Kumar

PDF OpenReview

Recursive Introspection: Teaching LLM Agents How to Self-Improve Yuxiao Qu, Tianjun Zhang, Naman Garg, Aviral Kumar

PDF OpenReview

Recursive Introspection: Teaching LLM Agents How to Self-Improve Yuxiao Qu, Tianjun Zhang, Naman Garg, Aviral Kumar

PDF OpenReview

Recursive Introspection: Teaching LLM Agents How to Self-Improve Yuxiao Qu, Tianjun Zhang, Naman Garg, Aviral Kumar

PDF OpenReview

Reducing Uncertainty Through Mutual Information in Structural and Systems Biology Vincent Zaballa, Elliot E Hui

PDF OpenReview

Refusal in Language Models Is Mediated by a Single Direction Andy Arditi, Oscar Balcells Obeso, Aaquib Syed, Daniel Paleka, Nina Panickssery, Wes Gurnee, Neel Nanda

PDF OpenReview

Regression-Stratified Sampling for Optimized Algorithm Selection in Time-Constrained Tabular AutoML Mehdi Bahrami, So Hasegawa, Lei Liu, Wei-Peng Chen

PDF OpenReview

Regularized Best-of-N Sampling to Mitigate Reward Hacking for Language Model Alignment Yuu Jinnai, Tetsuro Morimura, Kaito Ariu, Kenshi Abe

PDF OpenReview

Regularized Distribution Matching Distillation for One-Step Unpaired Image-to-Image Translation Denis Rakitin, Ivan Shchekotov, Dmitry Vetrov

PDF OpenReview

Regularized KL-Divergence for Well-Defined Function-Space Variational Inference in Bayesian Neural Networks Tristan Cinquin, Robert Bamler

PDF OpenReview

Reinforcement Learning for Efficient Design and Control Co-Optimisation of Energy Systems Marine Cauz, Adrien Bolland, Christophe Ballif, Nicolas Wyrsch

PDF OpenReview

Reinforcement Learning from Bagged Reward Yuting Tang, Xin-Qiang Cai, Yao-Xiang Ding, Qiyu Wu, Guoqing Liu, Masashi Sugiyama

PDF OpenReview

Reinforcement Learning from Human Text Feedback: Learning a Reward Model from Human Text Input Belen Martin Urcelay, Andreas Krause, Giorgia Ramponi

PDF OpenReview

Reinforcement Learning in the Wild with Maximum Likelihood-Based Model Transfer Hannes Eriksson, Tommy Tram, Debabrota Basu, Mina Alibeigi, Christos Dimitrakakis

PDF OpenReview

Reinforcement Learning of Adaptive Acquisition Policies for Inverse Problems Gianluigi Silvestri, Fabio Valerio Massoli, Tribhuvanesh Orekondy, Afshin Abdi, Arash Behboodi

PDF OpenReview

Reinforcement Learning with Lookahead Information Nadav Merlis

PDF OpenReview

Reinforcement Learning with Quasi-Hyperbolic Discounting S R Eshwar, Nibedita Roy, Gugan Thoppe

PDF OpenReview

Relational Composition in Neural Networks: A Survey and Call to Action Martin Wattenberg, Fernanda Viégas

PDF OpenReview

Relatively Rational: Learning Utilities and Rationalities Jointly from Pairwise Preferences Taku Yamagata, Tobias Oberkofler, Timo Kaufmann, Viktor Bengs, Eyke Hüllermeier, Raul Santos-Rodriguez

PDF OpenReview

Relaxed Equivariant Graph Neural Networks Elyssa Hofgard, Rui Wang, Robin Walters, Tess Smidt

PDF OpenReview

Relaxing Graph Transformers for Adversarial Attacks Philipp Foth, Lukas Gosch, Simon Geisler, Leo Schwinn, Stephan Günnemann

PDF OpenReview

Reliability Thresholds for the Bethe Free Energy Approximation Harald Leisenberger, Christian Knoll, Franz Pernkopf

PDF OpenReview

ReLU Characteristic Activation Analysis Wenlin Chen, Hong Ge

PDF OpenReview

ReLU MLPs Can Compute Numerical Integration: Mechanistic Interpretation of a Non-Linear Activation Chun Hei Yip, Rajashree Agrawal, Jason Gross

PDF OpenReview

Repetita Iuvant: Data Repetition Allows SGD to Learn High-Dimensional Multi-Index Functions Luca Arnaboldi, Yatin Dandi, Florent Krzakala, Luca Pesce, Ludovic Stephan

PDF OpenReview

RepoQA: Evaluating Long Context Code Understanding Jiawei Liu, Jia Le Tian, Vijay Daita, Yuxiang Wei, Yifeng Ding, Yuhan Katherine Wang, Jun Yang, Lingming Zhang

PDF OpenReview

Representing Rule-Based Chatbots with Transformers Dan Friedman, Abhishek Panigrahi, Danqi Chen

PDF OpenReview

Resolving Discrepancies in Compute-Optimal Scaling of Language Models Tomer Porian, Mitchell Wortsman, Jenia Jitsev, Ludwig Schmidt, Yair Carmon

PDF OpenReview

Resource-Constrained Neural Architecture Search on Language Models: A Case Study Andreas Paraskeva, Joao Pedro Reis, Suzan Verberne, Jan N. van Rijn

PDF OpenReview

Rethinking Invariance in In-Context Learning Lizhe Fang, Yifei Wang, Khashayar Gatmiry, Lei Fang, Yisen Wang

PDF OpenReview

Rethinking Model-Based, Policy-Based, and Value-Based Reinforcement Learning via the Lens of Representation Complexity Guhao Feng, Han Zhong

PDF OpenReview

Rethinking Molecular Design: Integrating Latent Variable and Auto-Regressive Models for Enhanced Goal Directed Generation Arthur-Louis Heath, Amina Mollaysa, Michael Krauthammer

PDF OpenReview

Retrieval & Fine-Tuning for In-Context Tabular Models Valentin Thomas, Junwei Ma, Rasa Hosseinzadeh, Keyvan Golestan, Guangwei Yu, Maksims Volkovs, Anthony L. Caterini

PDF OpenReview

Retrieve to Explain: Evidence-Driven Predictions with Language Models Ravi Patel, Angus Brayne, Rogier Hintzen, Daniel Jaroslawicz, Georgiana Neculae, Dane S. Corneil

PDF OpenReview

Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual Alignment Zhaofeng Wu, Ananth Balashankar, Yoon Kim, Jacob Eisenstein, Ahmad Beirami

PDF OpenReview

Revealing the Utilized Rank of Subspaces of Learning in Neural Networks Isha Garg, Christian Koguchi, Eshan Verma, Daniel Ulbricht

PDF OpenReview

Reverse Transition Kernel: A Flexible Framework to Accelerate Diffusion Inference Xunpeng Huang, Difan Zou, Hanze Dong, Yi Zhang, Yian Ma, Tong Zhang

PDF OpenReview

Revisiting Cascaded Ensembles for Efficient Inference Steven Kolawole, Don Dennis, Ameet Talwalkar, Virginia Smith

PDF OpenReview

Revisiting Random Walks for Learning on Graphs Jinwoo Kim, Olga Zaghen, Ayhan Suleymanzade, Youngmin Ryou, Seunghoon Hong

PDF OpenReview

Revisiting Score Function Estimators for $k$-Subset Sampling Klas Wijk, Ricardo Vinuesa Motilva, Hossein Azizpour

PDF OpenReview

Revisiting Successor Features for Inverse Reinforcement Learning Arnav Kumar Jain, Harley Wiltzer, Jesse Farebrother, Irina Rish, Glen Berseth, Sanjiban Choudhury

PDF OpenReview

Reward Centering Abhishek Naik, Yi Wan, Manan Tomar, Richard S. Sutton

PDF OpenReview

Reweighted Bellman Targets for Continual Reinforcement Learning Ke Sun, Jun Jin, Xi Chen, Wulong Liu, Linglong Kong

PDF OpenReview

RFamLlama: An Efficient Conditional Language Model for RNA Sequence Generation Across Diverse Structural Families Jinyuan Sun, Han Li, Yifan Deng

PDF OpenReview

RGFN: Synthesizable Molecular Generation Using GFlowNets Michał Koziarski, Andrei Rekesh, Dmytro Shevchuk, Almer M. van der Sloot, Piotr Gaiński, Yoshua Bengio, Cheng-Hao Liu, Mike Tyers, Robert A. Batey

PDF OpenReview

RIO-CPD: A Riemannian Geometric Method for Correlation-Aware Online Change Point Detection Chengyuan Deng, Zhengzhang Chen, Xujiang Zhao, Haoyu Wang, Junxiang Wang, Haifeng Chen, Jie Gao

PDF OpenReview

RISE: 3D Perception Makes Real-World Robot Imitation Simple and Effective Chenxi Wang, Hongjie Fang, Hao-Shu Fang, Cewu Lu

PDF OpenReview

Risk-Aware Bandits for Best Crop Management Dorian Baudry, Romain Gautron

PDF OpenReview

RLHF and IIA: Perverse Incentives Wanqiao Xu, Shi Dong, Xiuyuan Lu, Grace Lam, Zheng Wen, Benjamin Van Roy

PDF OpenReview

RLHF from Heterogeneous Feedback via Personalization and Preference Aggregation Chanwoo Park, Mingyang Liu, Dingwen Kong, Kaiqing Zhang, Asuman E. Ozdaglar

PDF OpenReview

RLHF from Heterogeneous Feedback via Personalization and Preference Aggregation Chanwoo Park, Mingyang Liu, Dingwen Kong, Kaiqing Zhang, Asuman E. Ozdaglar

PDF OpenReview

RNA-FrameFlow for De Novo 3D RNA Backbone Design Rishabh Anand, Chaitanya K. Joshi, Alex Morehead, Arian Rokkum Jamasb, Charles Harris, Simon V Mathis, Kieran Didi, Bryan Hooi, Pietro Lio

PDF OpenReview

RNA-FrameFlow for De Novo 3D RNA Backbone Design Rishabh Anand, Chaitanya K. Joshi, Alex Morehead, Arian Rokkum Jamasb, Charles Harris, Simon V Mathis, Kieran Didi, Bryan Hooi, Pietro Lio

PDF OpenReview

RNAInvBench: Benchmark for the RNA Inverse Design Problem Jack Cole, Fan Li, Liwen Wu, Ke Li

PDF OpenReview

RNR: Teaching Large Language Models to Follow Roles and Rules Kuan Wang, Alexander Bukharin, Haoming Jiang, Qingyu Yin, Zhengyang Wang, Tuo Zhao, Jingbo Shang, Chao Zhang, Bing Yin, Xian Li, Jianshu Chen, Shiyang Li

PDF OpenReview

RoboGolf: Mastering Real-World Minigolf with a Reflective Multi-Modality Vision-Language Model Hantao Zhou, Tianying Ji, Lukas Sommerhalder, Michael Görner, Norman Hendrich, Fuchun Sun, Jianwei Dr. Zhang, Huazhe Xu

PDF OpenReview

Robust Best-of-Both-Worlds Gap Estimators Based on Importance-Weighted Sampling Sarah Clusiau, Saeed Masoudian, Yevgeny Seldin

PDF OpenReview

Robust CLIP: Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models Christian Schlarmann, Naman Deep Singh, Francesco Croce, Matthias Hein

PDF OpenReview

Robust Federated Finetuning of Foundation Models via Alternating Minimization of LoRA Shuangyi Chen, Yue Ju, Hardik Dalal, Zhongwen Zhu, Ashish J Khisti

PDF OpenReview

Robust Knowledge Unlearning via Mechanistic Localizations Phillip Huang Guo, Aaquib Syed, Abhay Sheshadri, Aidan Ewart, Gintare Karolina Dziugaite

PDF OpenReview

Robust Learning of Transfer Functions for Single-Cell Transcriptomics Depth Normalization Da Kuang, Junhyong Kim

PDF OpenReview

Robust Unlearning via Mechanistic Localizations Phillip Huang Guo, Aaquib Syed, Abhay Sheshadri, Aidan Ewart, Gintare Karolina Dziugaite

PDF OpenReview

Robustness Analysis of AI Models in Critical Energy Systems Pantelis Dogoulis, Matthieu Jimenez, Maxime Cordy, Salah Ghamizi, Yves Le Traon

PDF OpenReview

Robustness of Explainable Artificial Intelligence in Industrial Process Modelling Benedikt Kantz, Clemens Staudinger, Christoph Feilmayr, Johannes Wachlmayr, Alexander Haberl, Stefan Schuster, Franz Pernkopf

PDF OpenReview

RouteFinder: Towards Foundation Models for Vehicle Routing Problems Federico Berto, Chuanbo Hua, Nayeli Gast Zepeda, André Hottung, Niels Wouda, Leon Lan, Kevin Tierney, Jinkyoo Park

PDF OpenReview

RouterBench: A Benchmark for Multi-LLM Routing System Qitian Jason Hu, Jacob Bieker, Xiuyu Li, Nan Jiang, Benjamin Keigwin, Gaurav Ranganath, Kurt Keutzer, Shriyash Kaustubh Upadhyay

PDF OpenReview

Rule Based Rewards for Fine-Grained LLM Safety Tong Mu, Alec Helyar, Johannes Heidecke, Joshua Achiam, Andrea Vallone, Ian D Kivlichan, Molly Lin, Alex Beutel, John Schulman, Lilian Weng

PDF OpenReview

Rule-Enhanced Graph Learning Ali Khazraee, Abdolreza Mirzaei, Majjid Farhadi, Parmis Nadaff, Kiarash Zahirnia, Mohammad Salameh, Kevin Cannons, Richard Mar, Mingyi Wu, Oliver Schulte

PDF OpenReview

SA-DQAS: Self-Attention Enhanced Differentiable Quantum Architecture Search Yize Sun, Jiarui Liu, Zixin Wu, Zifeng Ding, Yunpu Ma, Thomas Seidl, Volker Tresp

PDF OpenReview

Safe Exploration in Reproducing Kernel Hilbert Spaces Abdullah Tokmak, Kiran G. Krishnan, Thomas B. Schön, Dominik Baumann

PDF OpenReview

Safe Online Nonstochastic Control from Data Sebastian Kerz, Armin Lederer, Marion Leibold, Dirk Wollherr

PDF OpenReview

Safe Reinforcement Learning with Contrastive Risk Prediction Hanping Zhang, Yuhong Guo

PDF OpenReview

Safer Reinforcement Learning by Going Off-Policy: A Benchmark Igor Kuznetsov

PDF OpenReview

SAIL: Self-Improving Efficient Online Alignment of Large Language Models Mucong Ding, Souradip Chakraborty, Vibhu Agrawal, Zora Che, Alec Koppel, Mengdi Wang, Amrit Bedi, Furong Huang

PDF OpenReview

SatDiffMoE: A Mixture of Estimation Method for Satellite Image Super-Resolution with Latent Diffusion Models Zhaoxu Luo, Bowen Song, Liyue Shen

PDF OpenReview

SatDiffMoE: A Mixture of Estimation Method for Satellite Image Super-Resolution with Latent Diffusion Models Bowen Song, Zhaoxu Luo, Liyue Shen

PDF OpenReview

Scalable AI Safety via Doubly-Efficient Debate Jonah Brown-Cohen, Geoffrey Irving, Georgios Piliouras

PDF OpenReview

Scalable Anomaly Detection in Batch Polishing Processes for Inertial Confinement Fusion Shells Shashank Galla, Akash Tiwari, Kshitij Bhardwaj, Sean Michael Hayes, Satish Bukkapatnam, Suhas Bhandarkar

PDF OpenReview

Scalable Approaches for a Theory of Many Minds Maximilian Puelma Touzel, Amin Memarian, Matthew Riemer, Andrei Mircea, Andrew Robert Williams, Elin Ahlstrand, Lucas Lehnert, Rupali Bhati, Guillaume Dumas, Irina Rish

PDF OpenReview

Scalable Local Intrinsic Dimension Estimation with Diffusion Models Hamidreza Kamkari, Brendan Leigh Ross, Rasa Hosseinzadeh, Jesse C. Cresswell, Gabriel Loaiza-Ganem

PDF OpenReview

Scalable Multi-Task Transfer Learning for Molecular Property Prediction Chanhui Lee, Dae-Woong Jeong, Sung Moon Ko, Sumin Lee, Hyunseung Kim, Soorin Yim, Sehui Han, Sungwoong Kim, Sungbin Lim

PDF OpenReview

Scalable Oversight by Accounting for Unreliable Feedback Shivam Singhal, Cassidy Laidlaw, Anca Dragan

PDF OpenReview

Scalable Unsupervised Alignment of Metric and Nonmetric Structures Sanketh Vedula, Valentino Maiorca, Lorenzo Basile, Francesco Locatello, Alexander Bronstein

PDF OpenReview

Scalably Solving Assistance Games Cassidy Laidlaw, Eli Bronstein, Timothy Guo, Dylan Feng, Lukas Berglund, Justin Svegliato, Stuart Russell, Anca Dragan

PDF OpenReview

ScaLES: Scalable Latent Exploration Score for Pre-Trained Generative Networks Omer Ronen, Ahmed Imtiaz Humayun, Randall Balestriero, Richard Baraniuk, Bin Yu

PDF OpenReview

Scalify: Scale Propagation for Efficient Low-Precision LLM Training Paul Balanca, Samuel Hosegood, Carlo Luschi, Andrew W Fitzgibbon

PDF OpenReview

Scaling Automated Quantum Error Correction Discovery with Reinforcement Learning Jan Olle, Remmy Zen, Matteo Puviani, Florian Marquardt

PDF OpenReview

Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations Alexander Hägele, Elie Bakouch, Atli Kosson, Loubna Ben Allal, Leandro Von Werra, Martin Jaggi

PDF OpenReview

Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms Rafael Rafailov, Yaswanth Chittepu, Ryan Park, Harshit Sikchi, Joey Hejna, W. Bradley Knox, Chelsea Finn, Scott Niekum

PDF OpenReview

Scaling the Vocabulary of Non-Autoregressive Models for Efficient Generative Retrieval Ravisri Valluri, Akash Kumar Mohankumar, Kushal S. Dave, Amit S, Jian Jiao, Manik Varma, Gaurav Sinha

PDF OpenReview

Scaling up Diffusion and Flow-Based XGBoost Models Jesse C. Cresswell, Taewoo Kim

PDF OpenReview

Scanning Tunneling Microscopy (STM) Image Segmentation Using Unsupervised and Few-Shot Learning Nikola Kolev, Emily Hofmann, Geoff Thornton, Max Trouton, Filippo Federici, David Gao, Steven Schofield, Taylor Stock, Neil Curson

PDF OpenReview

Scavenging Hyena: Distilling Transformers into Long Convolution Models Tokiniaina Raharison Ralambomihanta, Shahrad Mohammadzadeh, Sami Nur Islam, Wassim Jabbour, Laurence Liang

PDF OpenReview

SCENE-Net V2: Interpretable Multiclass 3D Scene Understanding with Geometric Priors Diogo Mateus Lavado, Claudia Soares, Alessandra Micheletti

PDF OpenReview

Scoreformer: A Surrogate Model for Large-Scale Prediction of Docking Scores Alvaro Ciudad Serrano, Adrian Morales-Pastor, Laura Malo, Isaac Filella-Merce, Victor Guallar, Alexis Molina

PDF OpenReview

scTree: Discovering Cellular Hierarchies in the Presence of Batch Effects in scRNA-Seq Data Moritz Vandenhirtz, Florian Barkmann, Laura Manduchi, Julia E Vogt, Valentina Boeva

PDF OpenReview

scTree: Discovering Cellular Hierarchies in the Presence of Batch Effects in scRNA-Seq Data Moritz Vandenhirtz, Florian Barkmann, Laura Manduchi, Julia E Vogt, Valentina Boeva

PDF OpenReview

SE(3)-Equivariant Diffusion Graph Nets: Synthesizing Flow Fields by Denoising Invariant Latents on Graphs Mario Lino Valencia, Nils Thuerey, Tobias Pfaff

PDF OpenReview

SE(3)-Hyena Operator for Scalable Equivariant Learning Artem Moskalev, Mangal Prakash, Rui Liao, Tommaso Mansi

PDF OpenReview

SE3ET: SE(3)-Equivariant Transformer for Low-Overlap Point Cloud Registration Chien Erh Lin, Minghan Zhu, Maani Ghaffari

PDF OpenReview

Secondary Structure-Guided Novel Protein Sequence Generation with Latent Graph Diffusion Yutong Hu, Yang Tan, Andi Han, Lirong Zheng, Liang Hong, Bingxin Zhou

PDF OpenReview

SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound Rishit Dagli, Shivesh Prakash, Robert Wu, Houman Khosravani

PDF OpenReview

Seeded LoRA: Collaborative Fine-Tuning Through Seed Initialization of Adapters Alejandro R. Salamanca, Ahmet Üstün, Nicki Skafte Detlefsen, Tim Dettmers

PDF OpenReview

Segmentation CNNs Are Denoising Models Luis A. Zavala-Mondragón, Ruud Van Sloun, Peter H.N. de With, Fons van der Sommen

PDF OpenReview

Self-Cognition in Large Language Models: An Exploratory Study Dongping Chen, Jiawen Shi, Neil Zhenqiang Gong, Yao Wan, Pan Zhou, Lichao Sun

PDF OpenReview

Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller Min Cai, Yuchen Zhang, Shichang Zhang, Fan Yin, Difan Zou, Yisong Yue, Ziniu Hu

PDF OpenReview

Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller Min Cai, Yuchen Zhang, Shichang Zhang, Fan Yin, Difan Zou, Yisong Yue, Ziniu Hu

PDF OpenReview

Self-Exploring Language Models: Active Preference Elicitation for Online Alignment Shenao Zhang, Donghan Yu, Hiteshi Sharma, Ziyi Yang, Shuohang Wang, Hany Hassan Awadalla, Zhaoran Wang

PDF OpenReview

Self-Play Preference Optimization for Language Model Alignment Yue Wu, Zhiqing Sun, Huizhuo Yuan, Kaixuan Ji, Yiming Yang, Quanquan Gu

PDF OpenReview

Self-Supervised Detection of Perfect and Partial Input-Dependent Symmetries Alonso Urbano, David W. Romero

PDF OpenReview

Self-Supervised Learning for Crystal Property Prediction via Denoising Alexander New, Nam Q Le, Michael Pekala, Christopher D Stiles

PDF OpenReview

Semantic Entropy Probes: Robust and Cheap Hallucination Detection in LLMs Jiatong Han, Jannik Kossen, Muhammed Razzak, Lisa Schut, Shreshth A Malik, Yarin Gal

PDF OpenReview

SemioLLM: Assessing Large Language Models for Semiological Analysis in Epilepsy Research Meghal Dani, Muthu Jeyanthi Prakash, Zeynep Akata, Stefanie Liebe

PDF OpenReview

Sequential Decision Making with Expert Demonstrations Under Unobserved Heterogeneity Vahid Balazadeh, Keertana Chidambaram, Viet Nguyen, Rahul Krishnan, Vasilis Syrgkanis

PDF OpenReview

Serial Monopoly on Blockchains with Quasi-Patient Users Paolo Penna, Manvir Schneider

PDF OpenReview

Setting the Record Straight on Transformer Oversmoothing Gbetondji Jean-Sebastien Dovonon, Michael M. Bronstein, Matt Kusner

PDF OpenReview

SGD vs GD: Rank Deficiency in Linear Networks Aditya Varre, Margarita Sagitova, Nicolas Flammarion

PDF OpenReview

Shall We Team up: Exploring Spontaneous Cooperation of Competing LLM Agents Zengqing Wu, Brian I. Kwon, Shuyuan Zheng, Qianying Liu, Xu Han, Makoto Onizuka, Shaojie Tang, Run Peng, Chuan Xiao

PDF OpenReview

Sheaf Diffusion Goes Nonlinear: Enhancing GNNs with Adaptive Sheaf Laplacians Olga Zaghen, Antonio Longa, Steve Azzolin, Lev Telyatnikov, Andrea Passerini, Pietro Lio

PDF OpenReview

SheetAgent: A Generalist Agent for Spreadsheet Reasoning and Manipulation via Large Language Models Yibin Chen, Yifu Yuan, Zeyu Zhang, Yan Zheng, Jinyi Liu, Fei Ni, Jianye Hao

PDF OpenReview

Should You Trust DQN? Aditya Gopalan, Gugan Thoppe

PDF OpenReview

SiBBlInGS: Similarity-Driven Building-Block Inference Using Graphs Across States Noga Mudrik, Gal Mishne, Adam Shabti Charles

PDF OpenReview

Simple and Effective Masked Diffusion Language Models Subham Sekhar Sahoo, Marianne Arriola, Aaron Gokaslan, Edgar Mariano Marroquin, Alexander M Rush, Yair Schiff, Justin T Chiu, Volodymyr Kuleshov

PDF OpenReview

Simple and Effective Masked Diffusion Language Models Subham Sekhar Sahoo, Marianne Arriola, Aaron Gokaslan, Edgar Mariano Marroquin, Alexander M Rush, Yair Schiff, Justin T Chiu, Volodymyr Kuleshov

PDF OpenReview

Simple Linear Attention Language Models Balance the Recall-Throughput Tradeoff Simran Arora, Sabri Eyuboglu, Michael Zhang, Aman Timalsina, Silas Alberti, Dylan Zinsley, James Zou, Atri Rudra, Christopher Re

PDF OpenReview

Simple, Unified Analysis of Johnson-Lindenstrauss with Applications Yingru Li

PDF OpenReview

Single Train Multi Deploy on Topology Search Spaces Using Kshot-Hypernet Jingyue Zhuge, Christian Mayr, Anand Subramoney, David Kappel

PDF OpenReview

SINR: Equivariant Neural Vector Fields David Ruhe, Patrick Forré

PDF OpenReview

Skill-Enhanced Reinforcement Learning Acceleration from Demonstrations Hanping Zhang, Yuhong Guo

PDF OpenReview

SkillAct: Using Skill Abstractions Improves LLM Agents Anthony Zhe Liu, Jongwook Choi, Sungryull Sohn, Yao Fu, Jaekyeom Kim, Dong-Ki Kim, Xinhe Wang, Jaewon Yoo, Honglak Lee

PDF OpenReview

Slicedit: Zero-Shot Video Editing with Text-to-Image Diffusion Models Using Spatio-Temporal Slices Nathaniel Cohen, Vladimir Kulikov, Matan Kleiner, Inbar Huberman-Spiegelglas, Tomer Michaeli

PDF OpenReview

Slicedit: Zero-Shot Video Editing with Text-to-Image Diffusion Models Using Spatio-Temporal Slices Nathaniel Cohen, Vladimir Kulikov, Matan Kleiner, Inbar Huberman-Spiegelglas, Tomer Michaeli

PDF OpenReview

Slow Games D Reusche, Christopher Goes, Nicolas Della Penna

PDF OpenReview

Smart Vision-Language Reasoners Denisa Roberts, Lucas Roberts

PDF OpenReview

Smoke and Mirrors in Causal Downstream Tasks Riccardo Cadei, Lukas Lindorfer, Sylvia Cremer, Cordelia Schmid, Francesco Locatello

PDF OpenReview

SMX: Sequential Monte Carlo Planning for Expert Iteration Edan Toledo, Matthew Macfarlane, Donal John Byrne, Siddarth Singh, Paul Duckworth, Alexandre Laterre

PDF OpenReview

Snapshot Reinforcement Learning: Leveraging Prior Trajectories for Efficiency Yanxiao Zhao, Yangge Qian, Tianyi Wang, Jingyang Shan, Xiaolin Qin

PDF OpenReview

SOLMformer - Incorporating Sequence and Observation Level Metadata for Categorical Time Series Modeling Yamini Vibha Ananth, Gregory Benton, Jingxing Fang, Jerry Junyang Cheung, Xu Chu, Cong Yu

PDF OpenReview

Sorting Out Quantum Monte Carlo Jack Richter-Powell, Luca Thiede, Alan Aspuru-Guzik, David Duvenaud

PDF OpenReview

Sparse Autoencoders Match Supervised Features for Model Steering on the IOI Task Aleksandar Makelov

PDF OpenReview

Sparse Network Initialization Using Deterministic Ramanujan Graphs Arindam Biswas, Suryam Arnav Kalra, Pabitra Mitra, Biswajit Basu

PDF OpenReview

Spatio-Spectral Graph Neural Networks Simon Geisler, Arthur Kosmala, Daniel Herbst, Stephan Günnemann

PDF OpenReview

SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengths Kaixuan Huang, Xudong Guo, Mengdi Wang

PDF OpenReview

Specify What? a Case-Study Using GPT-4 and Formal Methods for Specification Synthesis George Granberry, Wolfgang Ahrendt, Moa Johansson

PDF OpenReview

Spectral State Space Models Naman Agarwal, Daniel Suo, Xinyi Chen, Elad Hazan

PDF OpenReview

Spectrum-Informed Multistage Neural Network: Multiscale Function Approximator of Machine Precision Jakin Ng, Yongji Wang, Ching-Yao Lai

PDF OpenReview

Split, Unlearn, Merge: Leveraging Data Attributes for More Effective Unlearning in LLMs Swanand Kadhe, Farhan Ahmed, Dennis Wei, Nathalie Baracaldo, Inkit Padhi

PDF OpenReview

Stability Analysis of Equivariant Convolutional Representations Through the Lens of Equivariant Multi-Layered CKNs Soutrik Roy Chowdhury

PDF OpenReview

Stabilizing the Training of Consistency Models with Score Guidance Jeongjun Lee, Jonggeon Park, Jongmin Yoon, Juho Lee

PDF OpenReview

Stable Differentiable Causal Discovery Achille Nazaret, Justin Hong, Elham Azizi, David Blei

PDF OpenReview

State Space Models Are Comparable to Transformers in Estimating Functions with Dynamic Smoothness Naoki Nishikawa, Taiji Suzuki

PDF OpenReview

Steering Language Models with Game-Theoretic Solvers Ian Gemp, Roma Patel, Yoram Bachrach, Marc Lanctot, Vibhavari Dasagi, Luke Marris, Georgios Piliouras, Siqi Liu, Karl Tuyls

PDF OpenReview

Stein Variational Newton Neural Network Ensembles Klemens Flöge, Muhammad Abdul Moeed, Vincent Fortuin

PDF OpenReview

Step-on-Feet Tuning: Scaling Self-Alignment of LLMs via Bootstrapping Haoyu Wang, Guozheng Ma, Ziqiao Meng, Zeyu Qin, Li Shen, Zhong Zhang, Bingzhe Wu, Liu Liu, Yatao Bian, Tingyang Xu, Xueqian Wang, Peilin Zhao

PDF OpenReview

Stitching Manifolds: Leveraging Interaction to Compose Object Representations into Scenes. Hamza Keurti, Bernhard Schölkopf, Pau Vilimelis Aceituno, Benjamin F Grewe

PDF OpenReview

Stochastic Concept Bottleneck Models Moritz Vandenhirtz, Sonia Laguna, Ričards Marcinkevičs, Julia E Vogt

PDF OpenReview

Stochastic Concept Bottleneck Models Moritz Vandenhirtz, Sonia Laguna, Ričards Marcinkevičs, Julia E Vogt

PDF OpenReview

Strategist: Learning Strategic Skills by LLMs via Bi-Level Tree Search Jonathan Light, Min Cai, Weiqin Chen, Guanzhi Wang, Xiusi Chen, Wei Cheng, Yisong Yue, Ziniu Hu

PDF OpenReview

STREAM: Embodied Reasoning Through Code Generation Daniil Cherniavskii, Phillip Lippe, Andrii Zadaianchuk, Efstratios Gavves

PDF OpenReview

Stress-Testing Long-Context Language Models with Lifelong ICL and Task Haystack Xiaoyue Xu, Qinyuan Ye, Xiang Ren

PDF OpenReview

STRIDE: A Tool-Assisted LLM Agent Framework for Strategic and Interactive Decision-Making Chuanhao Li, Runhan Yang, Tiankai Li, Milad Bafarassat, Kourosh Sharifi, Dirk Bergemann, Zhuoran Yang

PDF OpenReview

STRIDE: A Tool-Assisted LLM Agent Framework for Strategic and Interactive Decision-Making Chuanhao Li, Runhan Yang, Tiankai Li, Milad Bafarassat, Kourosh Sharifi, Dirk Bergemann, Zhuoran Yang

PDF OpenReview

Strong Copyright Protection for Language Models via Adaptive Model Fusion Javier Abad, Konstantin Donhauser, Francesco Pinto, Fanny Yang

PDF OpenReview

Strongly Isomorphic Neural Optimal Transport Across Incomparable Spaces Athina Sotiropoulou, David Alvarez-Melis

PDF OpenReview

Structural Activity Prediction Models Recover Known Binding Modes (Poster Abstract) Michael Backenköhler, Joschka Groß, Paula Linh Kramer, Verena Wolf, Andrea Volkamer

PDF OpenReview

Structure- and Function-Aware Substitution Matrices via Differentiable Graph Matching Paolo Pellizzoni, Carlos Oliver, Karsten Borgwardt

PDF OpenReview

Structure-Based Drug Design Benchmark: Do 3D Methods Really Dominate? Kangyu Zheng, Yingzhou Lu, Zaixi Zhang, Zhongwei Wan, Yao Ma, Marinka Zitnik, Tianfan Fu

PDF OpenReview

Structured Generations: Using Hierarchical Clusters to Guide Diffusion Models Jorge da Silva Gonçalves, Laura Manduchi, Moritz Vandenhirtz, Julia E Vogt

PDF OpenReview

Sum-Max Submodular Bandits Stephen Pasteris, Alberto Rumi, Fabio Vitale, Nicolò Cesa-Bianchi

PDF OpenReview

Survival of the Fittest Representation: A Case Study with Modular Addition Xiaoman Delores Ding, Zifan Carl Guo, Eric J Michaud, Ziming Liu, Max Tegmark

PDF OpenReview

Survive on Planet Pandora: Robust Cross-Domain RL Under Distinct State-Action Representations Kuan-Chen Pan, MingHong Chen, Xi Liu, Ping-Chun Hsieh

PDF OpenReview

SVFT: Parameter-Efficient Fine-Tuning with Singular Vectors Vijay Lingam, Atula Tejaswi Neerkaje, Aditya Vavre, Aneesh Shetty, Gautham Krishna Gudur, Joydeep Ghosh, Eunsol Choi, Alex Dimakis, Aleksandar Bojchevski, Sujay Sanghavi

PDF OpenReview

SVFT: Parameter-Efficient Fine-Tuning with Singular Vectors Vijay Lingam, Atula Tejaswi Neerkaje, Aditya Vavre, Aneesh Shetty, Gautham Krishna Gudur, Joydeep Ghosh, Alex Dimakis, Eunsol Choi, Aleksandar Bojchevski, Sujay Sanghavi

PDF OpenReview

Swallowing the Bitter Pill: Simplified Scalable Conformer Generation Yuyang Wang, Ahmed A. A. Elhag, Navdeep Jaitly, Joshua M. Susskind, Miguel Ángel Bautista

PDF OpenReview

SWUS: Active Learning with Structure Weighted Uncertainty Score Andrea Karlova, Brooks Paige

PDF OpenReview

Symbolic Autoencoding for Self-Supervised Sequence Learning Mohammad Hossein Amani, Nicolas Baldwin, Amin Mansouri, Martin Josifoski, Maxime Peyrard, Robert West

PDF OpenReview

Symbolic Regression with a Learned Concept Library Arya Grayeli, Atharva Sehgal, Omar Costilla Reyes, Miles Cranmer, Swarat Chaudhuri

PDF OpenReview

Synthetic Data-Driven Prediction of Height for Childhood Malnutrition David Berthiaume, Yuan Tang, Chau Nguyen, Siyu Gai, Emilia Mazzolenis, Weiwei Pan

PDF OpenReview

TabMDA: Tabular Manifold Data Augmentation for Any Classifier Using Transformers with In-Context Subsetting Andrei Margeloiu, Adrián Bazaga, Nikola Simidjievski, Pietro Lio, Mateja Jamnik

PDF OpenReview

Tackling Polysemanticity with Neuron Embeddings Alex Foote

PDF OpenReview

TAGMol: Target-Aware Gradient-Guided Molecule Generation Vineeth Dorna, D. Subhalingam, Keshav Kolluru, Shreshth Tuli, Mrityunjay Singh, Saurabh Singal, N M Anoop Krishnan, Sayan Ranu

PDF OpenReview

Tail Extrapolation in Target-Aware Conditional Molecule Generation Weichi Yao, Cameron Gruich, Bryan Goldsmith, Yixin Wang

PDF OpenReview

Talking Nonsense: Probing Large Language Models' Understanding of Adversarial Gibberish Inputs Valeriia Cherepanova, James Zou

PDF OpenReview

TarDis: Achieving Robust and Structured Disentanglement of Multiple Covariates Kemal Inecik, Aleyna Kara, Antony Rose, Muzlifah Haniffa, Fabian J Theis

PDF OpenReview

Task Addition and Weight Disentanglement in Closed-Vocabulary Models Adam Hazimeh, Alessandro Favero, Pascal Frossard

PDF OpenReview

Task Addition in Multi-Task Learning by Geometrical Alignment Soorin Yim, Dae-Woong Jeong, Sung Moon Ko, Sumin Lee, Hyunseung Kim, Chanhui Lee, Sehui Han

PDF OpenReview

Task Descriptors Help Transformers Learn Linear Models In-Context Ruomin Huang, Rong Ge

PDF OpenReview

Teaching Dark Matter Simulations to Speak the Halo Language Shivam Pandey, Francois Lanusse, Chirag Modi, Benjamin Dan Wandelt

PDF OpenReview

Teaching Large Language Models to Reason with Reinforcement Learning Alexander Havrilla, Yuqing Du, Sharath Chandra Raparthy, Christoforos Nalmpantis, Jane Dwivedi-Yu, Eric Hambro, Sainbayar Sukhbaatar, Roberta Raileanu

PDF OpenReview

Teaching Transformers Causal Reasoning Through Axiomatic Training Aniket Vashishtha, Abhinav Kumar, Abbavaram Gowtham Reddy, Vineeth N. Balasubramanian, Amit Sharma

PDF OpenReview

Technical Report for ICML 2024 Automated Math Reasoning Challenge: Solving Optimization Problems with Open Source Large Language Model Duc M. Nguyen, Sungahn Ko

PDF OpenReview

Temporal Graph Rewiring with Expander Graphs Katarina Petrović, Shenyang Huang, Farimah Poursafaei, Petar Veličković

PDF OpenReview

Test-Time Adaptation with State-Space Models Mona Schirmer, Dan Zhang, Eric Nalisnick

PDF OpenReview

Test-Time Prototype Evolution for Generalizable Vision-Language Models Ce Zhang, Simon Stepputtis, Katia P. Sycara, Yaqi Xie

PDF OpenReview

Text Serialization and Their Relationship with the Conventional Paradigms of Tabular Machine Learning Simon Austin Lee, Kyoka Ono

PDF OpenReview

The Butterfly Effect: Tiny Perturbations Cause Neural Network Training to Diverge Gül Sena Altıntaş, Devin Kwok, David Rolnick

PDF OpenReview

The Concept Percolation Hypothesis: Analyzing the Emergence of Capabilities in Neural Networks Trained on Formal Grammars Ekdeep Singh Lubana, Kyogo Kawaguchi, Robert P. Dick, Hidenori Tanaka

PDF OpenReview

The Consensus Game: Language Model Generation via Equilibrium Search Athul Paul Jacob, Yikang Shen, Gabriele Farina, Jacob Andreas

PDF OpenReview

The Convolution-Closed Hurdle Motif with an Application to Tensor Decomposition John Hood, Aaron Schein

PDF OpenReview

The Convolution-Closed Hurdle Motif with an Application to Tensor Decomposition John Hood, Aaron Schein

PDF OpenReview

The Effect of Data Corruption on Multimodal Long Form Responses Daniel Z Kaplan, Alexis Roger, Mohamed Osman, Irina Rish

PDF OpenReview

The Efficacy of Pre-Training in Chemical Graph Out-of-Distribution Generalization Qi Liu, Rosa H. M. Chan, Rose Yu

PDF OpenReview

The Embodied World Model Based on LLM with Visual Information and Prediction-Oriented Prompts Wakana Haijima, Kou Nakakubo, Masahiro Suzuki, Yutaka Matsuo

PDF OpenReview

The Empirical Impact of Neural Parameter Symmetries, or Lack Thereof Derek Lim, Theo Putterman, Robin Walters, Haggai Maron, Stefanie Jegelka

PDF OpenReview

The GAN Is Dead; Long Live the GAN! a Modern Baseline GAN Nick Huang, Aaron Gokaslan, Volodymyr Kuleshov, James Tompkin

PDF OpenReview

The Geometry of Categorical and Hierarchical Concepts in Large Language Models Kiho Park, Yo Joong Choe, Yibo Jiang, Victor Veitch

PDF OpenReview

The Geometry of Categorical and Hierarchical Concepts in Large Language Models Kiho Park, Yo Joong Choe, Yibo Jiang, Victor Veitch

PDF OpenReview

The Geometry of Diffusion Models: Tubular Neighbourhoods and Singularities Kotaro Sakamoto, Ryosuke Sakamoto, Masato Tanabe, Masatomo Akagawa, Yusuke Hayashi, Manato Yaguchi, Masahiro Suzuki, Yutaka Matsuo

PDF OpenReview

The Hidden Pitfalls of the Cosine Similarity Loss Andrew Draganov, Sharvaree Vadgama, Erik J Bekkers

PDF OpenReview

The Implicit Bias of Adam on Separable Data Chenyang Zhang, Difan Zou, Yuan Cao

PDF OpenReview

The Importance of Online Data: Understanding Preference Fine-Tuning via Coverage Yuda Song, Gokul Swamy, Aarti Singh, Drew Bagnell, Wen Sun

PDF OpenReview

The Mamba in the Llama: Distilling and Accelerating Hybrid Models Junxiong Wang, Daniele Paliotta, Avner May, Alexander M Rush, Tri Dao

PDF OpenReview

The Minimax Regret of Sequential Probability Assignment, Contextual Shtarkov Sums, and Contextual Normalized Maximum Likelihood Ziyi Liu, Idan Attias, Daniel M. Roy

PDF OpenReview

The Missing Curve Detectors of InceptionV1: Applying Sparse Autoencoders to InceptionV1 Early Vision Liv Gorton

PDF OpenReview

The NGT200 Dataset - Geometric Multi-View Isolated Sign Recognition Oline Ranum, David Wessels, Gomèr Otterspeer, Erik J Bekkers, Floris Roelofsen, Jari I. Andersen

PDF OpenReview

The Optimization Landscape of Spectral Neural Network Chenghui Li, Rishi Sonthalia, Nicolas Garcia Trillos

PDF OpenReview

The Price of Freedom: Exploring Tradeoffs Between Expressivity and Computational Efficiency in Equivariant Tensor Products YuQing Xie, Ameya Daigavane, Mit Kotak, Tess Smidt

PDF OpenReview

The Pupil Becomes the Master: Eye-Tracking Feedback for Tuning LLMs Samuel Kiegeland, David Robert Reich, Ryan Cotterell, Lena Ann Jäger, Ethan Wilcox

PDF OpenReview

The Remarkable Robustness of LLMs: Stages of Inference? Vedang Lad, Wes Gurnee, Max Tegmark

PDF OpenReview

The Scaling Law in Astronomical Time Series Data Jia-Shu Pan, Yuan-Sen Ting, Jie Yu, Yang Huang, Ji-Feng Liu

PDF OpenReview

The Value of Reward Lookahead in Reinforcement Learning Nadav Merlis, Dorian Baudry, Vianney Perchet

PDF OpenReview

Theoretical Analyses of Hyperparameter Selection in Graph-Based Semi-Supervised Learning Ally Yalei Du, Eric Huang, Dravyansh Sharma

PDF OpenReview

Think Big, Generate Quick: LLM-to-SLM for Fast Autoregressive Decoding Benjamin Bergner, Andrii Skliar, Amelie Royer, Tijmen Blankevoort, Yuki M Asano, Babak Ehteshami Bejnordi

PDF OpenReview

Thinking Out-of-the-Box: A Comparative Investigation of Human and LLMs in Creative Problem-Solving Yufei Tian, Abhilasha Ravichander, Lianhui Qin, Ronan Le Bras, Raja Marjieh, Nanyun Peng, Yejin Choi, Thomas L. Griffiths, Faeze Brahman

PDF OpenReview

Three Mechanisms of Feature Learning in an Analytically Solvable Model Yizhou Xu, Liu Ziyin

PDF OpenReview

Tight Bounds for Online Convex Optimization with Adversarial Constraints Abhishek Sinha, Rahul Vaze

PDF OpenReview

TimeDiT: General-Purpose Diffusion Transformers for Time Series Foundation Model Defu Cao, Wen Ye, Yan Liu

PDF OpenReview

TinyAgent: Quantization-Aware Model Compression and Adaptation for On-Device LLM Agent Deployment Jason Kong, Lanxiang Hu, Flavio Ponzina, Tajana Rosing

PDF OpenReview

TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones Zhengqing Yuan, Zhaoxu Li, Weiran Huang, Yanfang Ye, Lichao Sun

PDF OpenReview

To Compete or to Collude: Builder Incentives in MEV-Boost Auctions Fei Wu, Thomas Thiery, Stefanos Leonardos, Carmine Ventre

PDF OpenReview

Tokenized SAEs: Disentangling SAE Reconstructions Thomas Dooms, Daniel Wilhelm

PDF OpenReview

Topological and Dynamical Representations for Radio Frequency Signal Classification Tegan Emerson, Timothy Doster, Colin C Olson, Audun Myers

PDF OpenReview

Topological Neural Networks Go Persistent, Equivariant and Continuous Yogesh Verma, Amauri H Souza, Vikas Garg

PDF OpenReview

Topology-Informed Graph Transformer Yun Young Choi, Sun Woo Park, Minho Lee, Youngho Woo

PDF OpenReview

Toward Global Convergence of Gradient EM for Over-Parameterized Gaussian Mixture Models Weihang Xu, Maryam Fazel, Simon Shaolei Du

PDF OpenReview

Towards Adaptive Attacks on Constrained Tabular Machine Learning Thibault Simonetto, Salah Ghamizi, Maxime Cordy

PDF OpenReview

Towards Adversarially Robust Vision-Language Models: Insights from Design Choices and Prompt Formatting Techniques Rishika Bhagwatkar, Shravan Nayak, Reza Bayat, Alexis Roger, Daniel Z Kaplan, Pouya Bashivan, Irina Rish

PDF OpenReview

Towards Aligning Language Models with Textual Feedback Saüc Abadal Lloret, Shehzaad Dhuliawala, Keerthiram Murugesan, Mrinmaya Sachan

PDF OpenReview

Towards Bridging Classical and Neural Computation Through a Read-Eval-Print Loop David W. Zhang, Michaël Defferrard, Corrado Rainone, Roland Memisevic

PDF OpenReview

Towards Detailed and Interpretable Hybrid Modeling of Continental-Scale Bird Migration Fiona Lippert, Bart Kranstauber, Patrick Forré, Emiel van Loon

PDF OpenReview

Towards Dynamic Feature Acquisition on Medical Time Series by Maximizing Conditional Mutual Information Fedor Sergeev, Paola Malsot, Gunnar Ratsch, Vincent Fortuin

PDF OpenReview

Towards Efficient and Scalable Training of Differentially Private Deep Learning Sebastian Rodriguez Beltran, Marlon Tobaben, Niki Andreas Loppi, Antti Honkela

PDF OpenReview

Towards Efficient Large-Scale Language-3D Representation Learning Shentong Mo, Xiaogang Xu, Tongzhou Wang, Antonio Torralba, Shuang Li

PDF OpenReview

Towards Empowerment Gain Through Causal Structure Learning in Model-Based RL Hongye Cao, Fan Feng, Meng Fang, Shaokang Dong, Jing Huo, Yang Gao

PDF OpenReview

Towards Enforcing Hard Physics Constraints in Operator Learning Frameworks Valentin Duruisseaux, Miguel Liu-Schiaffini, Julius Berner, Anima Anandkumar

PDF OpenReview

Towards General Geometries for Embedding Knowledge Graphs Samuel G. Fadel, Tino Paulsen, Sebastian Mair

PDF OpenReview

Towards Generalizable Particle Picking in Cryo-EM Images by Leveraging Masked AutoEncoder Andreas Zamanos, Panagiotis Koromilas, Giorgos Bouritsas, Panagiotis L. Kastritis, Yannis Panagakis

PDF OpenReview

Towards Human-AI Collaboration in Healthcare: Guided Deferral Systems with Large Language Models Joshua Strong, Qianhui Men, Alison Noble

PDF OpenReview

Towards Linking Graph Topology to Model Performance for Biomedical Knowledge Graph Completion Alberto Cattaneo, Thomas Martynec, Stephen Bonner, Carlo Luschi, Daniel Justus

PDF OpenReview

Towards Reliable Uncertainty Estimates for Drug Discovery: A Large-Scale Temporal Study of Probability Calibration Hannah Rosa Friesacher, Emma Svensson, Adam Arany, Lewis Mervin, Ola Engkvist

PDF OpenReview

Towards Safe Large Language Models for Medicine Tessa Han, Aounon Kumar, Chirag Agarwal, Himabindu Lakkaraju

PDF OpenReview

Towards Safe Large Language Models for Medicine Tessa Han, Aounon Kumar, Chirag Agarwal, Himabindu Lakkaraju

PDF OpenReview

Towards Safe Large Language Models for Medicine Tessa Han, Aounon Kumar, Chirag Agarwal, Himabindu Lakkaraju

PDF OpenReview

Towards Smaller Language Models via Layer Looping Sabri Eyuboglu, Dylan Zinsley, Jon Saad-Falcon, Simran Arora, Atri Rudra, James Zou, Christopher Re

PDF OpenReview

Towards the Transferability of Rewards Recovered via Regularized Inverse Reinforcement Learning Andreas Schlaginhaufen, Maryam Kamgarpour

PDF OpenReview

Towards Zero-Shot Generalization in Offline Reinforcement Learning Zhiyong Wang, Chen Yang, John C.S. Lui, Dongruo Zhou

PDF OpenReview

Trace Is the New AutoDiff — Unlocking Efficient Optimization of Computational Workflows Ching-An Cheng, Allen Nie, Adith Swaminathan

PDF OpenReview

TracrBench: Generating Interpretability Testbeds with Large Language Models Hannes Thurnherr, Jérémy Scheurer

PDF OpenReview

Train Your Cake and Eat It Too! Repurposing Collaborative Training to Tailor LLMs to Private Data Without Sharing Boris Radovič, Mohammed Aljahdali, Marco Canini, Veljko Pejović, Zuhair Khayyat

PDF OpenReview

Training Compute-Optimal Protein Language Models Xingyi Cheng, Bo Chen, Pan Li, Jing Gong, Jie Tang, Le Song

PDF OpenReview

Training Compute-Optimal Protein Language Models Xingyi Cheng, Bo Chen, Pan Li, Jing Gong, Jie Tang, Le Song

PDF OpenReview

Training Energy-Efficient Large Language Models Leveraging Equilibrium Driven Bio-Plausible Neural Dynamics Malyaban Bal, Abhronil Sengupta

PDF OpenReview

Training-Free Acceleration of ViTs with Delayed Spatial Merging Jung Hwan Heo, Seyedarmin Azizi, Arash Fayyazi, Massoud Pedram

PDF OpenReview

Training-Free Design of Augmentations with Data-Centric Principles Jieke Wu, Wei Huang, Mingyuan Bai, Xiaoling Hu, Yi Duan, Wuyang Chen

PDF OpenReview

Transcoders Find Interpretable LLM Feature Circuits Jacob Dunefsky, Philippe Chlenski, Neel Nanda

PDF OpenReview

Transductive Active Learning with Application to Safe Bayesian Optimization Jonas Hübotter, Bhavya Sukhija, Lenart Treven, Yarden As, Andreas Krause

PDF OpenReview

Transfer Learning in Multi-Fidelity Surrogate Modeling: A Wind Farm Case Dichang Zhang, Zexia Zhang, Christian Santoni, Ali Khosronejad, Dimitris Samaras

PDF OpenReview

Transferability for Graph Convolutional Networks Christian Koke, Abhishek Saroha, Yuesong Shen, Marvin Eisenberger, Michael M. Bronstein, Daniel Cremers

PDF OpenReview

Transferable Reinforcement Learning via Generalized Occupancy Models Chuning Zhu, Xinqi Wang, Tyler Han, Simon Shaolei Du, Abhishek Gupta

PDF OpenReview

Transferable Reinforcement Learning via Generalized Occupancy Models Chuning Zhu, Xinqi Wang, Tyler Han, Simon Shaolei Du, Abhishek Gupta

PDF OpenReview

Transformer Conformal Prediction for Time Series Junghwan Lee, Chen Xu, Yao Xie

PDF OpenReview

Transformer Designs for In-Context Learning in Foundation Models for Time Series Forecasting with Covariates Afrin Dange, Vaibhav Raj, Praneeth Netrapalli, Sunita Sarawagi

PDF OpenReview

Transformer Efficiently Learns Low-Dimensional Target Functions In-Context Yujin Song, Denny Wu, Kazusato Oko, Taiji Suzuki

PDF OpenReview

Transformer Neural Autoregressive Flows Massimiliano Patacchiola, Aliaksandra Shysheya, Katja Hofmann, Richard E. Turner

PDF OpenReview

Transformers Are Minimax Optimal Nonparametric In-Context Learners Juno Kim, Tai Nakamaki, Taiji Suzuki

PDF OpenReview

Transformers Are Minimax Optimal Nonparametric In-Context Learners Juno Kim, Tai Nakamaki, Taiji Suzuki

PDF OpenReview

Transformers as Stochastic Optimizers Ryuichiro Hataya, Masaaki Imaizumi

PDF OpenReview

Transformers Can Do Arithmetic with the Right Embeddings Sean Michael McLeish, Arpit Bansal, Alex Stein, Neel Jain, John Kirchenbauer, Brian R. Bartoldson, Bhavya Kailkhura, Abhinav Bhatele, Jonas Geiping, Avi Schwarzschild, Tom Goldstein

PDF OpenReview

Transformers Can Perform Distributionally-Robust Optimisation Through In-Context Learning Taeyoung Kim, Hongseok Yang

PDF OpenReview

Transformers Learn Temporal Difference Methods for In-Context Reinforcement Learning Jiuqi Wang, Ethan H Blaser, Hadi Daneshmand, Shangtong Zhang

PDF OpenReview

Transformers Need Glasses! Information Over-Squashing in Language Tasks Federico Barbero, Andrea Banino, Steven Kapturowski, Dharshan Kumaran, João Guilherme Madeira Araújo, Alex Vitvitskyi, Razvan Pascanu, Petar Veličković

PDF OpenReview

Transformers on Markov Data: Constant Depth Suffices Nived Rajaraman, Marco Bondaschi, Ashok Vardhan Makkuva, Kannan Ramchandran, Michael Gastpar

PDF OpenReview

Transformers with Stochastic Competition for Tabular Data Modelling Andreas Voskou, Charalambos Christoforou, Sotirios Chatzis

PDF OpenReview

Transforming a Non-Differentiable Rasterizer into a Differentiable One with Stochastic Gradient Estimation Thomas Deliot, Eric Heitz, Laurent Belcour

PDF OpenReview

Tree of Attacks: Jailbreaking Black-Box LLMs Automatically Anay Mehrotra, Manolis Zampetakis, Paul Kassianik, Blaine Nelson, Hyrum S Anderson, Yaron Singer, Amin Karbasi

PDF OpenReview

TriageAgent: Towards Better Multi-Agents Collaborations for Large Language Model-Based Clinical Triage Meng Lu, Ho Brandon, Ren Dennis, Xuan Wang

PDF OpenReview

TriLM vs FloatLM: Ternary LLMs Are More Performant than Quantized FP16 LLMs Ayush Kaushal, Tejas Vaidhya, Tejas Pandey, Aaryan Bhagat, Irina Rish

PDF OpenReview

Truly No-Regret Learning in Constrained MDPs Adrian Müller, Pragnya Alatur, Volkan Cevher, Giorgia Ramponi, Niao He

PDF OpenReview

TrustAgent: Towards Safe and Trustworthy LLM-Based Agents Through Agent Constitution Wenyue Hua, Xianjun Yang, Mingyu Jin, Zelong Li, Wei Cheng, Ruixiang Tang, Yongfeng Zhang

PDF OpenReview

Truthful Aggregation of LLMs\\ with an Application to Online Advertising Ermis Soumalias, Michael Curry, Sven Seuken

PDF OpenReview

Tuning-Free Alignment of Diffusion Models with Direct Noise Optimization Zhiwei Tang, Jiangweizhi Peng, Jiasheng Tang, Mingyi Hong, Fan Wang, Tsung-Hui Chang

PDF OpenReview

Two-Level Test-Time Adaptation in Multimodal Learning Jixiang Lei, Franz Pernkopf

PDF OpenReview

U-μP: The Unit-Scaled Maximal Update Parametrization Charlie Blake, Constantin Eichenberg, Josef Dean, Lukas Balles, Luke Yuri Prince, Björn Deiseroth, Andres Felipe Cruz-Salinas, Carlo Luschi, Samuel Weinbach, Douglas Orr

PDF OpenReview

UHCone: Universal Hyperbolic Cone for Implicit Hierarchical Learning Menglin Yang, Jiahong Liu, Irwin King, Rex Ying

PDF OpenReview

Unavoidable Learning Constraints Alter the Foundations of Direct Preference Optimization David Wipf

PDF OpenReview

Uncertainty-Aware Preference Alignment in Reinforcement Learning from Human Feedback Sheng Xu, Bo Yue, Hongyuan Zha, Guiliang Liu

PDF OpenReview

Uncertainty-Aware Surrogate Models for Airfoil Flow Simulations with Denoising Diffusion Probabilistic Models Qiang Liu, Nils Thuerey

PDF OpenReview

Uncovering a Culture of AI Grassroots Experimentation by Boston City Employees: Safety Risks and Mitigation Jude Ha, Audrey Xing-Yun Chang

PDF OpenReview

Uncovering Latent Memories: Assessing Data Leakage and Memorization Patterns in Large Language Models Sunny Duan, Mikail Khona, Abhiram Iyer, Rylan Schaeffer, Ila R Fiete

PDF OpenReview

Uncovering Latent Memories: Assessing Data Leakage and Memorization Patterns in Large Language Models Sunny Duan, Mikail Khona, Abhiram Iyer, Rylan Schaeffer, Ila R Fiete

PDF OpenReview

Understanding Adversarially Robust Generalization via Weight-Curvature Index Yuelin Xu, Xiao Zhang

PDF OpenReview

Understanding and Minimising Outlier Features in Neural Network Training Bobby He, Lorenzo Noci, Daniele Paliotta, Imanol Schlag, Thomas Hofmann

PDF OpenReview

Understanding and Minimising Outlier Features in Neural Network Training Bobby He, Lorenzo Noci, Daniele Paliotta, Imanol Schlag, Thomas Hofmann

PDF OpenReview

Understanding and Mitigating Tokenization Bias in Language Models Buu Phan, Marton Havasi, Matthew J. Muckley, Karen Ullrich

PDF OpenReview

Understanding Counting in Small Transformers: The Interplay Between Attention and Feed-Forward Layers Freya Behrens, Luca Biggio, Lenka Zdeborova

PDF OpenReview

Understanding Hallucinations in Diffusion Models Through Mode Interpolation Sumukh K Aithal, Pratyush Maini, Zachary Chase Lipton

PDF OpenReview

Understanding Inhibition Through Maximally Tense Images Christopher J Hamblin, Srijani Saha, Talia Konkle, George A. Alvarez

PDF OpenReview

Understanding Nonlinear Implicit Bias via Region Counts in Input Space Jingwei Li, Jing Xu, Zifan Wang, Huishuai Zhang, Jingzhao Zhang

PDF OpenReview

Understanding the Cognitive Complexity in Language Elicited by Product Images Yan-Ying Chen, Shabnam Hakimi, Monica P Van, Francine Chen, Matthew K Hong, Matthew Klenk, Charlene C. Wu

PDF OpenReview

Understanding the Role of Equivariance in Self-Supervised Learning Yifei Wang, Kaiwen Hu, Sharut Gupta, Ziyu Ye, Yisen Wang, Stefanie Jegelka

PDF OpenReview

Understanding the Role of Functional Diversity in Weight-Ensembling with Ingredient Selection and Multidimensional Scaling Alex Rojas, David Alvarez-Melis

PDF OpenReview

Unfamiliar Finetuning Examples Control How Language Models Hallucinate Katie Kang, Eric Wallace, Claire Tomlin, Aviral Kumar, Sergey Levine

PDF OpenReview

Unfamiliar Finetuning Examples Control How Language Models Hallucinate Katie Kang, Eric Wallace, Claire Tomlin, Aviral Kumar, Sergey Levine

PDF OpenReview

Unfamiliar Finetuning Examples Control How Language Models Hallucinate Katie Kang, Eric Wallace, Claire Tomlin, Aviral Kumar, Sergey Levine

PDF OpenReview

Unfolding Time: Generative Modeling for Turbulent Flows in 4D Abdullah Saydemir, Marten Lienen, Stephan Günnemann

PDF OpenReview

Unified Taxonomy in AI Safety: Watermarks, Adversarial Defenses, and Transferable Attacks Grzegorz Gluch, Sai Ganesh Nagarajan, Berkant Turan

PDF OpenReview

Uniform Last-Iterate Guarantee for Bandits and Reinforcement Learning Junyan Liu, Yunfan Li, Ruosong Wang, Lin Yang

PDF OpenReview

Universal Self-Consistency for Large Language Models Xinyun Chen, Renat Aksitov, Uri Alon, Jie Ren, Kefan Xiao, Pengcheng Yin, Sushant Prakash, Charles Sutton, Xuezhi Wang, Denny Zhou

PDF OpenReview

Unlocking the Global Synergies in Low-Rank Adapters Zixi Zhang, Cheng Zhang, Xitong Gao, Robert D. Mullins, George Anthony Constantinides, Yiren Zhao

PDF OpenReview

Unlocking Tokens as Data Points for Generalization Bounds on Larger Language Models Sanae Lotfi, Yilun Kuang, Marc Anton Finzi, Brandon Amos, Micah Goldblum, Andrew Gordon Wilson

PDF OpenReview

Unmixing Noise from Hawkes Process to Model Learned Physiological Events Guillaume Staerman, Virginie Loison, Thomas Moreau

PDF OpenReview

Unsupervised Feature Extraction from a Foundation Model Zoo for Cell Similarity Search in Oncological Microscopy Across Devices Gabriel Kalweit, Anusha Klett, Mehdi Naouar, Jens Rahnfeld, Yannick Vogt, Diana Laura Infante Ramirez, Rebecca Berger, Jesus Duque Afonso, Tanja Nicole Hartmann, Marie Follo, Michael Luebbert, Roland Mertelsmann, Evelyn Ullrich, Joschka Boedecker, Maria Kalweit

PDF OpenReview

Unsupervised Ground Metric Learning with Tree Wasserstein Distance Kira Michaela Düsterwald, Makoto Yamada

PDF OpenReview

Unveiling CLIP Dynamics: Linear Mode Connectivity and Generalization Alireza Abdollahpourrostam, Amartya Sanyal, Seyed-Mohsen Moosavi-Dezfooli

PDF OpenReview

Unveiling Induction Heads: Provable Training Dynamics and Feature Learning in Transformers Siyu Chen, Heejune Sheen, Tianhao Wang, Zhuoran Yang

PDF OpenReview

Upper Error Bounds for Score-Based Inverse Problem Solving in Imaging Irina Dobrianski, Dominik Narnhofer, Thomas Pock

PDF OpenReview

UPS: Efficiently Building Foundation Models for PDE Solving via Cross-Modal Adaptation Junhong Shen, Tanya Marwah, Ameet Talwalkar

PDF OpenReview

USCILab3D: A Large-Scale, Long-Term, Semantically Annotated Outdoor Dataset Kiran Lekkala, Henghui Bao, Peixu Cai, Wei Zer Lim, Chen Liu, Laurent Itti

PDF OpenReview

Using Degeneracy in the Loss Landscape for Mechanistic Interpretability Lucius Bushnaq, Jake Mendel, Stefan Heimersheim, Dan Braun, Nicholas Goldowsky-Dill, Kaarel Hänni, Cindy Wu, Marius Hobbhahn

PDF OpenReview

Using Gradients to Check Sensitivity of MCMC-Based Analyses to Removing Data Tin D. Nguyen, Ryan James Giordano, Rachael Meager, Tamara Broderick

PDF OpenReview

Using Large Language Models for Humanitarian Frontline Negotiation: Opportunities and Considerations Zilin Ma, Susannah Cheng Su, Nathan Zhao, Linn Bieske, Blake Bullwinkel, Yanyi Zhang, Jinglun Gao, Gekai Liao, Siyao Li, Ziqing Luo, Boxiang Wang, Zihan Wen, Yanrui Yang, Claude Bruderlein, Weiwei Pan

PDF OpenReview

VACoDe: Visual Augmented Contrastive Decoding Sihyeon Kim, Boryeong Cho, Sangmin Bae, Sumyeong Ahn, Se-Young Yun

PDF OpenReview

Variable Star Light Curves in Koopman Space Mario Pasquato, Gaia Carenini, Nicolas Mekhaël, Vittorio F. Braga, Piero Trevisan, Giuseppe Bono, Yashar Hezaveh

PDF OpenReview

Variance Reduction of Diffusion Model's Gradients with Taylor Approximation-Based Control Variate Paul Jeha, Will Sussman Grathwohl, Michael Riis Andersen, Carl Henrik Ek, Jes Frellsen

PDF OpenReview

Variance-Dependent Regret Bounds for Nonstationary Linear Bandits Zhiyong Wang, Jize Xie, Yi Chen, John C.S. Lui, Dongruo Zhou

PDF OpenReview

Variational and Explanatory Neural Networks for Encoding Cancer Profiles and Predicting Drug Responses Tianshu Feng, Rohan Gnanaolivu, Abolfazl Safikhani, Yuanhang Liu, Jun Jiang, Nicholas Chia, Alexander Partin, Priyanka Vasanthakumari, Yitan Zhu, Chen Wang

PDF OpenReview

Variational Inference Failures Under Model Symmetries: Permutation Invariant Posteriors for Bayesian Neural Networks Yoav Gelberg, Tycho F. A. van der Ouderaa, Mark van der Wilk, Yarin Gal

PDF OpenReview

Variational Inference with Censored Gaussian Process Regressors Andrea Karlova, Rishabh Kabra, Daniel Augusto de Souza, Brooks Paige

PDF OpenReview

Variational Stochastic Gradient Descent for Deep Neural Networks Haotian Chen, Anna Kuzina, Babak Esmaeili, Jakub M. Tomczak

PDF OpenReview

Verbalized Machine Learning: Revisiting Machine Learning with Language Models Tim Z. Xiao, Robert Bamler, Bernhard Schölkopf, Weiyang Liu

PDF OpenReview

Verbalized Machine Learning: Revisiting Machine Learning with Language Models Tim Z. Xiao, Robert Bamler, Bernhard Schölkopf, Weiyang Liu

PDF OpenReview

VerityMath: Advancing Mathematical Reasoning by Self-Verification Through Unit Consistency Vernon Toh Yan Han, Ratish Puduppully, Nancy F. Chen

PDF OpenReview

VFA: Vision Frequency Analysis of Foundation Models and Human Mohammad Javad Darvishi Bayazi, Md Rifat Arefin, Jocelyn Faubert, Irina Rish

PDF OpenReview

VICtoR: Learning Hierarchical Vision-Instruction Correlation Rewards for Long-Horizon Manipulation Kuo-Han Hung, Pang-Chi Lo, Jia-Fong Yeh, Han-Yuan Hsu, Yi-Ting Chen, Winston H. Hsu

PDF OpenReview

Vid3D: Synthesis of Dynamic 3D Scenes Using 2D Video Diffusion Rishab Parthasarathy, Zachary Ankner, Aaron Gokaslan

PDF OpenReview

Video Watermarking: Safeguarding Your Video from (Unauthorized) Annotations by Video-Based LLMs Jinmin Li, Kuofeng Gao, Yang Bai, Jingyun Zhang, Shu-Tao Xia

PDF OpenReview

Vision-Language Models Provide Promptable Representations for Reinforcement Learning William Chen, Oier Mees, Aviral Kumar, Sergey Levine

PDF OpenReview

Vision-Language Models Provide Promptable Representations for Reinforcement Learning William Chen, Oier Mees, Aviral Kumar, Sergey Levine

PDF OpenReview

Vision-Language Models Provide Promptable Representations for Reinforcement Learning William Chen, Oier Mees, Aviral Kumar, Sergey Levine

PDF OpenReview

Vision-LSTM: xLSTM as Generic Vision Backbone Benedikt Alkin, Maximilian Beck, Korbinian Pöppel, Sepp Hochreiter, Johannes Brandstetter

PDF OpenReview

Visualizing Neural Network Imagination Nevan Wichers, Victor Tao, Riccardo Volpato, Fazl Barez

PDF OpenReview

vMF-Exp: Von Mises-Fisher Exploration of Large Action Sets with Hyperspherical Embeddings Walid Bendada, Guillaume Salha-Galvan, Romain Hennequin, Théo Bontempelli, Thomas Bouabça, Tristan Cazenave

PDF OpenReview

Von Mises Quasi-Processes for Bayesian Circular Regression Yarden Cohen, Alexandre Khae Wu Navarro, Jes Frellsen, Richard E. Turner, Raziel Riemer, Ari Pakman

PDF OpenReview

Wasserstein Modality Alignment Makes Your Multimodal Transformer More Robust Zhuo Zhi, Ziquan Liu, Qiangqiang Wu, Miguel R. D. Rodrigues

PDF OpenReview

Waterfall: Framework for Robust and Scalable Text Watermarking Gregory Kang Ruey Lau, Xinyuan Niu, Hieu Dao, Jiangwei Chen, Chuan-Sheng Foo, Bryan Kian Hsiang Low

PDF OpenReview

Weak-to-Strong Extrapolation Expedites Alignment Chujie Zheng, Ziqi Wang, Heng Ji, Minlie Huang, Nanyun Peng

PDF OpenReview

Weak-to-Strong Jailbreaking on Large Language Models Xuandong Zhao, Xianjun Yang, Tianyu Pang, Chao Du, Lei Li, Yu-Xiang Wang, William Yang Wang

PDF OpenReview

WebCanvas: Benchmarking Web Agents in Online Environments Yichen Pan, Dehan Kong, Sida Zhou, Cheng Cui, Yifei Leng, Bing Jiang, Hangyu Liu, Yanyi Shang, Shuyan Zhou, Tongshuang Wu, Zhengyang Wu

PDF OpenReview

WebCanvas: Benchmarking Web Agents in Online Environments Yichen Pan, Dehan Kong, Sida Zhou, Cheng Cui, Yifei Leng, Bing Jiang, Hangyu Liu, Yanyi Shang, Shuyan Zhou, Tongshuang Wu, Zhengyang Wu

PDF OpenReview

Weight-Based Decomposition: A Case for Bilinear MLPs Michael T Pearce, Thomas Dooms, Alice Rigg

PDF OpenReview

What Can VLMs Do for Zero-Shot Embodied Task Planning? Xian Fu, Min Zhang, Jianye Hao, Peilong Han, Hao Zhang, Lei Shi, Hongyao Tang

PDF OpenReview

What Can VLMs Do for Zero-Shot Embodied Task Planning? Xian Fu, Min Zhang, Jianye Hao, Peilong Han, Hao Zhang, Lei Shi, Hongyao Tang

PDF OpenReview

What Makes a Machine Learning Task a Good Candidate for an Equivariant Network? Scott Mahan, Davis Brown, Timothy Doster, Henry Kvinge

PDF OpenReview

What Makes and Breaks Safety Fine-Tuning? a Mechanistic Study Samyak Jain, Ekdeep Singh Lubana, Kemal Oksuz, Tom Joy, Philip Torr, Amartya Sanyal, Puneet K. Dokania

PDF OpenReview

When Are Bias-Free ReLU Networks like Linear Networks? Yedi Zhang, Andrew M Saxe, Peter E. Latham

PDF OpenReview

When Do Language Models Need to Be Large? Zhixun Chen, Yali Du, David Henry Mguni

PDF OpenReview

When Is Mean-Field Reinforcement Learning Tractable and Relevant? Batuhan Yardim, Artur Goldman, Niao He

PDF OpenReview

When to Sense and Control? a Time-Adaptive Approach for Continuous-Time RL Lenart Treven, Bhavya Sukhija, Yarden As, Florian Dorfler, Andreas Krause

PDF OpenReview

Where Do Large Learning Rates Lead Us? a Feature Learning Perspective Ildus Sadrtdinov, Maxim Kodryan, Eduard Pokonechny, Ekaterina Lobacheva, Dmitry Vetrov

PDF OpenReview

Why Do Recurrent Neural Networks Suddenly Learn? Bifurcation Mechanisms in Neuro-Inspired Short-Term Memory Tasks Udith Haputhanthri, Liam Storan, Yiqi Jiang, Adam Shai, Hakki Orhun Akengin, Mark Schnitzer, Fatih Dinc, Hidenori Tanaka

PDF OpenReview

Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive? Rylan Schaeffer, Hailey Schoelkopf, Brando Miranda, Gabriel Mukobi, Varun Madan, Adam Ibrahim, Herbie Bradley, Stella Biderman, Sanmi Koyejo

PDF OpenReview

Why Pruning and Conditional Computation Work: A High-Dimensional Perspective Erdem Koyuncu

PDF OpenReview

Why Transformers Need Adam: A Hessian Perspective Yushun Zhang, Congliang Chen, Tian Ding, Ziniu Li, Ruoyu Sun, Zhi-Quan Luo

PDF OpenReview

WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models Liwei Jiang, Kavel Rao, Seungju Han, Allyson Ettinger, Faeze Brahman, Sachin Kumar, Niloofar Mireshghallah, Ximing Lu, Maarten Sap, Nouha Dziri, Yejin Choi

PDF OpenReview

Wind Farm Control with Cooperative Multi-Agent Reinforcement Learning Claire Bizon Monroc, Ana Busic, Jiamin Zhu, Donatien Dubuc

PDF OpenReview

XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX Alexander Nikulin, Vladislav Kurenkov, Ilya Zisman, Artem Sergeevich Agarkov, Viacheslav Sinii, Sergey Kolesnikov

PDF OpenReview

xLSTM: Extended Long Short-Term Memory Maximilian Beck, Korbinian Pöppel, Markus Spanring, Andreas Auer, Oleksandra Prudnikova, Michael K Kopp, Günter Klambauer, Johannes Brandstetter, Sepp Hochreiter

PDF OpenReview

xLSTM: Extended Long Short-Term Memory Korbinian Pöppel, Maximilian Beck, Markus Spanring, Andreas Auer, Oleksandra Prudnikova, Michael K Kopp, Günter Klambauer, Johannes Brandstetter, Sepp Hochreiter

PDF OpenReview

xMINT: A Multimodal Integration Transformer for Xenium Gene Imputation Xiaohui Jiang, Yuxia Xie, Jichun Xie

PDF OpenReview

You Shall Pass: Dealing with the Zero-Gradient Problem in Predict and Optimize for Convex Optimization Grigorii Veviurko, Wendelin Boehmer, Mathijs de Weerdt

PDF OpenReview

Zero-Shot Generalization of GNNs over Distinct Attribute Domains Yangyi Shen, Beatrice Bevilacqua, Joshua Robinson, Charilaos Kanatsoulis, Jure Leskovec, Bruno Ribeiro

PDF OpenReview

Zero-Shot Generalization of GNNs over Distinct Attribute Domains Yangyi Shen, Beatrice Bevilacqua, Joshua Robinson, Charilaos Kanatsoulis, Jure Leskovec, Bruno Ribeiro

PDF OpenReview

Zero-Shot Unsupervised and Text-Based Audio Editing Using DDPM Inversion Hila Manor, Tomer Michaeli

PDF OpenReview

Zeroth-Order Fine-Tuning of LLMs with Extreme Sparsity Wentao Guo, Jikai Long, Yimeng Zeng, Zirui Liu, Xinyu Yang, Yide Ran, Jacob R. Gardner, Osbert Bastani, Christopher De Sa, Xiaodong Yu, Beidi Chen, Zhaozhuo Xu

PDF OpenReview

ZigMa: A DiT-Style Zigzag Mamba Diffusion Model Vincent Tao Hu, Stefan Andreas Baumann, Ming Gui, Olga Grebenkova, Pingchuan Ma, Johannes Schusterbauer, Björn Ommer

PDF OpenReview