TMLR 2026
671 papers
\textsc{PGO-BEN}: Proxy-Guided Orthogonalization and Beta Ensembling for Few-Shot Domain-Incremental Learning
Samrat Mukherjee, Thivyanth Venkateswaran, Eric Nuertey Coleman, Luigi Quarantiello, Julio Hurtado, Vincenzo Lomonaco, Gemma Roig, Subhasis Chaudhuri, Biplab Banerjee $\texttt{C2-DPO}$: Constrained Controlled Direct Preference Optimization
Kavosh Asadi, Xingzi Xu, Julien Han, Ege Beyazit, Idan Pipano, Dominique Perrault-Joncas, Shoham Sabach, Mohammad Ghavamzadeh, Karim Bouyarmane $\texttt{LucidAtlas}$: Learning Uncertainty-Aware, Covariate-Disentangled, Individualized Atlas Representations
Yining Jiao, Sreekalyani Bhamidi, Carlton Jude Zdanski, Huaizhi Qu, Julia S Kimbell, Andrew Prince, Cameron P Worden, Samuel Kirse, Christopher Rutter, Benjamin H Shields, Jisan Mahmud, Tianlong Chen, Marc Niethammer $\texttt{SEM-CTRL}$: Semantically Controlled Decoding
Mohammad Albinhassan, Pranava Madhyastha, Alessandra Russo A Faster Generalized Two-Stage Approximate Top-K
B L Yashas Samaga, Varun Yerram, Spandana Raj Babbula, Prateek Jain, Praneeth Netrapalli A Multi-Fidelity Control Variate Approach for Policy Gradient Estimation
Xinjie Liu, Cyrus Neary, Kushagra Gupta, Wesley A. Suttle, Christian Ellis, Ufuk Topcu, David Fridovich-Keil A Simple and Effective Reinforcement Learning Method for Text-to-Image Diffusion Fine-Tuning
Shashank Gupta, Chaitanya Ahuja, Tsung-Yu Lin, Sreya Dutta Roy, Harrie Oosterhuis, Maarten de Rijke, Satya Narayan Shukla A Survey of Model Architectures in Information Retrieval
Zhichao Xu, Fengran Mo, Zhiqi Huang, Crystina Zhang, Puxuan Yu, Bei Wang Phillips, Jimmy Lin, Vivek Srikumar A Survey of Reasoning in Autonomous Driving Systems: Open Challenges and Emerging Paradigms
Kejin Yu, Yuhan Sun, Taiqiang Wu, Ruixu Zhang, Zhiqiang Lin, Yuxin Meng, Junjie Wang, Yujiu Yang A Survey of Self-Evolving Agents: What, When, How, and Where to Evolve on the Path to Artificial Super Intelligence
Huan-ang Gao, Jiayi Geng, Wenyue Hua, Mengkang Hu, Xinzhe Juan, Hongzhang Liu, Shilong Liu, Jiahao Qiu, Xuan Qi, Qihan Ren, Yiran Wu, Hongru Wang, Han Xiao, Yuhang Zhou, Shaokun Zhang, Jiayi Zhang, Jinyu Xiang, Yixiong Fang, Qiwen Zhao, Dongrui Liu, Cheng Qian, Zhenhailong Wang, Minda Hu, Huazheng Wang, Qingyun Wu, Heng Ji, Mengdi Wang A Survey of Token Compression for Efficient Multimodal Large Language Models
Kele Shao, Keda Tao, Kejia Zhang, Sicheng Feng, Mu Cai, Yuzhang Shang, Haoxuan You, Can Qin, Yang Sui, Huan Wang A Survey on Federated Fine-Tuning of Large Language Models
Yebo Wu, Chunlin Tian, Jingguang Li, He Sun, KaHou Tam, Zhanting Zhou, Haicheng Liao, Jing Xiong, Zhijiang Guo, Li Li, Cheng-zhong Xu A Survey on Over-Smoothing and Over-Squashing: Unified Propagation Perspectives on Graph Neural Networks and Transformers
Alvaro Arroyo, Federico Barbero, Hugh Blayney, Michael M. Bronstein, Xiaowen Dong, Pietro Lio, Razvan Pascanu, Pierre Vandergheynst A Unifying Framework for Parallelizing Sequential Models with Linear Dynamical Systems
Xavier Gonzalez, E. Kelly Buchanan, Hyun Dong Lee, Jerry Weihong Liu, Ke Alexander Wang, David M. Zoltowski, Leo Kozachkov, Christopher Re, Scott Linderman Accurate Split Learning on Noisy Signals
Hang Xu, Subhajit Maity, Aritra Dutta, Xin Li, Panos Kalnis ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer
Jinyi Hu, Shengding Hu, Yuxuan Song, Yufei Huang, Mingxuan Wang, Hao Zhou, Zhiyuan Liu, Wei-Ying Ma, Maosong Sun ActionEQA: Action Interface for Embodied Question Answering
Tianwei Bao, Qineng Wang, Kangrui Wang, Mingkai Deng, Guangyi Liu, Jiayuan Mao, Lawrence Birnbaum, Zhiting Hu, Eric P. Xing, Zhaoran Wang, Manling Li Active Teacher Selection for Reward Learning
Rachel Freedman, Justin Svegliato, Kyle Hollins Wray, Stuart Russell AdaCtrl: Towards Adaptive and Controllable Reasoning via Difficulty-Aware Budgeting
Shijue Huang, Hongru Wang, Wanjun Zhong, Zhaochen Su, Jiazhan Feng, Bowen Cao, Yi R. Fung ADAPT: Adaptive Prompt Tuning for Vision-Language Models
Zhenhan Huang, Tejaswini Pedapati, Pin-Yu Chen, Jianxi Gao Adapting Language Models to Produce Good Class Probabilities for Classification Tasks
Lautaro Estienne, Matias Vera, Elizabeth Fons, Elena Kochkina, Pablo Piantanida, Luciana Ferrer Adaptive Multi-Frame Sampling for Consistent Zero-Shot Text-to-Video Editing
Thérèse Tisseau des Escotais, Clément Rambour, Bertrand Leroy, Arnaud Breloy ADiff4TPP: Asynchronous Diffusion Models for Temporal Point Processes
Amartya Mukherjee, Ruizhi Deng, He Zhao, Yuzhen Mao, Leonid Sigal, Frederick Tung Adversarial Attacks in Weight-Space Classifiers
Tamir Shor, Ethan Fetaya, Chaim Baskin, Alex M. Bronstein Amortized Bayesian Workflow
Chengkun Li, Aki Vehtari, Paul-Christian Bürkner, Stefan T. Radev, Luigi Acerbi, Marvin Schmitt Any Image Restoration via Efficient Spatial-Frequency Degradation Adaptation
Bin Ren, Eduard Zamfir, Zongwei Wu, Yawei Li, Yidi Li, Danda Pani Paudel, Radu Timofte, Ming-Hsuan Yang, Luc Van Gool, Nicu Sebe Are Foundation Models for Computer Vision Good Conformal Predictors?
Leo Fillioux, Julio Silva-Rodríguez, Ismail Ben Ayed, Paul-Henry Cournède, Maria Vakalopoulou, Stergios Christodoulidis, Jose Dolz Augmented Mixup Procedure for Privacy-Preserving Collaborative Training
Mihail-Iulian Pleșa, Fabrice Clérot, Simona Elena David, Robert Poenaru Augmented Vision-Language Models: A Systematic Review
Anthony C Davis, Burhan A. Sadiq, Tianmin Shu, Chien-Ming Huang Augmenting Molecular Graphs with Geometries via Machine Learning Interatomic Potentials
Cong Fu, Yuchao Lin, Zachary Krueger, Haiyang Yu, Maho Nakata, Jianwen Xie, Emine Kucukbenli, Xiaofeng Qian, Shuiwang Ji Automated Attention Pattern Discovery at Scale in Large Language Models
Jonathan Katzy, Razvan Mihai Popescu, Erik Mekkes, Arie van Deursen, Maliheh Izadi BalancedDPO: Adaptive Multi-Metric Alignment
Dipesh Tamboli, Souradip Chakraborty, Aditya Malusare, Biplab Banerjee, Amrit Singh Bedi, Vaneet Aggarwal Benchmarking Missing Data Imputation Methods in Socioeconomic Surveys
Siyi Sun, David Antony Selby, Yunchuan Huang, Ayush Patnaik, Sebastian Josef Vollmer, Seth Flaxman, Anisoara Calinescu Better Language Models Exhibit Higher Visual Alignment
Jona Ruthardt, Gertjan J. Burghouts, Serge Belongie, Yuki M Asano Beyond Affinity: A Benchmark of 1d, 2D, and 3D Methods Reveals Critical Trade-Offs in Structure-Based Drug Design
Kangyu Zheng, Kai Zhang, Jiale Tan, Xuehan Chen, Yingzhou Lu, Zaixi Zhang, Lichao Sun, Marinka Zitnik, Tianfan Fu, Zhiding Liang Beyond Expectations: Learning with Stochastic Dominance Made Practical
Shicong Cen, Jincheng Mei, Hanjun Dai, Dale Schuurmans, Yuejie Chi, Bo Dai Beyond Semantics: The Unreasonable Effectiveness of Reasonless Intermediate Tokens
Karthik Valmeekam, Vardhan Palod, Kaya Stechly, Atharva Gundawar, Subbarao Kambhampati Bi-Level Hierarchical Neural Contextual Bandits for Online Recommendation
Yunzhe Qi, Yao Zhou, Yikun Ban, Allan Stewart, Chuanwei Ruan, Jiachuan He, Shishir Kumar Prasad, Haixun Wang, Jingrui He BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions
Tao Yu, Zhengbo Zhang, Zhiheng Lyu, Junhao Gong, Hongzhu Yi, Xinming Wang, Yuxuan Zhou, Jiabing Yang, Ping Nie, Yan Huang, Wenhu Chen Budget-Optimized Crowdworker Allocation
Sha Lai, Prakash Ishwar, Margrit Betke CADmium: Fine-Tuning Code Language Models for Text- Driven Sequential CAD Design
Prashant Govindarajan, Davide Baldelli, Jay Pathak, Quentin Fournier, Sarath Chandar CARINOX: Inference-Time Scaling with Category-Aware Reward-Based Initial Noise Optimization and Exploration
Seyed Amir Kasaei, Ali Aghayari, Arash Marioriyad, Niki Sepasian, Shayan Baghayi Nejad, MohammadAmin Fazli, Mahdieh Soleymani Baghshah, Mohammad Hossein Rohban CatScreen: A Large MultiModal Benchmark Dataset for Cataract Screening
Mahapara Khurshid, Sonam Kumar, Dr Anusuya Bhattacharyya, Dhruve Kiyawat, Anshul Chauhan, Suklengmung Buragohain, Harsha Bhattacharjee, Limalemla Jamir, Vishali Gupta, Mona Duggal, Mayank Vatsa, Richa Singh Causal Graph Learning via Distributional Invariance of Cause-Effect Relationship
Nang Hung Nguyen, Phi Le Nguyen, Thao Nguyen Truong, Trong Nghia Hoang, Masashi Sugiyama ClimateAgent: Multi-Agent Orchestration for Complex Climate Data Science Workflows
Chenyue Li, Hyeonjae Kim, Wen Deng, Mengxi Jin, Huang Wen, Mengqian Lu, Binhang Yuan CodePDE: An Inference Framework for LLM-Driven PDE Solver Generation
Shanda Li, Tanya Marwah, Junhong Shen, Weiwei Sun, Andrej Risteski, Yiming Yang, Ameet Talwalkar Condense, Don't Just Prune: Enhancing Efficiency and Performance in MoE Layer Pruning
Mingyu Cao, Gen Li, Jie Ji, Jiaqi Zhang, Ajay Jaiswal, Li Shen, Xiaolong Ma, Shiwei Liu, Lu Yin Conformal Calibration of Statistical Confidence Sets
Luben Miguel Cruz Cabezas, Guilherme Soares, Thiago Ramos, Rafael Bassi Stern, Rafael Izbicki Context-Aware Learned Mesh-Based Simulation via Trajectory-Level Meta-Learning
Philipp Dahlinger, Niklas Freymuth, Tai Hoang, Tobias Würth, Michael Volpp, Luise Kärger, Gerhard Neumann Contextual Learning for Anomaly Detection in Tabular Data
Spencer King, Zhilu Zhang, Ruofan Yu, Baris Coskun, Wei Ding, Qian Cui Contrastive VQ Priors for Multi-Class Plaque Segmentation via SAM Adaptation
Ruan Yizhe, Yusuke Kurose, Junichi Iho, Yoji Tokunaga, Makoto Horie, Yusaku Hayashi, Keisuke Nishizawa, Yasushi Koyama, Tatsuya Harada Cost-Aware Routing for Efficient Text-to-Image Generation
Qinchan Li, Kenneth Chen, Changyue Su, Wittawat Jitkrittum, Qi Sun, Patsorn Sangkloy Covariance Density Neural Networks
Om Roy, Yashar Moshfeghi, Keith M Smith CRMArena-Pro: Holistic Assessment of LLM Agents Across Diverse Business Scenarios and Interactions
Kung-Hsiang Huang, Akshara Prabhakar, Onkar Thorat, Divyansh Agarwal, Prafulla Kumar Choubey, Yixin Mao, Silvio Savarese, Caiming Xiong, Chien-Sheng Wu Curvature-Aware Safety Restoration in LLMs Fine-Tuning
Thong Bach, Thanh Nguyen-Tang, Dung Nguyen, Thao Minh Le, Truyen Tran D-Garment: Physically Grounded Latent Diffusion for Dynamic Garment Deformations
Antoine Dumoulin, Adnane Boukhayma, Laurence Boissieux, Bharath Bhushan Damodaran, Pierre Hellier, Stefanie Wuhrer DASB - Discrete Audio and Speech Benchmark
Pooneh Mousavi, Jarod Duret, Darius Petermann, Artem Ploujnikov, Luca Della Libera, Anastasia Kuznetsova, Cem Subakan, Mirco Ravanelli Data Compressibility Quantifies LLM Memorization
Yizhan Huang, Zhe Yang, Meifang Chen, Huang Nianchen, Jianping Zhang, Michael R. Lyu Dealing with Uncertainty in Contextual Anomaly Detection
Luca Bindini, Lorenzo Perini, Stefano Nistri, Jesse Davis, Paolo Frasconi deCIFer: Crystal Structure Prediction from Powder Diffraction Data Using Autoregressive Language Models
Frederik Lizak Johansen, Ulrik Friis-Jensen, Erik B Dam, Kirsten M. Ø. Jensen, Rocío Mercado, Raghavendra Selvan Decoding Safety Feedback from Diverse Raters: A Data-Driven Lens on Responsiveness to Severity
Pushkar Mishra, Charvi Rastogi, Stephen R Pfohl, Alicia Parrish, Tian Huey Teh, Roma Patel, Mark Diaz, Ding Wang, Michela Paganini, Vinodkumar Prabhakaran, Lora Aroyo, Verena Rieser DeepSeek-R1 Thoughtology: Let’s Think About LLM Reasoning
Sara Vera Marjanovic, Arkil Patel, Vaibhav Adlakha, Milad Aghajohari, Parishad BehnamGhader, Mehar Bhatia, Aditi Khandelwal, Austin Kraft, Benno Krojer, Xing Han Lù, Nicholas Meade, Dongchan Shin, Amirhossein Kazemnejad, Gaurav Kamath, Marius Mosbach, Karolina Stanczak, Siva Reddy Delta-Influence: Identifying Poisons via Influence Functions
Wenjie Li, Jiawei Li, Pengcheng Zeng, Christian Schroeder de Witt, Ameya Prabhu, Amartya Sanyal Denoising Hamiltonian Network for Physical Reasoning
Congyue Deng, Brandon Y. Feng, Cecilia Garraffo, Alan Garbarz, Robin Walters, William T. Freeman, Leonidas Guibas, Kaiming He DiffCATS: Causally Associated Time-Series Generation Through Diffusion Models
Giuseppe Masi, Andrea Coletta, Elizabeth Fons, Svitlana Vyetrenko, Novella Bartolini DiffKGW: Stealthy and Robust Diffusion Model Watermarking
Tianxin Wei, Ruizhong Qiu, Yifan Chen, Yunzhe Qi, Jiacheng Lin, Wenxuan Bao, Wenju Xu, Sreyashi Nag, Ruirui Li, Hanqing Lu, Zhengyang Wang, Chen Luo, Hui Liu, Suhang Wang, Jingrui He, Qi He, Xianfeng Tang Diffusion Posterior Sampling for Simulation-Based Inference in Tall Data Settings
Julia Linhart, Gabriel Cardoso, Alexandre Gramfort, Sylvain Le Corff, Pedro L. C. Rodrigues Dimension-Free Error Estimate for Diffusion Model and Optimal Scheduling
Valentin De Bortoli, Romuald Elie, Anna Kazeykina, Zhenjie Ren, Jiacheng Zhang DINOv3
Oriane Siméoni, Huy V. Vo, Maximilian Seitzer, Federico Baldassarre, Maxime Oquab, Cijo Jose, Vasil Khalidov, Marc Szafraniec, Seung Eun Yi, Michael Ramamonjisoa, Francisco Massa, Daniel Haziza, Luca Wehrstedt, Jianyuan Wang, Timothée Darcet, Théo Moutakanni, Leonel Sentana, Claire Roberts, Andrea Vedaldi, Jamie Tolan, John Brandt, Camille Couprie, Julien Mairal, Herve Jegou, Patrick Labatut, Piotr Bojanowski Discovering Symbolic Differential Equations with Symmetry Invariants
Jianke Yang, Manu Bhat, Bryan Hu, Yadi Cao, Nima Dehmamy, Robin Walters, Rose Yu Diversity Sampling Regularization for Multi-Domain Generalization
Lakpa Tamang, Mohamed Reda Bouadjenek, Sunil Aryal, Richard Dazeley Domain Indexing Collaborative Filtering for Recommender Systems
Rohit Amarnath, Zihao Xu, Qi Xu, Zhigang Hua, Yan Xie, Shuang Yang, Bo Long, Hao Wang Domain Translation with Monolingual Lexical Distribution
Yusuke Sakai, Zhi Qu, Hidetaka Kamigaito, Taro Watanabe, Xiaojiang Liu EgoPlan: Towards Effective Embodied Agents via Egocentric Planning
Zhirui Fang, Ming Yang, Weishuai Zeng, Junpeng Yue, Boyu Li, Jiafei Lyu, Xiu Li, Zongqing Lu Empowering Power Outage Prediction with Spatially Aware Hybrid Graph Neural Networks and Contrastive Learning
Xuyang Shen, Zijie Pan, Diego Cerrai, Xinxuan Zhang, Christopher Colorio, Emmanouil Anagnostou, Dongjin Song Enhancing Concept Localization in CLIP-Based Concept Bottleneck Models
Rémi Kazmierczak, Steve Azzolin, Goran Frehse, Eloïse Berthier, Gianni Franchi Enhancing Semantic Segmentation with Continual Self-Supervised Pre-Training
Brown Ebouky, Ajad Chhatkuli, A. Cristiano I. Malossi, Christoph Studer, Roy Assaf, Andrea Bartezzaghi ExpertLens: Activation Steering Features Are Highly Interpretable
Masha Fedzechkina, Eleonora Gualdoni, Sinead Williamson, Katherine Metcalf, Skyler Seto, Barry-John Theobald Explaining Graph Neural Networks for Node Similarity on Graphs
Daniel Daza, Cuong Xuan Chu, Trung-Kien Tran, Daria Stepanova, Michael Cochez, Paul Groth Explaining with Trees: Interpreting CNNs Using Hierarchies
Caroline Mazini Rodrigues, Nicolas Boutry, Laurent Najman Exploration-Driven Optimization for Test-Time Large Language Model Reasoning
ChangHao Li, Yuchen Zhuang, Chenxiao Gao, Haotian Sun, Rushi Qiang, Chao Zhang, Bo Dai Extracting and Following Paths for Robust Relational Reasoning with Large Language Models
Ge Zhang, Mohammad Ali Alomrani, Hongjian Gu, Jiaming Zhou, Yaochen Hu, Bin Wang, Qun Liu, Mark Coates, Yingxue Zhang, Jianye Hao Facial Counterfactual Generation via Causal Mask-Guided Editing
Pei Sze Tan, Sailaja Rajanala, Arghya Pal, Raphael CW Phan, Huey Fang Ong Fast Debiasing of the LASSO Estimator
Shuvayan Banerjee, James Saunderson, Radhendushka Srivastava, Ajit Rajwade Feature Representation Transferring to Lightweight Models via Perception Coherence
Hai-Vy Nguyen, Fabrice Gamboa, Sixin Zhang, Reda Chhaibi, Serge Gratton, Thierry Giaccone Finally Outshining the Random Baseline: A Simple and Effective Solution for Active Learning in 3D Biomedical Imaging
Carsten T. Lüth, Jeremias Traub, Kim-Celine Kahl, Till J. Bungert, Lukas Klein, Lars Krämer, Paul F Jaeger, Klaus Maier-Hein, Fabian Isensee Flow Matching for Tabular Data Synthesis
Bahrul Ilmi Nasution, Floor Eijkelboom, Mark Elliot, Richard Allmendinger, Christian A. Naesseth Forget Less, Retain More: A Lightweight Regularizer for Rehearsal-Based Continual Learning
Lama Alssum, Hasan Abed Al Kader Hammoud, Motasem Alfarra, Juan C Leon Alcazar, Bernard Ghanem Forgetting: A New Mechanism Towards Better Large Language Model Fine-Tuning
Ali Taheri, Alireza Taban, Qizhou Wang, Shanshan Ye, Abdolreza Mirzaei, Tongliang Liu, Bo Han From Mice to Trains: Amortized Bayesian Inference on Graph Data
Svenja Jedhoff, Elizaveta Semenova, Aura Raulo, Anne Meyer, Paul-Christian Bürkner From Models to Systems: A Comprehensive Survey of Efficient Multimodal Learning
Pan Wang, Siwei Song, Hui Ji, Siqi Cao, Heng Yu, Zhijian Liu, Huanrui Yang, Yingyan Celine Lin, Beidi Chen, Mohit Bansal, Xiaoming Liu, Pengfei Zhou, Ming-Hsuan Yang, Tianlong Chen, Jingtong Hu Generalizing Coverage Plots for Simulation-Based Inference
Maximilian Lipp, Benjamin Kurt Miller, Lyubov Amitonova, Patrick Forré Generative Evolutionary Meta-Solver (GEMS): Scalable Surrogate-Free Multi-Agent Reinforcement Learning
Alakh Sharma, Gaurish Trivedi, Kartikey Singh Bhandari, Yash Sinha, Dhruv Kumar, Pratik Narang, Jagat Sesh Challa GENIE: A Visual-Only Diffusion Framework for Task- Agnostic Image Transformation
Uddeshya Singh, Aniket Thomas, Aishwarya Agarwal, Srikrishna Karanam, Biplab Banerjee GENIE: Watermarking Graph Neural Networks for Link Prediction
Venkata Sai Pranav Bachina, Aaryan Ajay Sharma, Charu Sharma, Ankit Gangwal Genomic Next-Token Predictors Are In-Context Learners
Nathan Breslow, Aayush Mishra, Michael Schatz, Anqi Liu, Mahler Revsine, Daniel Khashabi GGFlow: A Graph Flow Matching Method with Efficient Optimal Transport
Xiaoyang Hou, Tian Zhu, Milong Ren, Dongbo Bu, Xin Gao, Chunming Zhang, Shiwei Sun Graph Coarsening Using Game Theoretic Approach
Sonali Raj, Manoj Kumar, Sumit Kumar, Ruchir Gupta, Amit Kumar Jaiswal Graph Concept Bottleneck Models
Haotian Xu, Tsui-Wei Weng, Lam M. Nguyen, Tengfei Ma Graph Generation via Temporal-Aware Biased Walks
Resul Tugay, Eren Olug, Elif Ak, Kiymet Kaya, Şule Gündüz Öğüdücü GraphGini: Fostering Individual and Group Fairness in Graph Neural Networks
Anuj Kumar Sirohi, Anjali Gupta, Sandeep Kumar, Amitabha Bagchi, Sayan Ranu GriDiT: Factorized Grid-Based Diffusion for Efficient Long Image Sequence Generation
Snehal Singh Tomar, Alexandros Graikos, Arjun Krishna, Dimitris Samaras, Klaus Mueller GroundingBooth: Grounding Text-to-Image Customization
Zhexiao Xiong, Wei Xiong, Jing Shi, He Zhang, Yizhi Song, Nathan Jacobs Hierarchical Time Series Forecasting with Robust Reconciliation
Shuhei Aikawa, Aru Suzuki, Kei Yoshitake, Kanata Teshigawara, Iwabuchi Akira, Ken Kobayashi, Kazuhide Nakata Improving Detection of Rare Nodes in Hierarchical Multi-Label Learning
Isaac Xu, Martin Gillis, Ayushi Sharma, Benjamin Misiuk, Craig J. Brown, Thomas Trappenberg Improving LLM Unlearning Robustness via Random Perturbations
Dang Huu-Tien, Hoang Thanh-Tung, Anh Tuan Bui, Phuong Minh Nguyen, Le-Minh Nguyen, Naoya Inoue Improving Local Explainability by Learning Causal Graphs from Data
Daan Roos, Sebastian Gerwinn, Jan-Willem van de Meent, Sara Magliacane In-Context Learning in Presence of Spurious Correlations
Hrayr Harutyunyan, Rafayel Darbinyan, Samvel Karapetyan, Hrant Khachatrian Incorporating New Knowledge into Federated Learning: Advances, Insights, and Future Directions
Lixu Wang, Sun Yinggang, Yang Zhao, Jiaqi Wu, Jiahua Dong, Ating Yin, Qinbin Li, Qingqing Ye, Dusit Niyato, Tianwei Zhang, Kwok-Yan Lam, Yu Haining, Haibo Hu, Wei Dong Interference-Aware K-Step Reachable Communication in Multi-Agent Reinforcement Learning
Ziyu Cheng, Jinsheng Ren, Jun Yang, Zhouxian Jiang, Chenzhihang Li, Rongye Shi, Bin Liang Investigating a Model-Agnostic and Imputation-Free Approach for Irregularly-Sampled Multivariate Time-Series Modeling
Abhilash Neog, Arka Daw, Sepideh Fatemi, Medha Sawhney, Aanish Pradhan, Mary E. Lofton, Bennett J. McAfee, Adrienne Breef-Pilz, Heather L. Wander, Dexter W Howard, Cayelan C. Carey, Paul Hanson, Anuj Karpatne Iterative Compositional Data Generation for Robot Control
Anh-Quan Pham, Marcel Hussing, Shubhankar P. Patankar, Danielle Bassett, Jorge Mendez-Mendez, Eric Eaton KASPER: Kolmogorov Arnold Networks for Stock Prediction and Explainable Regimes
Vidhi Oad, Param Pathak, Nouhaila Innan, Shalini Devendrababu, Muhammad Shafique Kernel Neural Operators (KNOs) for Scalable, Memory-Efficient, Geometrically-Flexible Operator Learning
Matthew Lowery, John Turnage, Zachary Morrow, John Davis Jakeman, Akil Narayan, Shandian Zhe, Varun Shankar KITTEN: A Knowledge-Integrated Evaluation of Image Generation on Visual Entities
Hsin-Ping Huang, Xinyi Wang, Yonatan Bitton, Hagai Taitelbaum, Gaurav Singh Tomar, Ming-Wei Chang, Xuhui Jia, Kelvin C.K. Chan, Hexiang Hu, Yu-Chuan Su, Ming-Hsuan Yang Language Models Are Symbolic Learners in Arithmetic
Chunyuan Deng, Zhiqi Li, Roy Xie, Ruidi Chang, Hanjie Chen Large Language Model Reasoning Failures
Peiyang Song, Pengrui Han, Noah Goodman Large Language Model-Based Data Science Agent: A Survey
Ke Chen, Peiran Wang, Yaoning Yu, Xianyang Zhan, Haohan Wang Large Language Models for Scientific Idea Generation: A Creativity-Centered Survey
Fatemeh Shahhosseini, Arash Marioriyad, Ali Momen, Mahdieh Soleymani Baghshah, Mohammad Hossein Rohban, Shaghayegh Haghjooy Javanmard Layer Collapse Can Be Induced by Unstructured Pruning
Zhu Liao, Victor Quétu, Van-Tam Nguyen, Enzo Tartaglione Learning Energy-Based Models by Self-Normalising the Likelihood
Hugo Henri Joseph Senetaire, Paul Jeha, Jes Frellsen, Pierre-Alexandre Mattei Learning from Online Videos at Inference Time for Computer-Use Agents
Yujian Liu, Ze Wang, Hao Chen, Ximeng Sun, Xiaodong Yu, Jialian Wu, Jiang Liu, Emad Barsoum, Zicheng Liu, Shiyu Chang Learning Lagrangian Interaction Dynamics with Sampling-Based Model Order Reduction
Hrishikesh Viswanath, Yue Chang, Aleksey Panas, Julius Berner, Peter Yichen Chen, Aniket Bera Learning Long-Range Representations with Equivariant Messages
Egor Rumiantsev, Marcel F. Langer, Tulga-Erdene Sodjargal, Michele Ceriotti, Philip Loche Learning to Imitate with Less: Efficient Individual Behavior Modeling in Chess
Zhenwei Tang, Difan Jiao, Eric Xue, Reid McIlroy-Young, Jon Kleinberg, Siddhartha Sen, Ashton Anderson Learning-Augmented Robust Algorithmic Recourse
Kshitij Kayastha, Vasilis Gkatzelis, Shahin Jabbari Leveraging the True Depth of LLMs
Ramón Calvo González, Daniele Paliotta, Matteo Pagliardini, Martin Jaggi, François Fleuret LJ-Bench: Ontology-Based Benchmark for U.S. Crime
Hung Yun Tseng, Wuzhen Li, Blerina Gkotse, Grigorios Chrysos LLM-RankFusion: Mitigating Intrinsic Inconsistency in LLM-Based Ranking
Yifan Zeng, Ojas Tendolkar, Raymond Baartmans, Qingyun Wu, Lizhong Chen, Huazheng Wang LoRA-Ensemble: Efficient Uncertainty Modelling for Self-Attention Networks
Dominik J. Mühlematter, Michelle Halbheer, Alexander Becker, Dominik Narnhofer, Helge Aasen, Konrad Schindler, Mehmet Ozgur Turkoglu MACAW: A Causal Generative Model for Medical Imaging
Vibujithan Vigneshwaran, Erik Yuiti Ohara, Matthias Wilms, Nils Forkert Make Your LVLM KV Cache More Lightweight
Xihao Chen, Yangyang Guo, Roger Zimmermann MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation
Longtao Zheng, Yifan Zhang, Hanzhong Guo, Jiachun Pan, Zhenxiong Tan, Jiahao Lu, Chuanxin Tang, Bo An, Shuicheng Yan MetaSeal: Defending Against Image Attribution Forgery Through Content-Dependent Cryptographic Watermarks
Tong Zhou, Ruyi Ding, Gaowen Liu, Charles Fleming, Ramana Rao Kompella, Yunsi Fei, Xiaolin Xu, Shaolei Ren MetaSym: A Symplectic Meta-Learning Framework for Physical Intelligence
Pranav Vaidhyanathan, Aristotelis Papatheodorou, Mark T. Mitchison, Natalia Ares, Ioannis Havoutis MiniGPT-Med: A Unified Vision-Language Model for Radiology Image Understanding
Asma Alkhaldi, Raneem Alnajim, Layan Alabdullatef, Rawan Alyahya, Jun Chen, Deyao Zhu, Ahmed Z. Alsinan, Mohamed Elhoseiny Mitigating Unintended Memorization with LoRA in Federated Learning for LLMs
Thierry Bossy, Julien Tuấn Tú Vignoud, Tahseen Rabbani, Juan R. Troncoso Pastoriza, Martin Jaggi MixtureVitae: Open Web-Scale Pretraining Dataset with High Quality Instruction and Reasoning Data Built from Permissive-First Text Sources
Huu Nguyen, Victor May, Harsh Raj, Marianna Nezhurina, Yishan Wang, Yanqi Luo, Vu Minh Chien, Taishi Nakamura, Ken Tsui, Van Khue Nguyen, David Salinas, Aleksandra Krasnodębska, Christoph Schuhmann, Mats Leon Richter, Xuan-Son Vu, Jenia Jitsev Modality-Inconsistent Continual Learning of Multimodal Large Language Models
Weiguo Pian, Shijian Deng, Shentong Mo, Mingrui Liu, Yunhui Guo, Yapeng Tian Model Debiasing by Learnable Data Augmentation
Pietro Morerio, Ruggero Ragonesi, Vittorio Murino Multimodal Deception in Explainable AI: Concept-Level Backdoor Attacks on Concept Bottleneck Models
Songning Lai, Jiayu Yang, Yu Huang, Lijie Hu, TianlangXue, Zhangyi Hu, Jiaxu Li, Haicheng Liao, Zongyang Liu, Yutao Yue Multimodal Prescriptive Deep Learning
Dimitris Bertsimas, Lisa Everest, Vasiliki Stoumpou Multiscale Training of Convolutional Neural Networks
Shadab Ahamed, Niloufar Zakariaei, Eldad Haber, Moshe Eliasof Multivariate Conformal Prediction Using Optimal Transport
Michal Klein, Louis Béthune, Eugene Ndiaye, Marco Cuturi MV2MAE: Self-Supervised Video Pre-Training with Motion-Aware Multi-View Masked Autoencoders
Ketul Shah, Robert Crandall, Jie Xu, Peng Zhou, Vipin Pillai, Marian George, Mayank Bansal, Rama Chellappa Neural Conditional Transport Maps
Carlos Rodriguez-Pardo, Leonardo Chiani, Emanuele Borgonovo, Massimo Tavoni Neurons Speak in Ranges: Breaking Free from Discrete Neuronal Attribution
Muhammad Umair Haider, Hammad Rizwan, Hassan Sajjad, Peizhong Ju, A.B. Siddique Nondeterministic Polynomial-Time Problem Challenge: An Ever-Scaling Reasoning Benchmark for LLMs
Chang Yang, Ruiyu Wang, Junzhe Jiang, Qi Jiang, Qinggang Zhang, Yanchen Deng, Shuxin Li, Shuyue Hu, Bo Li, Florian T. Pokorny, Xiao Huang, Xinrun Wang Nonlinear Reconciliation: Error Reduction Theorems
Lorenzo Nespoli, Anubhab Biswas, Roberto Rocchetta, Vasco Medici Not All CAMs Are Complete: Completeness as the Key to Faithfulness
Vincenzo Buono, Peyman Sheikholharam Mashhadi, Mahmoud Rahat, Prayag Tiwari, Stefan Byttner Offline Model-Based Optimization: Comprehensive Review
Minsu Kim, Jiayao Gu, Ye Yuan, Taeyoung Yun, Zixuan Liu, Yoshua Bengio, Can Chen Offline Reinforcement Learning via Inverse Optimization
Ioannis Dimanidis, Tolga Ok, Peyman Mohajerin Esfahani On Fitting Flow Models with Large Sinkhorn Couplings
Stephen Y. Zhang, Alireza Mousavi-Hosseini, Michal Klein, Marco Cuturi On Symmetric Losses for Policy Optimization with Noisy Preferences
Soichiro Nishimori, Yu-Jie Zhang, Thanawat Lodkaew, Masashi Sugiyama On the (linear) Convergence of Generalized Newton Inexact ADMM
Zachary Frangella, Theo Diamandis, Bartolomeo Stellato, Madeleine Udell On Uncertainty Calibration for Equivariant Functions
Edward Berman, Jacob Ginesin, Marco Pacini, Robin Walters One Model for All: Multi-Objective Controllable Language Models
Qiang He, Yucheng Yang, Tianyi Zhou, Meng Fang, Mykola Pechenizkiy, Setareh Maghsudi One-Sided Matrix Completion from Ultra-Sparse Samples
Hongyang R. Zhang, Zhenshuo Zhang, Huy Nguyen, Guanghui Lan Open Technical Problems in Open-Weight AI Model Risk Management
Stephen Casper, Kyle O'Brien, Shayne Longpre, Elizabeth Seger, Kevin Klyman, Rishi Bommasani, Aniruddha Nrusimha, Ilia Shumailov, Sören Mindermann, Steven Basart, Frank Rudzicz, Kellin Pelrine, Avijit Ghosh, Andrew Strait, Robert Kirk, Dan Hendrycks, Peter Henderson, J Zico Kolter, Geoffrey Irving, Yarin Gal, Yoshua Bengio, Dylan Hadfield-Menell Optimistic Online Learning in Symmetric Cone Games
Anas Barakat, Wayne Lin, John Lazarsfeld, Antonios Varvitsiotis Order from Chaos: Physical World Understanding from Glitchy Gameplay Videos
Meng Cao, Haoran Tang, Haoze Zhao, Mingfei Han, Ruyang Liu, Qiang Sun, Xiaojun Chang, Ian Reid, Xiaodan Liang Overcoming Open-Set Approaches to Adversarial Defense
Edgar Wilfred Jatho, Armon Barton, Matthew Wright, Patrick McClure Paradoxical Noise Preference in RNNs
Noah Izaac Eckstein, Manoj Srinivasan Pave Your Own Path: Graph Gradual Domain Adaptation on Fused Gromov-Wasserstein Geodesics
Zhichen Zeng, Ruizhong Qiu, Wenxuan Bao, Tianxin Wei, Xiao Lin, Yuchen Yan, Tarek F. Abdelzaher, Jiawei Han, Hanghang Tong Physics-Informed Deep B-Spline Networks
Zhuoyuan Wang, Raffaele Romagnoli, Saviz Mowlavi, Yorie Nakahira PipelineRL: Faster On-Policy Reinforcement Learning for Long Sequence Generation
Alexandre Piché, Ehsan Kamalloo, Rafael Pardinas, Xiaoyin Chen, Dzmitry Bahdanau Policy Learning with a Language Bottleneck
Megha Srivastava, Cédric Colas, Dorsa Sadigh, Jacob Andreas Primus: Enforcing Attention Usage for 3D Medical Image Segmentation
Tassilo Wald, Saikat Roy, Fabian Isensee, Constantin Ulrich, Sebastian Ziegler, Dasha Trofimova, Raphael Stock, Michael Baumgartner, Gregor Koehler, Klaus Maier-Hein PRISM: Diversifying Dataset Distillation by Decoupling Architectural Priors
Brian Bernhard Moser, Shalini Sarode, Federico Raue, Stanislav Frolov, Krzysztof Adamkiewicz, Arundhati Shanbhag, Joachim Folz, Tobias Christian Nauen, Andreas Dengel Process Reward Models That Think
Muhammad Khalifa, Rishabh Agarwal, Lajanugen Logeswaran, Jaekyeom Kim, Hao Peng, Moontae Lee, Honglak Lee, Lu Wang Prompt-Based Adaptation in Large-Scale Vision Models: A Survey
Xi Xiao, Yunbei Zhang, Lin Zhao, Yiyang Liu, Xiaoying Liao, Zheda Mai, Xingjian Li, Xiao Wang, Hao Xu, Jihun Hamm, Xue Lin, Min Xu, Qifan Wang, Tianyang Wang, Cheng Han Quantum Rationale-Aware Graph Contrastive Learning for Jet Discrimination
Md Abrar Jahin, Md Akmol Masud, Dr. M. F. Mridha, Nilanjan Dey, Zeyar Aung Random Projection-Induced Gaussian Latent Features for Arbitrary Style Transfer
Weizhi Lu, Zhongzheng Li, Dongchen Gao, Mingrui Chen, Weiyu Li, Jinglin Zhang, Wei Zhang Re:Form --- Reducing Human Priors in Scalable Formal Software Verification with RL in LLMs: A Preliminary Study on Dafny
Chuanhao Yan, Fengdi Che, Xuhan Huang, Xu Xu, Xin Li, Yizhi Li, Xingwei Qu, Jingzhe Shi, Chenghua Lin, Yaodong Yang, Binhang Yuan, Hang Zhao, Yu Qiao, Bowen Zhou, Jie Fu Reasoning-Driven Synthetic Data Generation and Evaluation
Tim R. Davidson, Benoit Seguin, Enrico Bacis, Cesar Ilharco, Hamza Harkous Relative Geometry of Neural Forecasters: Linking Accuracy and Alignment in Learned Latent Geometry
Deniz Kucukahmetler, Maximilian Jean Hemmann, Julian Mosig von Aehrenfeld, Maximilian Amthor, Christian Deubel, Nico Scherf, Diaaeldin Taha Reproducibility Study: Understanding Multi-Agent LLM Cooperation in the GovSim Framework
Alessio Silverio, Carmen Michaela Chezan, Mathijs van Sprang, Tom Cappendijk, Martin Smit Retrospective Feature Estimation for Continual Learning
Nghia D. Nguyen, Hieu Trung Nguyen, Ang Li, Hoang Pham, Viet Anh Nguyen, Khoa D Doan Riemannian Generative Decoder
Andreas Bjerregaard, Søren Hauberg, Anders Krogh RLHF in an SFT Way: From Optimal Solution to Reward-Weighted Alignment
Yuhao Du, Zhuo Li, Pengyu Cheng, Zhihong Chen, Yuejiao Xie, Xiang Wan, Anningzhe Gao Robust Conformal Prediction for Infrequent Classes
Jens-Michalis Papaioannou, Sebastian Jäger, Alexei Figueroa, David Stutz, Betty van Aken, Keno Bressem, Wolfgang Nejdl, Felix Gers, Alexander Löser, Felix Biessmann RT2I-Bench: Evaluating Robustness of Text-to-Image Systems Against Adversarial Attacks
Athanasios Glentis, Ioannis Tsaknakis, Jiangweizhi Peng, Xun Xian, Yihua Zhang, Gaowen Liu, Charles Fleming, Mingyi Hong Safe Reinforcement Learning Using Action Projection: Safeguard the Policy or the Environment?
Hannah Markgraf, Shambhuraj Sawant, Hanna Krasowski, Lukas Schäfer, Sebastien Gros, Matthias Althoff Scalable Physical Source-to-Field Inference with Hypernetworks
Berian James, Stefan Pollok, Ignacio Peis, Elizabeth Louise Baker, Jes Frellsen, Rasmus Bjørk Semantic-Aware Adversarial Fine-Tuning for CLIP
Jiacheng Zhang, Jinhao Li, Hanxun Huang, Sarah Monazam Erfani, Benjamin I. P. Rubinstein, Feng Liu Semi-Supervised Cross-Domain Imitation Learning
Li-Min Chu, Kai-Siang Ma, Ming-Hong Chen, Ping-Chun Hsieh Sequential Causal Discovery with Noisy Language Model Priors
Prakhar Verma, David Arbour, Sunav Choudhary, Harshita Chopra, Arno Solin, Atanu R. Sinha SiLVR: A Simple Language-Based Video Reasoning Framework
Ce Zhang, Yan-Bo Lin, Ziyang Wang, Mohit Bansal, Gedas Bertasius SMILE: A Composite Lexical-Semantic Metric for Question-Answering Evaluation
Shrikant Kendre, Austin Xu, Honglu Zhou, Michael S Ryoo, Shafiq Joty, Juan Carlos Niebles Sociodynamics of Reinforcement Learning
Yann Bouteiller, Karthik Soma, Giovanni Beltrame SokoBench: Evaluating Long-Horizon Planning and Reasoning in Large Language Models
Sebastiano Monti, Carlo Nicolini, Giovanni Pellegrini, Jacopo Staiano, Bruno Lepri SpecEval: Evaluating Model Adherence to Behavior Specifications
Ahmed M Ahmed, Kevin Klyman, Yi Zeng, Sanmi Koyejo, Percy Liang SpikingBrain: Spiking Brain-Inspired Large Models
Yuqi Pan, Yupeng Feng, JingHao Zhuang, Siyu Ding, Han Xu, Zehao Liu, Bohan Sun, Yuhong Chou, Xuerui Qiu, Anlin Deng, Anjie Hu, Shurong Wang, Peng Zhou, Man Yao, Jibin Wu, Jian Yang, 孙国梁, Bo Xu, Guoqi Li SpikingMamba: Towards Energy-Efficient Large Language Models via Knowledge Distillation from Mamba
Yulong Huang, Jianxiong Tang, Chao Wang, Ziyi Wang, Jianguo Zhang, Zhichao Lu, Bojun Cheng, Luziwei Leng SPoT: Subpixel Placement of Tokens in Vision Transformers
Martine Hjelkrem-Tan, Marius Aasan, Gabriel Y. Arteaga, Adín Ramírez Rivera SSFL: Discovering Sparse Unified Subnetworks at Initialization for Efficient Federated Learning
Riyasat Ohib, Bishal Thapaliya, Gintare Karolina Dziugaite, Jingyu Liu, Vince D. Calhoun, Sergey Plis SSL-SLR: Self-Supervised Representation Learning for Sign Language Recognition
Ariel Basso Madjoukeng, Jérôme Fink, Pierre Poitier, Edith Belise Kenmogne, Benoit Frenay Steering Large Reasoning Models Towards Concise Reasoning via Flow Matching
Yawei Li, Benjamin Bergner, Yinghan Zhao, Vihang Prakash Patil, Bei Chen, Cheng Wang StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs
Jialin Yang, Dongfu Jiang, Tony He, Sherman Siu, Yuxuan Zhang, Disen Liao, Zhuofeng Li, Huaye Zeng, Yiming Jia, Haozhe Wang, Benjamin Schneider, Chi Ruan, Wentao Ma, Zhiheng Lyu, Yifei Wang, Yi Lu, Quy Duc Do, Ziyan Jiang, Ping Nie, Wenhu Chen Structure Is Supervision: Multiview Masked Autoencoders for Radiology
Sonia Laguna, Andrea Agostini, Alain Ryser, Samuel Ruiperez-Campillo, Irene Cannistraci, Moritz Vandenhirtz, Stephan Mandt, Nicolas Deperrois, Farhad Nooralahzadeh, Michael Krauthammer, Thomas M. Sutter, Julia E Vogt Subspace Based Federated Unlearning
Guanghao Li, Li Shen, Yan Sun, Yue Hu, Han Hu, Dacheng Tao Synapse: Adaptive Arbitration of Complementary Expertise in Time Series Foundational Models
Sarkar Snigdha Sarathi Das, Palash Goyal, Mihir Parmar, Yiwen Song, Long Le, Lesly Miculicich, Jinsung Yoon, Rui Zhang, Hamid Palangi, Tomas Pfister Synergistic Benefits of Joint Molecule Generation and Property Prediction
Adam Izdebski, Jan Olszewski, Pankhil Gawade, Krzysztof Koras, Serra Korkmaz, Valentin Rauscher, Jakub M. Tomczak, Ewa Szczurek Tabby: A Language Model Architecture for Tabular and Structured Data Synthesis
Sonia Cromp, Satya Sai Srinath Namburi Gnvv, Mohammed Alkhudhayri, Catherine Cao, Samuel Guo, Nicholas Roberts, Frederic Sala Template-Based Probes Are Imperfect Lenses for Counterfactual Bias Evaluation in LLMs
Farnaz Kohankhaki, D. B. Emerson, Jacob-Junqi Tian, Laleh Seyyed-Kalantari, Faiza Khan Khattak Ternary Momentum for Quantized Training
Noga Bar, Amit Attia, Michal Moshkovitz, Dotan Di Castro The Clever Hans Mirage: A Comprehensive Survey on Spurious Correlations in Machine Learning
Wenqian Ye, Luyang Jiang, Eric Xie, Guangtao Zheng, Yunsheng Ma, Xu Cao, Dongliang Guo, Daiqing Qi, Zeyu He, Yijun Tian, Christopher W. Porter, Megan Coffee, Zhe Zeng, Sheng Li, Ziran Wang, Ting-Hao Kenneth Huang, James Matthew Rehg, Henry Kautz, Aidong Zhang The Confusion Is Real: GRAPHIC - A Network Science Approach to Confusion Matrices in Deep Learning
Johanna S. Fröhlich, Bastian Heinlein, Jan U. Claar, Hans Rosenberger, Vasileios Belagiannis, Ralf R. Müller The Cost of Replicability in Active Learning
Rupkatha Hira, Dominik Kau, Jessica Sorrell The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Guibin Zhang, Hejia Geng, Xiaohang Yu, Zhenfei Yin, Zaibin Zhang, Zelin Tan, Heng Zhou, Zhong-Zhi Li, Xiangyuan Xue, Yijiang Li, Yifan Zhou, Yang Chen, Chen Zhang, Yutao Fan, Zihu Wang, Songtao Huang, Francisco Piedrahita Velez, Yue Liao, Hongru Wang, Mengyue Yang, Heng Ji, Jun Wang, Shuicheng Yan, Philip Torr, Lei Bai The Pitfalls of Text Degeneration When Aligning LLMs Through Model Merge
Peijun Qing, Lei Hsiung, Hefan Zhang, Haiquan Lu, Xingjian Diao, Chiyu Ma, Saeed Hassanpour, Soroush Vosoughi The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning VLMs
Jierun Chen, Tiezheng Yu, Haoli Bai, Lewei Yao, Jiannan Wu, Kaican Li, Fei Mi, Chaofan Tao, Lei Zhu, Manyi Zhang, Xiao-Hui Li, Lu Hou, Lifeng Shang, Qun Liu The Transformer Cookbook
Andy Yang, Christopher Watson, Anton Xue, Satwik Bhattamishra, Jose Llarena, William Merrill, Emile Dos Santos Ferreira, Anej Svete, David Chiang There Are No Champions in Supervised Long-Term Time Series Forecasting
Lorenzo Brigato, Rafael Morand, Knut Joar Strømmen, Maria Panagiotou, Markus Schmidt, Stavroula Mougiakakou Thermodynamically Consistent Latent Dynamics Identification for Parametric Systems
Xiaolong He, Yeonjong Shin, Anthony Gruber, Sohyeon Jung, Kookjin Lee, Youngsoo Choi ThinkPrune: Pruning Long Chain-of-Thought of LLMs via Reinforcement Learning
Bairu Hou, Yang Zhang, Jiabao Ji, Yujian Liu, Kaizhi Qian, Jacob Andreas, Shiyu Chang Through the Judge's Eyes: Inferred Thinking Traces Improve Reliability of LLM Raters
Xingjian Zhang, Tianhong Gao, Suliang Jin, Tianhao Wang, Teng Ye, Eytan Adar, Qiaozhu Mei TOAST: Transformer Optimization Using Adaptive and Simple Transformations
Irene Cannistraci, Simone Antonelli, Emanuele Palumbo, Thomas M. Sutter, Emanuele Rodolà, Bastian Rieck, Julia E Vogt ToMoE: Converting Dense Large Language Models to Mixture-of-Experts Through Dynamic Structural Pruning
Shangqian Gao, Ting Hua, Reza Shirkavand, Chi-Heng Lin, Zheng Tang, Zhengao Li, Longge Yuan, Fangyi Li, Zeyu Zhang, Alireza Ganjdanesh, Qian Lou, Jie Xu, Yen-Chang Hsu Towards Online Multimodal Social Interaction Understanding
Xinpeng Li, Shijian Deng, Bolin Lai, Weiguo Pian, James Matthew Rehg, Yapeng Tian Towards Scalable Language-Image Pre-Training for 3D Medical Imaging
Chenhui Zhao, Yiwei Lyu, Asadur Zaman Chowdury, Edward S Harake, Akhil Kondepudi, Akshay T Rao, Xinhai Hou, Honglak Lee, Todd C Hollon Transformers in the Dark: Navigating Unknown Search Spaces via Bandit Feedback
Jungtaek Kim, Thomas Zeng, Ziqian Lin, Minjae Lee, Chungpa Lee, Jy-yong Sohn, Hyung Il Koo, Kangwook Lee TRecViT: A Recurrent Video Transformer
Viorica Patraucean, Xu Owen He, Joseph Heyward, Chuhan Zhang, Mehdi S. M. Sajjadi, George-Cristian Muraru, Artem Zholus, Mahdi Karami, Ross Goroshin, Yutian Chen, Simon Osindero, Joao Carreira, Razvan Pascanu Uncertainty-Aware Surrogate-Based Amortized Bayesian Inference for Computationally Expensive Models
Stefania Scheurer, Philipp Reiser, Tim Brünnette, Wolfgang Nowak, Anneli Guthke, Paul-Christian Bürkner Uncertainty-Aware Systems for Human-AI Collaboration
Vasco Pearson, Jean V. Alves, Jacopo Bono, Mario A. T. Figueiredo, Pedro Bizarro Unifying VXAI: A Systematic Review and Framework for the Evaluation of Explainable AI
David Dembinsky, Adriano Lucieri, Stanislav Frolov, Hiba Najjar, Ko Watanabe, Andreas Dengel UniRec: Unified Multimodal Encoding for LLM-Based Recommendations
Zijie Lei, Tao Feng, Zhigang Hua, Yan Xie, Guanyu Lin, Shuang Yang, Ge Liu, Jiaxuan You Unlocking [CLS] Features for Continual Post-Training
Murat Onur Yildirim, Elif Ceren Gok Yildirim, Joaquin Vanschoren Unstable Unlearning: The Hidden Risk of Concept Resurgence in Diffusion Models
Vinith Menon Suriyakumar, Rohan Alur, Ayush Sekhari, Manish Raghavan, Ashia C. Wilson VEM: Environment-Free Exploration for Training GUI Agent with Value Environment Model
Mengzhuo Chen, Jiani Zheng, Lu Wang, Fangkai Yang, Chaoyun Zhang, Lingrui Mei, Wenjie Yin, Qingwei Lin, Dongmei Zhang, Saravan Rajmohan VLM2Vec-V2: Advancing Multimodal Embedding for Videos, Images, and Visual Documents
Rui Meng, Ziyan Jiang, Ye Liu, Mingyi Su, Xinyi Yang, Yuepeng Fu, Can Qin, Raghuveer Thirukovalluru, Xuan Zhang, Zeyuan Chen, Ran Xu, Caiming Xiong, Yingbo Zhou, Wenhu Chen, Semih Yavuz VScan: Rethinking Visual Token Reduction for Efficient Large Vision-Language Models
Ce Zhang, Kaixin Ma, Tianqing Fang, Wenhao Yu, Hongming Zhang, Zhisong Zhang, Haitao Mi, Dong Yu Weakly-Supervised Disentangled Representation Learning via Filter-Based Adaptive Swapping
Zhenyu Zong, Qidi Wang, Simon Yu, Hongpeng Cao, Yanbing Mao, Han Zhao, Lui Sha, Huajie Shao When Are Two Scores Better than One? Investigating Ensembles of Diffusion Models
Raphaël Razafindralambo, Rémy Sun, Damien Garreau, Frederic Precioso, Pierre-Alexandre Mattei Wikipedia in the Era of LLMs: Evolution and Risks
Siming Huang, Yuliang Xu, Mingmeng Geng, Yao Wan, Dongping Chen You Only Train Once: Differentiable Subset Selection for Omics Data
Daphné Chopard, Jorge da Silva Gonçalves, Irene Cannistraci, Thomas M. Sutter, Julia E Vogt Α-OCC: Uncertainty-Aware Camera-Based 3D Semantic Occupancy Prediction
Sanbao Su, Nuo Chen, Chenchen Lin, Felix Juefei-Xu, Chen Feng, Fei Miao