DMLR 2024

27 papers

ATCO2 Corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications Juan Pablo Zuluaga Gomez, Karel Veselý, Igor Szöke, Alexander Blatt, Petr Motlicek, Martin Kocour, Khalid Choukri, Iuliia Nigmatulina, Claudia Cevenini, Allan Tart, Jan Cernocký, Dietrich Klakow
Benchmarking Edge Regression on Temporal Networks Muberra Ozmen, Florence Regol, Thomas Markovich
Benchmarking Robustness of Multimodal Image-Text Models Under Distribution Shift Jielin Qiu, Yi Zhu, Xingjian Shi, Florian Wenzel, Zhiqiang Tang, Ding Zhao, Bo Li, Mu Li
Building Better Datasets: Seven Recommendations for Responsible Design from Dataset Creators Will Orr, Kate Crawford
ComPile: A Large IR Dataset from Production Sources Aiden Grossman, Ludger Paehler, Konstantinos Parasyris, Tal Ben-Nun, Jacob Hegna, William S. Moses, Jose M Monsalve Diaz, Mircea Trofin, Johannes Doerfert
Datasets and Benchmarks for Offline Safe Reinforcement Learning Zuxin Liu, Zijian Guo, Haohong Lin, Yihang Yao, Jiacheng Zhu, Zhepeng Cen, Hanjiang Hu, Wenhao Yu, Tingnan Zhang, Jie Tan, Ding Zhao
Deep Neural Network Benchmarks for Selective Classification Andrea Pugnana, Lorenzo Perini, Jesse Davis, Salvatore Ruggieri
Detecting Errors in a Numerical Response via Any Regression Model Hang Zhou, Jonas Mueller, Mayank Kumar, Jane-Ling Wang, Jing Lei
DMLR: Data-Centric Machine Learning Research - Past, Present and Future Luis Oala, Manil Maskey, Lilith Bat-Leah, Alicia Parrish, Nezihe Merve Gürel, Tzu-Sheng Kuo, Yang Liu, Rotem Dror, Danilo Brajovic, Xiaozhe Yao, Max Bartolo, William A Gaviria Rojas, Ryan Hileman, Rainier Aliment, Michael W. Mahoney, Meg Risdal, Matthew Lease, Wojciech Samek, Debojyoti Dutta, Curtis G Northcutt, Cody Coleman, Braden Hancock, Bernard Koch, Girmaw Abebe Tadesse, Bojan Karlaš, Ahmed Alaa, Adji Bousso Dieng, Natasha Noy, Vijay Janapa Reddi, James Zou, Praveen Paritosh, Mihaela van der Schaar, Kurt Bollacker, Lora Aroyo, Ce Zhang, Joaquin Vanschoren, Isabelle Guyon, Peter Mattson
Evaluating Durability: Benchmark Insights into Image and Text Watermarking Jielin Qiu, William Han, Xuandong Zhao, Shangbang Long, Christos Faloutsos, Lei Li
FedAIoT: A Federated Learning Benchmark for Artificial Intelligence of Things Samiul Alam, Tuo Zhang, Tiantian Feng, Hui Shen, Zhichao Cao, Dong Zhao, Jeonggil Ko, Kiran Somasundaram, Shrikanth Narayanan, Salman Avestimehr, Mi Zhang
Forecasting Electric Vehicle Charging Station Occupancy: Smarter Mobility Data Challenge Yvenn Amara-Ouali, Yannig Goude, Nathan Doumèche, Pascal Veyret, Alexis Thomas, Daniel Hebenstreit, Thomas Wedenig, Arthur Satouf, Aymeric Jan, Yannick Deleuze, Paul Berhaut, Sebastien Treguer
GlycoNMR: Dataset and Benchmark of Carbohydrate-Specific NMR Chemical Shift for Machine Learning Research Zizhang Chen, Ryan Paul Badman, Bethany Lachele Foley, Robert J Woods, Pengyu Hong
Highlighting Challenges of State-of-the-Art Semantic Segmentation with HAIR - A Dataset of Historical Aerial Images Saeid Shamsaliei, Odd Erik Gundersen, Knut Tore Alfredsen, Jo Halvard Halleraker, Anders Foldvik
LabelBench: A Comprehensive Framework for Benchmarking Adaptive Label-Efficient Learning Jifan Zhang, Yifang Chen, Gregory Canal, Arnav Mohanty Das, Gantavya Bhatt, Stephen Mussmann, Yinglun Zhu, Jeff Bilmes, Simon Shaolei Du, Kevin Jamieson, Robert D Nowak
NAFlora-1m: Continental-Scale High-Resolution Fine-Grained Plant Classification Dataset John Park, Riccardo de Lutio, Brendan Rappazzo, Barbara Ambrose, Fabian Michelangeli, Kimberly Watson, Serge Belongie, Damon Little
On Catastrophic Inheritance of Large Foundation Models Hao Chen, Bhiksha Raj, Xing Xie, Jindong Wang
On Minimizing the Training Set Fill Distance in Machine Learning Regression Paolo Climaco, Jochen Garcke
OpenOOD V1.5: Enhanced Benchmark for Out-of-Distribution Detection Jingyang Zhang, Jingkang Yang, Pengyun Wang, Haoqi Wang, Yueqian Lin, Haoran Zhang, Yiyou Sun, Xuefeng Du, Yixuan Li, Ziwei Liu, Yiran Chen, Hai Li
Potion: Towards Poison Unlearning Stefan Schoepf, Jack Foster, Alexandra Brintrup
Properties of Alternative Data for Fairer Credit Risk Predictions Jung Youn Lee, Joonhyuk Yang
Rethinking Symbolic Regression Datasets and Benchmarks for Scientific Discovery Yoshitomo Matsubara, Naoya Chiba, Ryo Igarashi, Yoshitaka Ushiku
The Matrix Reloaded: Towards Counterfactual Group Fairness in Machine Learning Mariana Pinto, Andre V Carreiro, Pedro Madeira, Alberto Lopez, Hugo Gamboa
The Nine Lives of ImageNet: A Sociotechnical Retrospective of a Foundation Dataset and the Limits of Automated Essentialism Sasha Luccioni, Kate Crawford
VALUED - Vision and Logical Understanding Evaluation Dataset Soumadeep Saha, Saptarshi Saha, Utpal Garain
When Is Off-Policy Evaluation (Reward Modeling) Useful in Contextual Bandits? A Data-Centric Perspective Hao Sun, Alex James Chan, Nabeel Seedat, Alihan Hüyük, Mihaela van der Schaar
You Can't Handle the (dirty) Truth: Data-Centric Insights Improve Pseudo-Labeling Nabeel Seedat, Nicolas Huynh, Fergus Imrie, Mihaela van der Schaar