FAIR Universe HiggsML Uncertainty Dataset and Competition
Abstract
The FAIR Universe – HiggsML Uncertainty Challenge focused on measuring the physical properties of elementary particles with imperfect simulators. Participants were required to compute and report confidence intervals for a parameter of interest regarding the Higgs boson while accounting for various systematic (epistemic) uncertainties. The dataset is a tabular dataset of 28 features and 280 million instances. Each instance represents a simulated proton-proton collision as observed at CERN’s Large Hadron Collider in Geneva, Switzerland. The features of these simulations were chosen to capture key characteristics of different types of particles. These include primary attributes, such as the energy and three-dimensional momentum of the particles, as well as derived attributes, which are calculated from the primary ones using domain-specific knowledge. Additionally, a label feature designates each instance’s type of proton-proton collision, distinguishing the Higgs boson events of interest from three background sources. As outlined in this paper, the permanent dataset release allows long-term benchmarking of new techniques. The leading submissions, including Contrastive Normalising Flows and Density Ratios estimation through classification, are described. Our challenge has brought together the physics and machine learning communities to advance our understanding and methodologies in handling systematic uncertainties within AI techniques.
Cite
Text
Bhimji et al. "FAIR Universe HiggsML Uncertainty Dataset and Competition." Advances in Neural Information Processing Systems, 2025.Markdown
[Bhimji et al. "FAIR Universe HiggsML Uncertainty Dataset and Competition." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/bhimji2025neurips-fair/)BibTeX
@inproceedings{bhimji2025neurips-fair,
title = {{FAIR Universe HiggsML Uncertainty Dataset and Competition}},
author = {Bhimji, Wahid and Chakkappai, Ragansu and Chang, Po-Wen and Chou, Yuan-Tang and Diefenbacher, Sascha and Dudley, Jordan and Elsharkawy, Ibrahim and Farrell, Steven and Ghosh, Aishik and Giordano, Cristina and Guyon, Isabelle and Harris, Chris and Hashizume, Yota and Hsu, Shih-Chieh and Khoda, Elham E and Krause, Claudius and Li, Ang and Nachman, Benjamin and Rousseau, David and Schöfbeck, Robert and Shooshtari, Maryam and Schwarz, Dennis and Ullah, Ihsan and Wang, Daohan and Zhang, Yulei},
booktitle = {Advances in Neural Information Processing Systems},
year = {2025},
url = {https://mlanthology.org/neurips/2025/bhimji2025neurips-fair/}
}