FlowBench: A Large Scale Benchmark for Flow Simulation over Complex Geometries
Abstract
Simulating fluid flow around arbitrary shapes is key to solving various engineering problems. However, simulating flow physics across complex geometries remains numerically challenging and computationally resource-intensive, particularly when using conventional PDE solvers. Machine learning methods offer attractive opportunities to create fast and adaptable PDE solvers. However, benchmark datasets to measure the performance of such methods are scarce, especially for flow physics across complex geometries. We introduce FlowBench, a dataset for neural simulators with over 10K samples, which is currently larger than any publicly available flow physics dataset. FlowBench contains flow simulation data across complex geometries (parametric vs. non-parametric), spanning a range of flow conditions (Reynolds number and Grashoff number), capturing a diverse array of flow phenomena (steady vs. transient; forced vs. free convection), and for both 2D and 3D. FlowBench contains over 10K data samples, with each sample the outcome of a fully resolved, direct numerical simulation using a well-validated simulator framework designed for modeling transport phenomena in complex geometries. For each sample, we include velocity, pressure, and temperature field data at 3 different resolutions and several summary statistics features of engineering relevance (such as coefficients of lift and drag, and Nusselt numbers). We envision that FlowBench will enable evaluating the interplay between complex geometry, coupled flow phenomena, and data sufficiency on the performance of current, and future, neural PDE solvers. We enumerate several evaluation metrics to help rank order the performance of current (and future) neural PDE solvers. We benchmark the performance of several methods, including Fourier Neural Operators (FNO), Convolutional Neural Operators (CNO), DeepONets, and recent foundational models. This dataset (https://huggingface.co/datasets/BGLab/FlowBench/tree/main) will be a valuable resource for developing and evaluating AI-for-science approaches, specifically neural PDE solvers, that model complex fluid dynamics around 2D and 3D objects.
Cite
Text
Tali et al. "FlowBench: A Large Scale Benchmark for Flow Simulation over Complex Geometries." Data-centric Machine Learning Research, 2025.Markdown
[Tali et al. "FlowBench: A Large Scale Benchmark for Flow Simulation over Complex Geometries." Data-centric Machine Learning Research, 2025.](https://mlanthology.org/dmlr/2025/tali2025dmlr-flowbench/)BibTeX
@article{tali2025dmlr-flowbench,
title = {{FlowBench: A Large Scale Benchmark for Flow Simulation over Complex Geometries}},
author = {Tali, Ronak and Rabeh, Ali and Yang, Cheng-Hau and Shadkhah, Mehdi and Karki, Samundra and Upadhyaya, Abhisek and Dhakshinamoorthy, Suriya and Saadati, Marjan and Sarkar, Soumik and Krishnamurthy, Adarsh and Hegde, Chinmay and Balu, Aditya and Ganapathysubramanian, Baskar},
journal = {Data-centric Machine Learning Research},
year = {2025},
pages = {1-35},
volume = {2},
url = {https://mlanthology.org/dmlr/2025/tali2025dmlr-flowbench/}
}