Preference-Based Gradient Estimation for ML-Guided Approximate Combinatorial Optimization

Mielke, Arman; Bauknecht, Uwe; Strauss, Thilo; Niepert, Mathias

Preference-Based Gradient Estimation for ML-Guided Approximate Combinatorial Optimization

Arman Mielke, Uwe Bauknecht, Thilo Strauss, Mathias Niepert

TMLR 2026

/tmlr/2026/mielke2026tmlr-preferencebased/

Abstract

Combinatorial optimization (CO) problems arise across a broad spectrum of domains, including medicine, logistics, and manufacturing. While exact solutions are often computationally infeasible, many practical applications require high-quality solutions within a given time budget. To address this, we propose a learning-based approach that enhances existing non-learned heuristics for CO. Specifically, we parameterize these heuristics and train graph neural networks (GNNs) to predict parameter values that yield near-optimal solutions. Our method is trained end-to-end in a self-supervised fashion, using a novel gradient estimation scheme that treats the heuristic as a black box. This approach combines the strengths of learning and traditional algorithms: the GNN learns from data to guide the algorithm toward better solutions, while the heuristic ensures feasibility. We validate our method on two well-known combinatorial optimization problems: the travelling salesman problem (TSP) and the minimum k-cut problem. Our results demonstrate that the proposed approach is competitive with state-of-the-art learned CO solvers.

PDF TMLR OpenReview Code Semantic Scholar

Cite

Text

Mielke et al. "Preference-Based Gradient Estimation for ML-Guided Approximate Combinatorial Optimization." Transactions on Machine Learning Research, 2026.

Markdown

[Mielke et al. "Preference-Based Gradient Estimation for ML-Guided Approximate Combinatorial Optimization." Transactions on Machine Learning Research, 2026.](https://mlanthology.org/tmlr/2026/mielke2026tmlr-preferencebased/)

BibTeX

@article{mielke2026tmlr-preferencebased,
  title     = {{Preference-Based Gradient Estimation for ML-Guided Approximate Combinatorial Optimization}},
  author    = {Mielke, Arman and Bauknecht, Uwe and Strauss, Thilo and Niepert, Mathias},
  journal   = {Transactions on Machine Learning Research},
  year      = {2026},
  url       = {https://mlanthology.org/tmlr/2026/mielke2026tmlr-preferencebased/}
}