An Optimal Structured Zeroth-Order Algorithm for Non-Smooth Optimization
Abstract
Finite-difference methods are a class of algorithms designed to solve black-box optimization problems by approximating a gradient of the target function on a set of directions. In black-box optimization, the non-smooth setting is particularly relevant since, in practice, differentiability and smoothness assumptions cannot be verified. To cope with nonsmoothness, several authors use a smooth approximation of the target function and show that finite difference methods approximate its gradient. Recently, it has been proved that imposing a structure in the directions allows improving performance. However, only the smooth setting was considered. To close this gap, we introduce and analyze O-ZD, the first structured finite-difference algorithm for non-smooth black-box optimization. Our method exploits a smooth approximation of the target function and we prove that it approximates its gradient on a subset of random {\em orthogonal} directions. We analyze the convergence of O-ZD under different assumptions. For non-smooth convex functions, we obtain the optimal complexity. In the non-smooth non-convex setting, we characterize the number of iterations needed to bound the expected norm of the smoothed gradient. For smooth functions, our analysis recovers existing results for structured zeroth-order methods for the convex case and extends them to the non-convex setting. We conclude with numerical simulations where assumptions are satisfied, observing that our algorithm has very good practical performances.
Cite
Text
Rando et al. "An Optimal Structured Zeroth-Order Algorithm for Non-Smooth Optimization." Neural Information Processing Systems, 2023.Markdown
[Rando et al. "An Optimal Structured Zeroth-Order Algorithm for Non-Smooth Optimization." Neural Information Processing Systems, 2023.](https://mlanthology.org/neurips/2023/rando2023neurips-optimal/)BibTeX
@inproceedings{rando2023neurips-optimal,
title = {{An Optimal Structured Zeroth-Order Algorithm for Non-Smooth Optimization}},
author = {Rando, Marco and Molinari, Cesare and Rosasco, Lorenzo and Villa, Silvia},
booktitle = {Neural Information Processing Systems},
year = {2023},
url = {https://mlanthology.org/neurips/2023/rando2023neurips-optimal/}
}