Optimizing the Optimal Weighted Average: Efficient Distributed Sparse Classification

Abstract

While distributed training is often viewed as a solution to optimizing linear models on increasingly large datasets, inter-machine communication costs of popular distributed approaches can dominate as data dimensionality increases. Recent work on non-interactive algorithms shows that approximate solutions for linear models can be obtained efficiently with only a single round of communication among machines. However, this approximation often degenerates as the number of machines increases. In this paper, building on the recent optimal weighted average method, we introduce a new technique, ACOWA , that allows an extra round of communication to achieve noticeably better approximation quality with minor runtime increases. Results show that for sparse distributed logistic regression, ACOWA obtains solutions that are more faithful to the empirical risk minimizer and attain substantially higher accuracy than other distributed algorithms. We also introduce isoefficiency analysis to distributed logistic regression and show that ACOWA maintains favorable scaling with respect to data size and processor count relative to prior distributed algorithms.

Cite

Text

Lu et al. "Optimizing the Optimal Weighted Average: Efficient Distributed Sparse Classification." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2025. doi:10.1007/978-3-032-06096-9_9

Markdown

[Lu et al. "Optimizing the Optimal Weighted Average: Efficient Distributed Sparse Classification." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2025.](https://mlanthology.org/ecmlpkdd/2025/lu2025ecmlpkdd-optimizing/) doi:10.1007/978-3-032-06096-9_9

BibTeX

@inproceedings{lu2025ecmlpkdd-optimizing,
  title     = {{Optimizing the Optimal Weighted Average: Efficient Distributed Sparse Classification}},
  author    = {Lu, Fred and Curtin, Ryan R. and Raff, Edward and Ferraro, Francis and Holt, James},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2025},
  pages     = {147-163},
  doi       = {10.1007/978-3-032-06096-9_9},
  url       = {https://mlanthology.org/ecmlpkdd/2025/lu2025ecmlpkdd-optimizing/}
}