Optimizing the Optimal Weighted Average: Efficient Distributed Sparse Classification

Lu, Fred; Curtin, Ryan R.; Raff, Edward; Ferraro, Francis; Holt, James

doi:10.1007/978-3-032-06096-9_9

Optimizing the Optimal Weighted Average: Efficient Distributed Sparse Classification

Fred Lu, Ryan R. Curtin, Edward Raff, Francis Ferraro, James Holt

ECML-PKDD 2025 pp. 147-163

doi:10.1007/978-3-032-06096-9_9 /ecmlpkdd/2025/lu2025ecmlpkdd-optimizing/

Abstract

While distributed training is often viewed as a solution to optimizing linear models on increasingly large datasets, inter-machine communication costs of popular distributed approaches can dominate as data dimensionality increases. Recent work on non-interactive algorithms shows that approximate solutions for linear models can be obtained efficiently with only a single round of communication among machines. However, this approximation often degenerates as the number of machines increases. In this paper, building on the recent optimal weighted average method, we introduce a new technique, ACOWA , that allows an extra round of communication to achieve noticeably better approximation quality with minor runtime increases. Results show that for sparse distributed logistic regression, ACOWA obtains solutions that are more faithful to the empirical risk minimizer and attain substantially higher accuracy than other distributed algorithms. We also introduce isoefficiency analysis to distributed logistic regression and show that ACOWA maintains favorable scaling with respect to data size and processor count relative to prior distributed algorithms.

PDF ECML-PKDD Semantic Scholar

Cite

Text

Lu et al. "Optimizing the Optimal Weighted Average: Efficient Distributed Sparse Classification." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2025. doi:10.1007/978-3-032-06096-9_9

Markdown

[Lu et al. "Optimizing the Optimal Weighted Average: Efficient Distributed Sparse Classification." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2025.](https://mlanthology.org/ecmlpkdd/2025/lu2025ecmlpkdd-optimizing/) doi:10.1007/978-3-032-06096-9_9

BibTeX

@inproceedings{lu2025ecmlpkdd-optimizing,
  title     = {{Optimizing the Optimal Weighted Average: Efficient Distributed Sparse Classification}},
  author    = {Lu, Fred and Curtin, Ryan R. and Raff, Edward and Ferraro, Francis and Holt, James},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2025},
  pages     = {147-163},
  doi       = {10.1007/978-3-032-06096-9_9},
  url       = {https://mlanthology.org/ecmlpkdd/2025/lu2025ecmlpkdd-optimizing/}
}