Bundle CDN: A Highly Parallelized Approach for Large-Scale ℓ1-Regularized Logistic Regression
Abstract
Parallel coordinate descent algorithms emerge with the growing demand of large-scale optimization. In general, previous algorithms are usually limited by their divergence under high degree of parallelism (DOP), or need data pre-process to avoid divergence. To better exploit parallelism, we propose a coordinate descent based parallel algorithm without needing of data pre-process, termed as Bundle Coordinate Descent Newton (BCDN), and apply it to large-scale ℓ_1-regularized logistic regression. BCDN first randomly partitions the feature set into Q non-overlapping subsets/bundles in a Gauss-Seidel manner, where each bundle contains P features. For each bundle, it finds the descent directions for the P features in parallel, and performs P -dimensional Armijo line search to obtain the stepsize. By theoretical analysis on global convergence, we show that BCDN is guaranteed to converge with a high DOP. Experimental evaluations over five public datasets show that BCDN can better exploit parallelism and outperforms state-of-the-art algorithms in speed, without losing testing accuracy.
Cite
Text
Bian et al. "Bundle CDN: A Highly Parallelized Approach for Large-Scale ℓ1-Regularized Logistic Regression." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2013. doi:10.1007/978-3-642-40994-3_6Markdown
[Bian et al. "Bundle CDN: A Highly Parallelized Approach for Large-Scale ℓ1-Regularized Logistic Regression." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2013.](https://mlanthology.org/ecmlpkdd/2013/bian2013ecmlpkdd-bundle/) doi:10.1007/978-3-642-40994-3_6BibTeX
@inproceedings{bian2013ecmlpkdd-bundle,
title = {{Bundle CDN: A Highly Parallelized Approach for Large-Scale ℓ1-Regularized Logistic Regression}},
author = {Bian, Yatao and Li, Xiong and Cao, Mingqi and Liu, Yuncai},
booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
year = {2013},
pages = {81-95},
doi = {10.1007/978-3-642-40994-3_6},
url = {https://mlanthology.org/ecmlpkdd/2013/bian2013ecmlpkdd-bundle/}
}