Robust Distributed Training of Linear Classifiers Based on Divergence Minimization Principle
Abstract
We study distributed training of a linear classifier in which the data is split into many shards and each worker has access only to its own shard. The goal is to use the data from all shards to obtain a well-performing linear classifier. The iterative parameter mixture (IPM) framework (Mann et al., 2009) is a state-of-the-art distributed learning framework with a strong theoretical guarantee when the data is clean. However, contamination of shards, which sometimes arises in real-world environments, substantially degrades the performance of distributed training. To remedy the negative effect of contamination, we propose a divergence minimization principle for determining the weights in IPM. From this principle, we naturally derive the Beta-IPM scheme, which leverages robust estimation based on the beta divergence. A mistake/loss bound analysis indicates the advantage of Beta-IPM in contaminated environments. Experiments on various datasets show that Beta-IPM can suppress the influence of contamination even when 80% of the shards are contaminated.
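To make the IPM setting concrete, the following is a minimal sketch of iterative parameter mixing, not the authors' implementation. It assumes a perceptron as the local learner (the abstract only says "linear classifier"), and the names `perceptron_epoch`, `ipm_train`, `mix_weights`, `accuracy`, and `make_separable_data` are hypothetical. The Beta-IPM weighting rule is not stated in the abstract, so the mixture weights are left as a plain argument that defaults to a uniform mixture.

```python
import random


def perceptron_epoch(w, shard):
    """Run one perceptron pass over a shard, starting from weights w."""
    w = list(w)
    for x, y in shard:  # labels y are in {-1, +1}
        score = sum(wi * xi for wi, xi in zip(w, x))
        if y * score <= 0:  # mistake: move w toward y * x
            w = [wi + y * xi for wi, xi in zip(w, x)]
    return w


def ipm_train(shards, dim, epochs=10, mix_weights=None):
    """Iterative parameter mixing: local training, then a weighted average."""
    k = len(shards)
    if mix_weights is None:
        mix_weights = [1.0 / k] * k  # assumption: uniform mixture
    if len(mix_weights) != k:
        raise ValueError("need one mixture weight per shard")
    w = [0.0] * dim
    for _ in range(epochs):
        # Each worker sees only its own shard and starts from the shared w.
        local = [perceptron_epoch(w, shard) for shard in shards]
        # Mix the local weight vectors coordinate by coordinate.
        w = [sum(m * lw[j] for m, lw in zip(mix_weights, local))
             for j in range(dim)]
    return w


def accuracy(w, data):
    """Fraction of examples classified with a strictly positive margin."""
    correct = sum(1 for x, y in data
                  if y * sum(wi * xi for wi, xi in zip(w, x)) > 0)
    return correct / len(data)


def make_separable_data(n, margin=0.5, seed=0):
    """Toy 2-D data labeled by sign(x0 - x1), with a gap around the boundary."""
    rng = random.Random(seed)
    data = []
    while len(data) < n:
        x = (rng.uniform(-1, 1), rng.uniform(-1, 1))
        d = x[0] - x[1]
        if abs(d) >= margin:
            data.append((x, 1 if d > 0 else -1))
    return data


if __name__ == "__main__":
    data = make_separable_data(200)
    shards = [data[i::4] for i in range(4)]  # four workers, one shard each
    w = ipm_train(shards, dim=2, epochs=100)
    print("training accuracy:", accuracy(w, data))
```

Passing a non-uniform `mix_weights` list is where a robust scheme would plug in: shards judged to be contaminated would receive smaller weights, which is the role the abstract assigns to the divergence minimization principle.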
Cite
Text
Komiyama et al. "Robust Distributed Training of Linear Classifiers Based on Divergence Minimization Principle." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2014. doi:10.1007/978-3-662-44851-9_1
Markdown
[Komiyama et al. "Robust Distributed Training of Linear Classifiers Based on Divergence Minimization Principle." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2014.](https://mlanthology.org/ecmlpkdd/2014/komiyama2014ecmlpkdd-robust/) doi:10.1007/978-3-662-44851-9_1
BibTeX
@inproceedings{komiyama2014ecmlpkdd-robust,
title = {{Robust Distributed Training of Linear Classifiers Based on Divergence Minimization Principle}},
author = {Komiyama, Junpei and Oiwa, Hidekazu and Nakagawa, Hiroshi},
booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
year = {2014},
pages = {1--17},
doi = {10.1007/978-3-662-44851-9_1},
url = {https://mlanthology.org/ecmlpkdd/2014/komiyama2014ecmlpkdd-robust/}
}