Robust Distributed Training of Linear Classifiers Based on Divergence Minimization Principle

Komiyama, Junpei; Oiwa, Hidekazu; Nakagawa, Hiroshi

doi:10.1007/978-3-662-44851-9_1

Robust Distributed Training of Linear Classifiers Based on Divergence Minimization Principle

Junpei Komiyama, Hidekazu Oiwa, Hiroshi Nakagawa

ECML-PKDD 2014 pp. 1-17

doi:10.1007/978-3-662-44851-9_1 /ecmlpkdd/2014/komiyama2014ecmlpkdd-robust/

Abstract

We study a distributed training of a linear classifier in which the data is separated into many shards and each worker only has access to its own shard. The goal of this distributed training is to utilize the data of all shards to obtain a well-performing linear classifier. The iterative parameter mixture (IPM) framework (Mann et al., 2009) is a state-of-the-art distributed learning framework that has a strong theoretical guarantee when the data is clean. However, contamination on shards, which sometimes arises in real world environments, largely deteriorates the performances of the distributed training. To remedy the negative effect of the contamination, we propose a divergence minimization principle for the weight determination in IPM. From this principle, we can naturally derive the Beta-IPM scheme, which leverages the power of robust estimation based on the beta divergence. A mistake/loss bound analysis indicates the advantage of our Beta-IPM in contaminated environments. Experiments with various datasets revealed that, even when 80% of the shards are contaminated, Beta-IPM can suppress the influence of the contamination.

PDF ECML-PKDD Semantic Scholar

Cite

Text

Komiyama et al. "Robust Distributed Training of Linear Classifiers Based on Divergence Minimization Principle." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2014. doi:10.1007/978-3-662-44851-9_1

Markdown

[Komiyama et al. "Robust Distributed Training of Linear Classifiers Based on Divergence Minimization Principle." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2014.](https://mlanthology.org/ecmlpkdd/2014/komiyama2014ecmlpkdd-robust/) doi:10.1007/978-3-662-44851-9_1

BibTeX

@inproceedings{komiyama2014ecmlpkdd-robust,
  title     = {{Robust Distributed Training of Linear Classifiers Based on Divergence Minimization Principle}},
  author    = {Komiyama, Junpei and Oiwa, Hidekazu and Nakagawa, Hiroshi},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2014},
  pages     = {1-17},
  doi       = {10.1007/978-3-662-44851-9_1},
  url       = {https://mlanthology.org/ecmlpkdd/2014/komiyama2014ecmlpkdd-robust/}
}