Distributed Data Mining: Why Do More than Aggregating Models

Abstract

In this paper we deal with the problem of mining large distributed databases. We show that the aggregation of models, i.e., sets of disjoint classification rules, each built over a subdatabase is quite enough to get an aggregated model that is both predictive and descriptive, that presents excellent prediction capability and that is conceptually much simpler than the comparable techniques. These results are made possible by lifting the disjoint cover constraint on the aggregated model and by the use of a confidence coefficient associated with each rule in a weighted majority vote.

Cite

Text

Aounallah and Mineau. "Distributed Data Mining: Why Do More than Aggregating Models." International Joint Conference on Artificial Intelligence, 2007.

Markdown

[Aounallah and Mineau. "Distributed Data Mining: Why Do More than Aggregating Models." International Joint Conference on Artificial Intelligence, 2007.](https://mlanthology.org/ijcai/2007/aounallah2007ijcai-distributed/)

BibTeX

@inproceedings{aounallah2007ijcai-distributed,
  title     = {{Distributed Data Mining: Why Do More than Aggregating Models}},
  author    = {Aounallah, Mohamed and Mineau, Guy W.},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2007},
  pages     = {2645-2650},
  url       = {https://mlanthology.org/ijcai/2007/aounallah2007ijcai-distributed/}
}