Inductive Learning in Less than One Sequential Data Scan

Abstract

Most recent research of scalable inductive learning on very large dataset, decision tree construction in particular, focuses on eliminating memory constraints and reducing the number of sequential data scans. However, state-of-the-art decision tree construction algorithms still require multiple scans over the data set and use sophisticated control mechanisms and data structures. We first discuss a general inductive learning framework that scans the dataset exactly once. Then, we propose an extension based on Hoeffding's inequality that scans the dataset less than once. Our frameworks are applicable to a wide range of inductive learners. 1

Cite

Text

Fan et al. "Inductive Learning in Less than One Sequential Data Scan." International Joint Conference on Artificial Intelligence, 2003.

Markdown

[Fan et al. "Inductive Learning in Less than One Sequential Data Scan." International Joint Conference on Artificial Intelligence, 2003.](https://mlanthology.org/ijcai/2003/fan2003ijcai-inductive/)

BibTeX

@inproceedings{fan2003ijcai-inductive,
  title     = {{Inductive Learning in Less than One Sequential Data Scan}},
  author    = {Fan, Wei and Wang, Haixun and Yu, Philip S. and Lo, Shaw-Hwa},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2003},
  pages     = {595-600},
  url       = {https://mlanthology.org/ijcai/2003/fan2003ijcai-inductive/}
}