Statistical Methods for Analyzing Speedup Learning Experiments

Abstract

Speedup learning systems are typically evaluated by comparing their impact on a problem solver's performance. The impact is measured by running the problem solver, before and after learning, on a sample of problems randomly drawn from some distribution. Often, the experimenter imposes a bound on the CPU time the problem solver is allowed to spend on any individual problem. Segre et al. (1991) argue that the experimenter's choice of time bound can bias the results of the experiment. To address this problem, we present statistical hypothesis tests specifically designed to analyze speedup data and eliminate this bias. We apply the tests to the data reported by Etzioni (1990a) and show that most (but not all) of the speedups observed are statistically significant.

Cite

Text

Etzioni and Etzioni. "Statistical Methods for Analyzing Speedup Learning Experiments." Machine Learning, 1994. doi:10.1023/A:1022617931401

Markdown

[Etzioni and Etzioni. "Statistical Methods for Analyzing Speedup Learning Experiments." Machine Learning, 1994.](https://mlanthology.org/mlj/1994/etzioni1994mlj-statistical/) doi:10.1023/A:1022617931401

BibTeX

@article{etzioni1994mlj-statistical,
  title     = {{Statistical Methods for Analyzing Speedup Learning Experiments}},
  author    = {Etzioni, Oren and Etzioni, Ruth},
  journal   = {Machine Learning},
  year      = {1994},
  pages     = {333-347},
  doi       = {10.1023/A:1022617931401},
  volume    = {14},
  url       = {https://mlanthology.org/mlj/1994/etzioni1994mlj-statistical/}
}