Speeding up Hyper-Parameter Optimization by Extrapolation of Learning Curves Using Previous Builds
Abstract
Recent work has shown that the usage of extrapolation of learning curves to determine when to terminate a training build has been shown to be effective in reducing the number of epochs of training required for finding a good performing hyper-parameter configuration. However, the current technique uses the information only from the current build to make the prediction. We propose the usage of a simple regression based extrapolation model that uses the trajectories from previous builds to make predictions of new builds. This can be used to terminate poorly performing builds and thus, speed up hyper-parameter search with performance comparable to non-augmented hyper-parameter optimization techniques. We compare the predictions made by our model against that of the existing extrapolation technique in different tasks. We incorporate our approach into a pre-existing termination criterion. We incorporate this termination criterion into an existing hyper-parameter optimization toolkit. We analyze the performance of our approach and contrast it against a baseline in terms of quality of prediction in three different tasks. We show that our approach yields builds with performance comparable to the non-augmented version with fewer epochs, and outperforms an existing parametric extrapolation technique for two out of three tasks in terms of number of required epochs.
Cite
Text
Chandrashekaran and Lane. "Speeding up Hyper-Parameter Optimization by Extrapolation of Learning Curves Using Previous Builds." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2017. doi:10.1007/978-3-319-71249-9_29Markdown
[Chandrashekaran and Lane. "Speeding up Hyper-Parameter Optimization by Extrapolation of Learning Curves Using Previous Builds." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2017.](https://mlanthology.org/ecmlpkdd/2017/chandrashekaran2017ecmlpkdd-speeding/) doi:10.1007/978-3-319-71249-9_29BibTeX
@inproceedings{chandrashekaran2017ecmlpkdd-speeding,
title = {{Speeding up Hyper-Parameter Optimization by Extrapolation of Learning Curves Using Previous Builds}},
author = {Chandrashekaran, Akshay and Lane, Ian R.},
booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
year = {2017},
pages = {477-492},
doi = {10.1007/978-3-319-71249-9_29},
url = {https://mlanthology.org/ecmlpkdd/2017/chandrashekaran2017ecmlpkdd-speeding/}
}