Fast, Robust Adaptive Control by Learning Only Forward Models
Abstract
A large class of motor control tasks requires that on each cycle the con(cid:173) troller is told its current state and must choose an action to achieve a specified, state-dependent, goal behaviour. This paper argues that the optimization of learning rate, the number of experimental control deci(cid:173) sions before adequate performance is obtained, and robustness is of prime importance-if necessary at the expense of computation per control cy(cid:173) cle and memory requirement. This is motivated by the observation that a robot which requires two thousand learning steps to achieve adequate performance, or a robot which occasionally gets stuck while learning, will always be undesirable, whereas moderate computational expense can be accommodated by increasingly powerful computer hardware. It is not un(cid:173) reasonable to assume the existence of inexpensive 100 Mflop controllers within a few years and so even processes with control cycles in the low tens of milliseconds will have millions of machine instructions in which to make their decisions. This paper outlines a learning control scheme which aims to make effective use of such computational power.
Cite
Text
Moore. "Fast, Robust Adaptive Control by Learning Only Forward Models." Neural Information Processing Systems, 1991.Markdown
[Moore. "Fast, Robust Adaptive Control by Learning Only Forward Models." Neural Information Processing Systems, 1991.](https://mlanthology.org/neurips/1991/moore1991neurips-fast/)BibTeX
@inproceedings{moore1991neurips-fast,
title = {{Fast, Robust Adaptive Control by Learning Only Forward Models}},
author = {Moore, Andrew W.},
booktitle = {Neural Information Processing Systems},
year = {1991},
pages = {571-578},
url = {https://mlanthology.org/neurips/1991/moore1991neurips-fast/}
}