Fast, Robust Adaptive Control by Learning Only Forward Models

Abstract

A large class of motor control tasks requires that on each cycle the con(cid:173) troller is told its current state and must choose an action to achieve a specified, state-dependent, goal behaviour. This paper argues that the optimization of learning rate, the number of experimental control deci(cid:173) sions before adequate performance is obtained, and robustness is of prime importance-if necessary at the expense of computation per control cy(cid:173) cle and memory requirement. This is motivated by the observation that a robot which requires two thousand learning steps to achieve adequate performance, or a robot which occasionally gets stuck while learning, will always be undesirable, whereas moderate computational expense can be accommodated by increasingly powerful computer hardware. It is not un(cid:173) reasonable to assume the existence of inexpensive 100 Mflop controllers within a few years and so even processes with control cycles in the low tens of milliseconds will have millions of machine instructions in which to make their decisions. This paper outlines a learning control scheme which aims to make effective use of such computational power.

Cite

Text

Moore. "Fast, Robust Adaptive Control by Learning Only Forward Models." Neural Information Processing Systems, 1991.

Markdown

[Moore. "Fast, Robust Adaptive Control by Learning Only Forward Models." Neural Information Processing Systems, 1991.](https://mlanthology.org/neurips/1991/moore1991neurips-fast/)

BibTeX

@inproceedings{moore1991neurips-fast,
  title     = {{Fast, Robust Adaptive Control by Learning Only Forward Models}},
  author    = {Moore, Andrew W.},
  booktitle = {Neural Information Processing Systems},
  year      = {1991},
  pages     = {571-578},
  url       = {https://mlanthology.org/neurips/1991/moore1991neurips-fast/}
}