Learning CPG Sensory Feedback with Policy Gradient for Biped Locomotion for a Full-Body Humanoid

Endo, Gen; Morimoto, Jun; Matsubara, Takamitsu; Nakanishi, Jun; Cheng, Gordon

Learning CPG Sensory Feedback with Policy Gradient for Biped Locomotion for a Full-Body Humanoid

Gen Endo, Jun Morimoto, Takamitsu Matsubara, Jun Nakanishi, Gordon Cheng

AAAI 2005 pp. 1267-1273

/aaai/2005/endo2005aaai-learning/

Abstract

This paper describes a learning framework for a central pattern generator based biped locomotion controller using a policy gradient method. Our goals in this study are to achieve biped walking with a 3D hardware humanoid, and to develop an efficient learning algorithm with CPG by reducing the dimensionality of the state space used for learning. We demonstrate that an appropriate feed-back controller can be acquired within a thousand trials by numerical simulations and the obtained controller in numerical simulation achieves stable walking with a physical robot in the real world. Numerical simulations and hardware experiments evaluated walking velocity and stability. Furthermore, we present the possibility of an additional online learning using a hardware robot to improve the controller within 200 iterations.

PDF AAAI Semantic Scholar

Cite

Text

Endo et al. "Learning CPG Sensory Feedback with Policy Gradient for Biped Locomotion for a Full-Body Humanoid." AAAI Conference on Artificial Intelligence, 2005.

Markdown

[Endo et al. "Learning CPG Sensory Feedback with Policy Gradient for Biped Locomotion for a Full-Body Humanoid." AAAI Conference on Artificial Intelligence, 2005.](https://mlanthology.org/aaai/2005/endo2005aaai-learning/)

BibTeX

@inproceedings{endo2005aaai-learning,
  title     = {{Learning CPG Sensory Feedback with Policy Gradient for Biped Locomotion for a Full-Body Humanoid}},
  author    = {Endo, Gen and Morimoto, Jun and Matsubara, Takamitsu and Nakanishi, Jun and Cheng, Gordon},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2005},
  pages     = {1267-1273},
  url       = {https://mlanthology.org/aaai/2005/endo2005aaai-learning/}
}