Robot learning by demonstration
From Scholarpedia
| This article has not been peer-reviewed or accepted for publication yet; It may be unfinished, contain inaccuracies, or unapproved changes. | ||||||||||||||||||||
Author: Dr. Aude Billard, EPFL Ecole Polytechnique Federale de Lausanne
Dr. Aude Billard accepted the invitation on 14 July 2007 (self-imposed deadline: 14 January 2008).
This article will briefly cover: Research on robot programming by demonstration from the standpoint of machine learning and computational neuroscience.
WORK IN PROGRESS!!
Contents |
Introduction
Robot Programming by demonstration (PbD) develops algorithms by which a human can teach a robot a task without explicitly programming the robot, but rather by simply "showing" to the robot how to do the task. The robot learns from observing the human.
This way of teaching resembles the way human learn. The method is thus deemed to provide a powerful way for lay people (workers in factories or end-users) to teach robots. PbD requires usually less than ten demonstrations of the same task to make the training sessions bearable for the user.
Background and History
Robot PbD started in the 80's. The rationale for moving from purely pre-programmed robots to very flexible user-based interfaces for training robots to perform a task is three-fold.
First and foremost, PbD, also referred to as imitation learning, is a powerful mechanism for reducing the complexity of search spaces for learning. When observing either good or bad examples, one can reduce the search for a possible solution, by either starting the search from the observed good solution (local optima), or conversely,by eliminating from the search space what is known as a bad solution. Imitation learning is, thus, a powerful tool for enhancing and accelerating learning in both animals and artifacts.
Second, imitation learning offers an implicit means of training a machine, such that explicit and tedious programming of a task by a human user can be minimized or eliminated. Imitation learning is thus a "natural" means of interacting with a machine that would be accessible to lay people.
Third, studying and modeling the coupling of perception and action, which is at the core of imitation learning, helps us to understand the mechanisms by which the self-organization of perception and action could arise during development. The reciprocal interaction of perception and action could explain how competence in motor control can be grounded in rich structure of perceptual variables, and vice versa, how the processes of perception can develop as means to create successful actions.
PbD promises were thus multiple. On the one hand, one hoped that it would make learning faster, in contrast to tedious reinforcement learning methods or trials-and-error learning. On the other hand, one expected that the methods, being user-friendly, would enhance the application of robots in human daily environments.
Robot Programming by demonstration (PbD) has by now become a central topic of robotics that spans across general research areas such as human-robot interaction, machine learning, machine vision and motor control. For reviews of current advances in the field, see [References].
Interfaces
Means of teaching the robot include Several interfaces can be used to teach the robot:
- Kinesthetic Teaching consists in guiding the robot through the task's steps by moving passively its limbs
- Motion Sensors worn by the trainer allows the robot to track the motion of each of the user's limbs. The method
using motion sensors and having the robot mirroring the motion on the fly.
Vocal commands can in some cases be used to help direct the robot's attention to some aspect of the task [Dillmann].
Teaching can be either done in batch, i.e. by showing all the demonstrations at once and then get the robot to replicate a "generalized" version of the demonstration or incrementally, by showing the robot
Dynamical Systems's Approach
To be done
Symbolic Approach
To be done
Scaffolding, Moulding and Incremental Teaching
To be done
References
- Reviews Papers
Links
Related Reviews
Related Laboratories
- item1
- item1
- item1
Source Codes
- Learning a dynamical system [Author:Mohammad Khansari]
- Encoding a N-Dim. Trajectory in a Gaussian Mixture Model and retrieving a Generalized version [Author:Sylvain Calinon]
| Invited by: | Dr. Jan Peters, Max-Planck Institute, Germany & University of Southern California, USC |
