Socially guided intrinsic motivation for robot learning of motor skills

Bibliographic Details
Published in: Autonomous Robots Vol. 36; no. 3; pp. 273-294
Main Authors: Nguyen, Sao Mai, Oudeyer, Pierre-Yves
Format: Journal Article
Language: English
Published: Boston Springer US 01.03.2014
Springer Nature B.V
Springer Verlag
ISSN: 0929-5593, 1573-7527
Description
Summary: This paper presents a technical approach to robot learning of motor skills that combines active, intrinsically motivated learning with imitation learning. Our algorithmic architecture, called SGIM-D, allows robots to learn high-dimensional continuous sensorimotor inverse models efficiently; in particular, it learns distributions of parameterised motor policies that solve a corresponding distribution of parameterised goals/tasks. This is made possible by integrating imitation learning techniques into an algorithm for learning inverse models that relies on active goal babbling. After reviewing social learning and intrinsic motivation approaches to action learning, we describe the general framework of our algorithm and then detail its architecture. In an experiment where a robot arm has to learn to use a flexible fishing line, we show that SGIM-D efficiently combines the advantages of social learning and intrinsic motivation, and benefits from the properties of human demonstrations, to learn how to produce varied outcomes in the environment while developing more precise control policies in large spaces.
DOI: 10.1007/s10514-013-9339-y
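
Illustration: The summary describes interleaving imitation of demonstrations with goal babbling to learn an inverse model. The Python sketch below is a toy illustration of that general idea only, not the SGIM-D algorithm from the paper. Everything in it is an assumption made for the example: the two-link planar arm standing in for the robot, the random `teacher_demo` function, the nearest-neighbour inverse model, and the names and parameters (`learn`, `demo_period`, `sigma`). In particular, goals are sampled uniformly here, which replaces the interest-driven (active) goal selection mentioned in the summary.

```python
import math
import random


def forward(theta):
    """Toy environment (assumption): 2-link planar arm with unit-length links.

    Maps policy parameters (two joint angles) to an outcome (hand position).
    """
    a, b = theta
    return (math.cos(a) + math.cos(a + b), math.sin(a) + math.sin(a + b))


def dist(p, q):
    """Euclidean distance between two 2-D points."""
    return math.hypot(p[0] - q[0], p[1] - q[1])


def perturb(theta, sigma, rng):
    """Add Gaussian exploration noise to each policy parameter."""
    return tuple(t + rng.gauss(0.0, sigma) for t in theta)


def nearest(memory, goal):
    """Nearest-neighbour inverse model: the stored (policy, outcome) pair
    whose outcome lies closest to the requested goal."""
    return min(memory, key=lambda pair: dist(pair[1], goal))


def teacher_demo(rng):
    """Hypothetical teacher: demonstrates a random policy and its outcome."""
    theta = (rng.uniform(-math.pi, math.pi), rng.uniform(-math.pi, math.pi))
    return theta, forward(theta)


def learn(episodes=300, demo_period=10, sigma=0.2, seed=0):
    """Interleave imitation of demonstrations with goal babbling."""
    rng = random.Random(seed)
    start = (0.0, 0.0)
    memory = [(start, forward(start))]
    for t in range(episodes):
        if t % demo_period == 0:
            # Social step: reproduce the demonstrated policy with small noise.
            demo_theta, _ = teacher_demo(rng)
            theta = perturb(demo_theta, sigma / 4, rng)
        else:
            # Goal babbling step: pick a goal uniformly in the reachable disk
            # (simplification: no interest measure), then explore around the
            # best known policy for that goal.
            radius = 2.0 * math.sqrt(rng.random())
            angle = rng.uniform(0.0, 2.0 * math.pi)
            goal = (radius * math.cos(angle), radius * math.sin(angle))
            best_theta, _ = nearest(memory, goal)
            theta = perturb(best_theta, sigma, rng)
        memory.append((theta, forward(theta)))
    return memory


def reach_error(memory, goal):
    """Distance between a goal and the closest outcome reached so far."""
    return dist(nearest(memory, goal)[1], goal)
```

Calling `learn()` returns the list of (policy, outcome) pairs collected so far, and `reach_error(memory, goal)` gives a rough measure of how well a given goal can currently be reached. Swapping the uniform goal sampler for one driven by measured competence progress would bring the sketch closer to the active goal selection the summary refers to.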