Cooperative Q-learning based on learning automata

Bibliographic Details
Published in: 2009 IEEE International Conference on Automation and Logistics, pp. 1973-1978
Main Authors: Mao Yang, Yantao Tian, Xinyue Qi
Format: Conference Proceeding
Language: English
Published: IEEE 01.08.2009
ISBN: 9781424447947, 1424447941
ISSN: 2161-8151
Description
Summary: The theory of learning automata has already been applied to reinforcement learning in single-agent, single-stage settings. This paper proposes a multi-robot cooperative Q-learning algorithm based on learning automata. Each robot continually updates its action-selection probabilities through the learning automata and then converts those probabilities into special experience. The robots accelerate learning by sharing this experience with one another. Simulation experiments verify the effectiveness of the algorithm.
DOI: 10.1109/ICAL.2009.5262629
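
Illustrative sketch: The summary names three ingredients: a learning-automaton update of action-selection probabilities, Q-learning, and experience sharing among robots. This record does not give the paper's actual update rules, how probabilities are converted to "special experience", or the sharing scheme. The Python sketch below is therefore only a rough illustration under stated assumptions: it uses the standard linear reward-inaction automaton and the textbook tabular Q-learning update, and the averaging step in `share_experience` is a hypothetical placeholder, not the authors' method. All function and parameter names are invented for this example.

```python
def lri_update(probs, chosen, reward, rate=0.1):
    """Linear reward-inaction update (standard scheme, assumed here).

    `reward` is in [0, 1]. Probability mass moves toward the chosen
    action in proportion to the reward; a zero reward changes nothing.
    """
    return [
        p + rate * reward * (1.0 - p) if i == chosen else p - rate * reward * p
        for i, p in enumerate(probs)
    ]


def q_update(q, state, action, reward, next_state, alpha=0.5, gamma=0.9):
    """Textbook tabular Q-learning update, applied in place to `q`."""
    best_next = max(q[next_state])
    q[state][action] += alpha * (reward + gamma * best_next - q[state][action])


def share_experience(prob_tables):
    """Hypothetical sharing step: average each action's probability
    across robots. The paper's real sharing rule is not in this record."""
    n_robots = len(prob_tables)
    n_actions = len(prob_tables[0])
    return [
        sum(table[a] for table in prob_tables) / n_robots
        for a in range(n_actions)
    ]


if __name__ == "__main__":
    # One robot, four actions, uniform start; action 2 is fully rewarded.
    probs = lri_update([0.25, 0.25, 0.25, 0.25], chosen=2, reward=1.0)
    print(probs)

    # Two states, two actions; update Q(0, 1) after moving to state 1.
    q = [[0.0, 0.0], [1.0, 2.0]]
    q_update(q, state=0, action=1, reward=1.0, next_state=1)
    print(q)
```

The reward-inaction rule keeps the probabilities summing to one, because the mass added to the chosen action equals the mass removed from the others.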