Vidharth, Hari (2021) Accelerating Multi-Goal Reinforcement Learning by Learning from Demonstrations for Robotic Manipulation Tasks. Master's Thesis / Essay, Artificial Intelligence.
|
Text
mAI_2021_HariVidharth.pdf Download (6MB) | Preview |
|
Text
toestemming.pdf Restricted to Registered users only Download (97kB) |
Abstract
Reinforcement Learning in complex goal-based robotics environments which involves interaction, manipulation and sparse rewards has always remained a challenge. Exploration is a large part of reinforcement learning which uses a big chunk of the training time. Random exploration cannot solve complex manipulation tasks and a well-engineered reward function can result in sub-optimal performance. This work investigates the combination of behaviour cloning and reinforcement learning to accelerate the training of an autonomous agent without compromising on performance and generalization in the Fetch Robotics Environments. Behaviour cloning is the simplest form of imitation learning where the agent uses expert demonstrations to learn actions for performing tasks. Humans often use this form of imitation by observing others better at a given task and ap- plying similar methods to achieve the same goal. The same principle is extended and combined with the reinforcement learning agent. This paper proposes a form of behaviour cloning to over- come this exploration phase and perform more meaningful actions early on, along with a simple yet informative reward scheme to accelerate learning. This proposed method achieves consider- able speedup compared to previous baselines using a lesser number of demonstrations containing success, failure and noisy examples. The agent is not restricted by these demonstrations. It can outperform the demonstrator while developing new behaviours.
Item Type: | Thesis (Master's Thesis / Essay) |
---|---|
Supervisor name: | Schomaker, L.R.B. and Mohades Kasaei, S.H. |
Degree programme: | Artificial Intelligence |
Thesis type: | Master's Thesis / Essay |
Language: | English |
Date Deposited: | 30 Nov 2021 10:00 |
Last Modified: | 30 Nov 2021 10:00 |
URI: | https://fse.studenttheses.ub.rug.nl/id/eprint/26326 |
Actions (login required)
View Item |