Model-Based Multi-Objective Reinforcement Learning

Withagen, M.L.C. (2014) Model-Based Multi-Objective Reinforcement Learning. Bachelor's Thesis, Artificial Intelligence.

Preview

Text
AI_BA_2014_Withagen.pdf - Published Version
Download (273kB) | Preview

Text
akkoord_WithagenMLC.pdf - Other
Restricted to Registered users only
Download (99kB)

Abstract

This thesis describes a novel multi-objective reinforcement learning algorithm. The proposed algorithm first learns a model of the multi-objective sequential decision making problem, after which this learned model is used by a multi-objective dynamic programming method to compute Pareto optimal policies. The advantage of this model-based multi-objective reinforcement learning method is that once an accurate model has been estimated from the experiences of an agent in some environment, the dynamic programming method will compute all Pareto optimal policies. Therefore it is important that the agent explores the environment in an intelligent way by using a good exploration strategy. In this paper we have supplied the agent with two different exploration strategies and compare their effectiveness in estimating accurate models within a reasonable amount of time. The experimental results show that our method with the best exploration strategy is able to quickly learn all Pareto optimal policies for the Deep Sea Treasure problem.

Item Type:	Thesis (Bachelor's Thesis)
Supervisor name:	Wiering, M. and Drugan, M.M.
Degree programme:	Artificial Intelligence
Thesis type:	Bachelor's Thesis
Language:	English
Date Deposited:	15 Feb 2018 07:58
Last Modified:	02 May 2019 11:22
URI:	https://fse.studenttheses.ub.rug.nl/id/eprint/12035

Actions (login required)

View Item