Effectiveness of Connectionist Q-learning Strategies on Agent Performance in Asteroids

Mallon, Sjors and Meima, Niels (2018) Effectiveness of Connectionist Q-learning Strategies on Agent Performance in Asteroids. Bachelor's Thesis, Artificial Intelligence.

Preview

Text
AI_BA_2018_SJORSMALLON.pdf
Download (604kB) | Preview

Text
toestemming.pdf
Restricted to Registered users only
Download (98kB)

Abstract

This research proposes a higher-order state extraction algorithm serving as input for neural networks to learn to play the Atari game Asteroids. Asteroids is a 1980's space shooter, and poses a challenging environment due to its continuous and stochastic nature. Learning capabilities of the reinforcement learning algorithms Q-learning, Q-learning combined with a target network, Double Q-learning, QV-learning and QVMAX-learning are compared at a constant difficulty level, both using online learning and experience replay. Q-learning combined with a target network achieved the highest win rate of 0.76, in both the online and experience replay setting. Furthermore, the influence of incremental learning on agent performance is compared to learning at a constant difficulty. Incremental learning did not show a significant improvement in performance. Finally, state modeling in combination with Monte Carlo rollouts is used to learn from predictions about the future. Results show that learning from predictions is ineffective in its current implementation. The agent effectively learns to play the game Asteroids using the higher-order state extraction algorithm in combination with the described reinforcement learning algorithms.

Item Type:	Thesis (Bachelor's Thesis)
Supervisor name:	Wiering, M.A.
Degree programme:	Artificial Intelligence
Thesis type:	Bachelor's Thesis
Language:	English
Date Deposited:	29 Jul 2018
Last Modified:	30 Jul 2018 13:56
URI:	https://fse.studenttheses.ub.rug.nl/id/eprint/18117

Actions (login required)

View Item