Neural-Fitted Temporal Difference Learning to Learn to Play Connect Four

Haan, H. de (2013) Neural-Fitted Temporal Difference Learning to Learn to Play Connect Four. Bachelor's Thesis, Artificial Intelligence.

Preview

Text
AI_BA_2013_HEINDEHAAN.pdf - Published Version
Download (303kB) | Preview

Text
HaanDeAkkoordWiering.pdf - Other
Restricted to Registered users only
Download (25kB)

Abstract

In this thesis, neural-fitted temporal difference learning, a form of reinforcement learning, is used to learn to play the game of Connect Four. Seven different artificial players are compared, using five different neural networks. While the first network only uses the basic board state as input, the larger ones also use specific features: rows of two, three and four in the different rows of the board state. It is shown that these features dramatically improve the performance of the agent. Furthermore, two different exploration strategies are used: Boltzmann with constant temperature and epsilon-greedy. The results show that epsilon-greedy gives the most stable result. Finally, the smallest network was given the same number of hidden nodes as the largest network, showing that adding hidden nodes does not improve the score of the system.

Item Type:	Thesis (Bachelor's Thesis)
Supervisor name:	Wiering, M.A.
Degree programme:	Artificial Intelligence
Thesis type:	Bachelor's Thesis
Language:	English
Date Deposited:	15 Feb 2018 07:54
Last Modified:	02 May 2019 11:24
URI:	https://fse.studenttheses.ub.rug.nl/id/eprint/11267

Actions (login required)

View Item