Haan, H. de (2013) Neural-Fitted Temporal Difference Learning to Learn to Play Connect Four. Bachelor's Thesis, Artificial Intelligence.
|
Text
AI_BA_2013_HEINDEHAAN.pdf - Published Version Download (303kB) | Preview |
|
Text
HaanDeAkkoordWiering.pdf - Other Restricted to Registered users only Download (25kB) |
Abstract
In this thesis, neural-fitted temporal difference learning, a form of reinforcement learning, is used to learn to play the game of Connect Four. Seven different artificial players are compared, using five different neural networks. While the first network only uses the basic board state as input, the larger ones also use specific features: rows of two, three and four in the different rows of the board state. It is shown that these features dramatically improve the performance of the agent. Furthermore, two different exploration strategies are used: Boltzmann with constant temperature and epsilon-greedy. The results show that epsilon-greedy gives the most stable result. Finally, the smallest network was given the same number of hidden nodes as the largest network, showing that adding hidden nodes does not improve the score of the system.
Item Type: | Thesis (Bachelor's Thesis) |
---|---|
Supervisor name: | Wiering, M.A. |
Degree programme: | Artificial Intelligence |
Thesis type: | Bachelor's Thesis |
Language: | English |
Date Deposited: | 15 Feb 2018 07:54 |
Last Modified: | 02 May 2019 11:24 |
URI: | https://fse.studenttheses.ub.rug.nl/id/eprint/11267 |
Actions (login required)
View Item |