Abdul Aziz, Amir (2024) Safe Deep Reinforcement Learning for Super Mario Bros using Heuristics. Bachelor's Thesis, Artificial Intelligence.
|
Text
bAI2024AbdulAzizA.pdf Download (759kB) | Preview |
|
Text
Abdul Aziz - ja.pdf Restricted to Registered users only Download (178kB) |
Abstract
This thesis explores the integration of heuristic safety mechanisms within Deep Reinforcement Learning (Deep RL) frameworks to enhance agent survival in the complex, dynamic environment of Super Mario Bros. We examine the influence of heuristic-augmented Actor-Critic models on agent behavior, particularly in terms of safety without compromising learning efficiency. The study compares various configurations, including separate and shared neural network architectures and different combinations of frame stacks for the input layer. It quantitatively measures survival rates, highlighting a significant improvement in avoiding fatal encounters with the most effective configuration around the 10000th episode mark. However, these safety mechanisms may also constrain exploration and long-term reward optimization. Despite achieving a stable survival pattern and a consistent minimum mean loss, a tension between safety and reward maximization is evident. This work contributes to the domain of Safe RL, emphasizing the nuanced trade-offs in stochastic gaming environments.
Item Type: | Thesis (Bachelor's Thesis) |
---|---|
Supervisor name: | Cardenas Cartagena, J. D. |
Degree programme: | Artificial Intelligence |
Thesis type: | Bachelor's Thesis |
Language: | English |
Date Deposited: | 03 May 2024 09:19 |
Last Modified: | 03 May 2024 11:06 |
URI: | https://fse.studenttheses.ub.rug.nl/id/eprint/32359 |
Actions (login required)
View Item |