Barbieri, Davide (2019) Baum-Welch Algorithm for Bottom Up Option Learning by Imitation. Bachelor's Thesis, Artificial Intelligence.
|
Text
AI_BA_2019_DAVIDEBARBIERI.pdf Download (757kB) | Preview |
|
Text
Toestemming.pdf Restricted to Registered users only Download (139kB) |
Abstract
The Baum-Welch algorithm (Jelinek, Bahl, and Mercer, 1975) was developed to fit a hidden Markov model that maximizes the likelihood of a given sequence of observations (with a given number of hidden states). Fox, Krishnan, Stoica, and Goldberg (2017) proposed to look at option learning by imitation as solving a hidden Markov model, where the observations are primitives in a given solution trajectory, and the options are hidden probability distributions over primitives generating such trajectory. This paper adapts the Baum-Welch algorithm to fit the above idea by accounting for the differences between an option hierarchy and a hidden Markov model. The results seem to suggest that the architecture is capable of learning by imitation a series of static options. It is also able to exploit them to solve previously unseen tasks, similar to the ones solved by the imitation data.
Item Type: | Thesis (Bachelor's Thesis) |
---|---|
Supervisor name: | Wiering, M.A. |
Degree programme: | Artificial Intelligence |
Thesis type: | Bachelor's Thesis |
Language: | English |
Date Deposited: | 28 Aug 2019 |
Last Modified: | 11 Sep 2019 06:52 |
URI: | https://fse.studenttheses.ub.rug.nl/id/eprint/20815 |
Actions (login required)
View Item |