Reinforcement learning in the Minecraft gaming environment

Reynard, Matthew

Reinforcement learning in the Minecraft gaming environment

dc.contributor.advisor	Engelbrecht, H. A.	en_ZA
dc.contributor.advisor	Kamper, M. J.	en_ZA
dc.contributor.advisor	Rosman, Benjamin	en_ZA
dc.contributor.author	Reynard, Matthew	en_ZA
dc.contributor.other	Stellenbosch University. Faculty of Engineering. Dept. of Electrical and Electronic Engineering.	en_ZA
dc.date.accessioned	2020-02-25T12:42:24Z	en_ZA
dc.date.accessioned	2020-04-28T12:07:00Z	en_ZA
dc.date.available	2020-02-25T12:42:24Z	en_ZA
dc.date.available	2020-04-28T12:07:00Z	en_ZA
dc.date.issued	2020-03	en_ZA
dc.description	Thesis (MEng)--Stellenbosch University, 2020.	en_ZA
dc.description.abstract	ENGLISH ABSTRACT: With the long-term goal of surviving the night in Minecraft, we ask whether a reinforcement learning agent learns better by first learning the skills to perform smaller tasks in a complex environment or by learning the skills in the complex environment from the start. This is investigated empirically in a non-trivial game environment. We use the premise of curriculum learning where an agent learns different skills in independent and isolated sub-environments referred to as dojos. The skills learned in the dojos are then used as different actions as the agent decides which skill to perform that best applies to the current game state. We evaluate this with experiments conducted in the Minecraft gaming environment. We find that our approach of Dojo learning is able to achieve better performance with faster training time in certain environments. The main benefit of this approach is that the reward functions can be finely tuned in the dojos for each action as compared to the traditional methods. However, the skills learned in the individual dojos become the limiting factor in performance as the agent is unable to combine these skills effectively when put in certain complex environments. This can be mitigated if the dojo modules are further trained to achieve similar results as a standard deep Q network.	en_ZA
dc.description.abstract	AFRIKAANSE OPSOMMING: Met die langtermyndoel om ’n nag in Minecraft te oorleef, vra ons of versterkingsleer beter leer deur eers die vaardighede aan te leer om kleiner take in ’n komplekse omgewing uit te voer of deur die vaardighede in die komplekse omgewing aan te leer. Dit word in ’n uitdagende spelomgewing ondersoek. Ons gebruik kurrikulumleer waar ’n agent verskillende vaardighede aanleer in onafhanklike en geïsoleerde sub-omgewings waarna as dojos verwys word. Die vaardighede wat in die dojos aangeleer word, word dan as verskillende aksies gebruik aangesien die agent besluit watter vaardighede hy moet uitvoer wat die beste van toepassing is op die huidige speltoestand. Ons evalueer dit eksperimenteel in die Minecraft-spelomgewing. Ons vind dat ons benadering van Dojo-leer beter vaar met ’n vinniger opleidingstyd in sekere omgewings. Die belangrikste voordeel van hierdie benadering is dat die beloningsfunksies in die dojo’s vir elke aksie fyn ingestel kan word in vergelyking met die tradisionele metodes. Die vaardighede wat in die individuele dojos aangeleer word, word egter die beperkende faktor aangesien die agent nie in staat is om hierdie vaardighede effektief te kombineer as dit in sekere komplekse omgewings geplaas word nie. Dit kan versag word as die dojo-modules verder afgerig word om soortgelyke resultate te lewer as ’n standaard diep Q-netwerk .	af_ZA
dc.description.version	Masters	en_ZA
dc.format.extent	ix, 65 leaves : illustrations (some color)	en_ZA
dc.identifier.uri	http://hdl.handle.net/10019.1/107868	en_ZA
dc.language.iso	en_ZA	en_ZA
dc.publisher	Stellenbosch : Stellenbosch University	en_ZA
dc.rights.holder	Stellenbosch University	en_ZA
dc.subject	Reinforcement learning -- Mathematical models	en_ZA
dc.subject	Machine learning	en_ZA
dc.subject	Minecraft (Game)	en_ZA
dc.subject	Dojos	en_ZA
dc.subject	UCTD	en_ZA
dc.title	Reinforcement learning in the Minecraft gaming environment	en_ZA
dc.type	Thesis	en_ZA

Files

Original bundle

Now showing 1 - 1 of 1

Name:: reynard_reinforcement_2020.pdf
Size:: 6.07 MB
Format:: Adobe Portable Document Format
Description:

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Plain Text
Description:

Download

Collections

Masters Degrees (Electrical and Electronic Engineering)