Roland Dubb is a MSc candidate in applied mathematics at Shocklab, at the University of Cape Town. For his MSc, he is researching the empirical performance evaluations of reinforcement learning algorithms, under the supervision of Assoc. Prof. Jonathan Shock.