Ibrahim, Tarek (2022)
Master's thesis
Reinforcement learning policies often need to be trained in simulations of real environments, since training directly on real agents can be either infeasible or expensive. When transferring those trained policies ...