Publications

Entropic Regularization of Markov Decision Processes
f-Divergence constrained policy improvement
Catching heuristics are optimal control policies
Best poster award (2nd place) at CMCW 2017