
Daniel Willemsen
Delft University of Technology, Centrum Wiskunde & Informatica
Short bio
Daniel Willemsen is currently a master's student at the Delft University of Technology. The research presented here was performed at Centrum Wiskunde & Informatica, where he worked on the interaction between planning and reinforcement learning, as seen in the AlphaZero line of algorithms.
Presentations

Value targets in off-policy AlphaZero: a new greedy backup
Daniel Willemsen and 2 other authors