
Davide Maran
Graduate student @ Politecnico di Milano
adversarial
rl
reinforcement
learning
lipschitz
regret
configurable mdp
2
presentations
SHORT BIO
PhD student, current working on smoothness in RL, delayed RL and bandits.
Presentations

Online Markov Decision Processes Configuration with Continuous Decision Space
Davide Maran and 5 other authors

Tight Performance Guarantees of Imitator Policies with Continuous Actions
Davide Maran and 2 other authors