
Raghuram Bharadwaj Diddigi
Indian Institute of Science, Bengalore, India
reinforcement learning
off-policy prediction
stochastic approximation
1
presentations
SHORT BIO
Raghuram Bharadwaj is a Ph.D. student in the Department of Computer Science and Automation, Indian Institute of Science, India, under the guidance of Prof. Shalabh Bhatnagar.
Presentations

A Convergent Off-Policy Temporal Difference Algorithm
Raghuram Bharadwaj Diddigi