VIDEO DOI: https://doi.org/10.48448/3n3h-5q89
PAPER DOI: Reinforcement learning; Sample-based planning; AlphaZero; MCTS

technical paper

AAMAS 2020

May 09, 2020

Live on Underline

Value targets in off-policy AlphaZero: a new greedy backup

Please log in to leave a comment

Downloads

SlidesTranscript English (automatic)

Next from AAMAS 2020

Trajectory Modelling in Shared Spaces: Expert-Based vs. Deep Learning Approach?
technical paper

Trajectory Modelling in Shared Spaces: Expert-Based vs. Deep Learning Approach?

AAMAS 2020

+1Fatema Tuj JohoraHao Cheng
Hao Cheng and 3 other authors

09 May 2020

Similar lecture

ContraFeat: Contrasting Deep Features for Semantic Discovery
poster

ContraFeat: Contrasting Deep Features for Semantic Discovery

AAAI 2023

Xinqi Zhu and 2 other authors

10 February 2023

Stay up to date with the latest Underline news!

Select topic of interest (you can select more than one)

PRESENTATIONS

  • All Lectures
  • For Librarians
  • Resource Center
  • Free Trial
Underline Science, Inc.
1216 Broadway, 2nd Floor, New York, NY 10001, USA

© 2023 Underline - All rights reserved