VIDEO DOI: https://doi.org/10.48448/q9x9-mc69
PAPER DOI: Reinforcement Learning, Learning, Safe Policy Improvement,

technical paper

AAMAS 2020

May 11, 2020

Live on Underline

Safe Policy Improvement with an Estimated Baseline Policy

Please log in to leave a comment

Downloads

SlidesTranscript English (automatic)

Next from AAMAS 2020

Viral Vs. Effective: Utility Based Influence Maximization
technical paper

Viral Vs. Effective: Utility Based Influence Maximization

AAMAS 2020

Noam HazonAmos AzariaYael Sabato
Yael Sabato and 2 other authors

11 May 2020

Similar lecture

OPT-GAN: A Broad-Spectrum Global Optimizer for Black-box Problems by Learning Distribution
poster

OPT-GAN: A Broad-Spectrum Global Optimizer for Black-box Problems by Learning Distribution

AAAI 2023

+4Minfang LuShuangrong LiuLin Wang
Lin Wang and 6 other authors

11 February 2023

Stay up to date with the latest Underline news!

Select topic of interest (you can select more than one)

PRESENTATIONS

  • All Lectures
  • For Librarians
  • Resource Center
  • Free Trial
Underline Science, Inc.
1216 Broadway, 2nd Floor, New York, NY 10001, USA

© 2023 Underline - All rights reserved