VIDEO DOI: https://doi.org/10.48448/m8dd-9m74
PAPER DOI: https://doi.org/10.1609/aaai.v38i10.28973

technical paper

Regret Analysis of Policy Gradient Algorithm for Infinite Horizon Average Reward Markov Decision Processes

Please log in to leave a comment

Downloads

SlidesPaperTranscript English (automatic)

Next from AAAI 2024

E2E-AT: A Unified Framework for Tackling Uncertainty in Task-Aware End-to-End Learning | VIDEO
technical paper

E2E-AT: A Unified Framework for Tackling Uncertainty in Task-Aware End-to-End Learning | VIDEO

AAAI 2024

Wangkun Xu
Wangkun Xu and 2 other authors

23 February 2024

Stay up to date with the latest Underline news!

Select topic of interest (you can select more than one)

PRESENTATIONS

  • All Lectures
  • For Librarians
  • Resource Center
  • Free Trial
Underline Science, Inc.
1216 Broadway, 2nd Floor, New York, NY 10001, USA

© 2023 Underline - All rights reserved