Shangtong Zhang

reinforcement learning

ml

actor critic

discount factor

SHORT BIO

Shangtong Zhang is a DPhil student at the University of Oxford working with Prof. Shimon Whiteson. His research aims to solve sequential decision-making problems in a scalable and reliable way. He currently focuses on off-policy and offline reinforcement learning as solution methods. His research has been published at venues such as ICML, NeurIPS, and AAAI, and his work won the best paper award at AAMAS. He regularly serves as a reviewer for major AI conferences such as NeurIPS, ICML, ICLR, and AAAI, and for several workshops, including the NeurIPS deep RL and offline RL workshops and the ICML RL for real-world workshops. During his DPhil, he spent time at Microsoft Research Montreal and with the AlphaStar team at DeepMind London. Please visit https://shangtongzhang.github.io for more information.

Presentations

A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms

Shangtong Zhang
