
1
presentations
SHORT BIO
I am a master's student in software Engineering at the University of Science and Technology of China. My research direction is reinforcement learning.
Presentations

NondBREM: Nondeterministic Offline Reinforcement Learning for Large-Scale Order Dispatching
Hongbo Zhang and 6 other authors