UNDERLINE DOI: https://doi.org/10.48448/kksk-eb63
PAPER DOI: https://doi.org/10.1609/aaai.v38i12.29218
technical paper
Relaxed Stationary Distribution Correction Estimation for Improved Offline Policy Optimization
Would you like to see your presentation here, made available to a global audience of researchers?
Add your own presentation or have us affordably record your next conference.
Please log in to leave a comment
There is a typo in the poster, where e_nu(s,a) should be r(s,a) + gamma T(s'|s,a)nu(s') - nu(s)
0
