Statistically Efficient Off-Policy Policy Gradients

Publication
In ICML 2020