Doubly Robust Off-Policy Value and Gradient Estimation for Deterministic Policies
Nathan Kallus, Masatoshi Uehara*
June 2020
Type: Report