Efficient entropy for policy gradient with multi-dimensional action space

Yiming Zhang, Quan Ho Vuong, Kenny Song, Xiao Yue Gong, Keith W. Ross

    Research output: Contribution to conferencePaperpeer-review

    Fingerprint

    Dive into the research topics of 'Efficient entropy for policy gradient with multi-dimensional action space'. Together they form a unique fingerprint.