On the model-based stochastic value gradient for continuous reinforcement learning

Brandon Amos, Samuel Stanton, Denis Yarats, Andrew Gordon Wilson

Research output: Contribution to journal, Conference article, peer-review
