Learning to score behaviors for guided policy optimization

Aldo Pacchiano, Jack Parker-Holder, Yunhao Tang, Anna Choromanska, Krzysztof Choromanski, Michael I. Jordan

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Fingerprint

Dive into the research topics of 'Learning to score behaviors for guided policy optimization'. Together they form a unique fingerprint.

Computer Science

Mathematics

Keyphrases