A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning

Christoph Dann, Mehryar Mohri, Tong Zhang, Julian Zimmert

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Fingerprint

Dive into the research topics of 'A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning'. Together they form a unique fingerprint.

Keyphrases

Mathematics