Provably Efficient Third-Person Imitation from Offline Observation

Aaron Zweig, Joan Bruna

Research output: Contribution to journalConference articlepeer-review

Abstract

Domain adaptation in imitation learning represents an essential step towards improving generalizability. However, even in the restricted setting of third-person imitation where transfer is between isomorphic Markov Decision Processes, there are no strong guarantees on the performance of transferred policies. We present problem-dependent, statistical learning guarantees for third-person imitation from observation in an offline setting, and a lower bound on performance in an online setting.

Original languageEnglish (US)
Pages (from-to)1228-1237
Number of pages10
JournalProceedings of Machine Learning Research
Volume124
StatePublished - 2020
Event36th Conference on Uncertainty in Artificial Intelligence, UAI 2020 - Virtual, Online
Duration: Aug 3 2020Aug 6 2020

ASJC Scopus subject areas

  • Artificial Intelligence
  • Software
  • Control and Systems Engineering
  • Statistics and Probability

Fingerprint

Dive into the research topics of 'Provably Efficient Third-Person Imitation from Offline Observation'. Together they form a unique fingerprint.

Cite this