Addressing Sample Efficiency and Model-bias in Model-based Reinforcement Learning

Akhil S. Anand, Jens Erik Kveen, Fares Abu-Dakka, Esten Ingar Grøtli, Jan Tommy Gravdahl

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Model-based reinforcement learning promises to be an effective way to bring reinforcement learning to real-world robotic systems by offering a sample efficient learning approach compared to model-free reinforcement learning. However, model-based reinforcement learning approaches at present struggle to match the performance of model-free ones. This work attempts to fill this gap by improving the performance of model-based reinforcement learning while further improving its sample efficiency. To improve the sample efficiency, an exploration strategy is formulated which maximizes the information gain. The asymptotic performance is improved by compensating for the model-bias using a model-free critic. We have evaluated our proposed approach on four reinforcement learning benchmarking tasks in the openAI gym framework.

Original languageEnglish (US)
Title of host publicationProceedings - 21st IEEE International Conference on Machine Learning and Applications, ICMLA 2022
EditorsM. Arif Wani, Mehmed Kantardzic, Vasile Palade, Daniel Neagu, Longzhi Yang, Kit-Yan Chan
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1-6
Number of pages6
ISBN (Electronic)9781665462839
DOIs
StatePublished - 2022
Event21st IEEE International Conference on Machine Learning and Applications, ICMLA 2022 - Nassau, Bahamas
Duration: Dec 12 2022Dec 14 2022

Publication series

NameProceedings - 21st IEEE International Conference on Machine Learning and Applications, ICMLA 2022

Conference

Conference21st IEEE International Conference on Machine Learning and Applications, ICMLA 2022
Country/TerritoryBahamas
CityNassau
Period12/12/2212/14/22

Keywords

  • Model based reinforcement learning
  • model predictive control
  • sample efficient learning

ASJC Scopus subject areas

  • Computer Vision and Pattern Recognition
  • Computer Science Applications
  • Artificial Intelligence
  • Hardware and Architecture

Fingerprint

Dive into the research topics of 'Addressing Sample Efficiency and Model-bias in Model-based Reinforcement Learning'. Together they form a unique fingerprint.

Cite this