Reinforcement Learning with Dual-Observation for General Video Game Playing

Chengpeng Hu, Ziqi Wang, Tianye Shu, Hao Tong, Julian Togelius, Xin Yao, Jialin Liu

    Research output: Contribution to journal › Article › peer-review

    Abstract

    Reinforcement learning algorithms have performed well in playing challenging board and video games. More and more studies focus on improving the generalisation ability of reinforcement learning algorithms. The GVGAI Learning Competition aims to develop agents capable of learning to play different game levels that were unseen during training. This paper summarises the editions of the GVGAI Learning Competition held over five years. In each edition, three new games were designed. The training and test levels were designed separately in the first three editions. Since 2020, three test levels of each game have been generated by perturbing or combining two training levels. We then present a novel reinforcement learning technique with dual-observation for general video game playing, based on the assumption that similar local information is more likely than similar global information to be observed across different levels. Instead of directly taking a single, raw pixel-based screenshot of the current game screen as input, our proposed general technique takes the encoded, transformed global and local observations of the game screen as two simultaneous inputs, aiming at learning local information for playing new levels. Our proposed technique is implemented with three state-of-the-art reinforcement learning algorithms and tested on the game set of the 2020 GVGAI Learning Competition. Ablation studies show the outstanding performance of using encoded, transformed dual observations as input.
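
The abstract does not specify how the screen is encoded or how the local observation is derived. As a minimal sketch of the dual-observation idea only, assume the screen has already been encoded as a grid of integer tile codes and that the local observation is a square window centred on the avatar. The names `local_observation` and `dual_observation`, the `radius` and `pad` parameters, and the tile codes are all hypothetical and are not taken from the paper.

```python
from typing import List, Tuple

Grid = List[List[int]]


def local_observation(grid: Grid, avatar: Tuple[int, int],
                      radius: int, pad: int = 0) -> Grid:
    """Crop a (2*radius+1) x (2*radius+1) window centred on the avatar.

    Cells that fall outside the level are filled with `pad`
    (an assumed convention, not the paper's).
    """
    rows, cols = len(grid), len(grid[0])
    avatar_row, avatar_col = avatar
    window = []
    for r in range(avatar_row - radius, avatar_row + radius + 1):
        row = []
        for c in range(avatar_col - radius, avatar_col + radius + 1):
            inside = 0 <= r < rows and 0 <= c < cols
            row.append(grid[r][c] if inside else pad)
        window.append(row)
    return window


def dual_observation(grid: Grid, avatar: Tuple[int, int],
                     radius: int = 1) -> Tuple[Grid, Grid]:
    """Return the (global, local) pair that would be fed to the agent together."""
    return grid, local_observation(grid, avatar, radius)


# Hypothetical 3x4 level: 0 = floor, 1 = wall, 2 = avatar, 3 = goal.
level = [
    [1, 1, 1, 1],
    [1, 2, 0, 3],
    [1, 1, 1, 1],
]
global_obs, local_obs = dual_observation(level, avatar=(1, 1), radius=1)
```

Under these assumptions, the local window has the same shape whatever the level size, which is one way to read the abstract's point that local information is more likely to recur across levels than global information.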

    Original language: English (US)
    Journal: IEEE Transactions on Games
    State: Accepted/In press - 2022

    Keywords

    • artificial intelligence
    • Atari
    • Avatars
    • Encoding
    • Games
    • General video game playing
    • GVGAI
    • Reinforcement learning
    • Task analysis
    • Training
    • Training data

    ASJC Scopus subject areas

    • Software
    • Control and Systems Engineering
    • Artificial Intelligence
    • Electrical and Electronic Engineering
