GiGAN: Gate in GAN, could gate mechanism filter the features in image-to-image translation?

Xuan Nie, Jianchao Jia, Haoxuan Ding, Edward K. Wong

    Research output: Contribution to journalArticlepeer-review

    Abstract

    Image-to-image translation techniques have been used in many different fields and have obtained remarkable performance in recent years. However, in many image-to-image translation tasks, only certain parts of the image need to be converted instead of the whole image. Traditional GAN-based methods often reconstruct the entire image, which may lead to artifacts and low-quality results. To address this issue, we propose a novel model, named GiGAN: Gate in GAN, which utilizes special Residual Blocks embedded with Gate Cells to filter and extract the features for facial attribute transfer and facial expression synthesis tasks. Specifically, we treat the intermediate feature from source domain to target domain as a sequence, and introduce the gate mechanism into this sequential task. To achieve this, we introduce the convolutional layers into gate cell and modify the stream in traditional gate cell to suit for image-to-image translation task. We designed two types of methods based on reusing parameters in the residual blocks or not, namely GiGAN-reuse and GiGAN-non-reuse. Experimental results and quantitative evaluations show that our model has superior performance against state-of-the-arts. And ablation studies demonstrate the effectiveness of our method. Furthermore, visualization of the features in Gate Cells shows that Gate Mechanism can filter the features in image-to-image translation effectively.

    Original languageEnglish (US)
    Pages (from-to)376-388
    Number of pages13
    JournalNeurocomputing
    Volume462
    DOIs
    StatePublished - Oct 28 2021

    Keywords

    • Gate mechanism
    • Generative adversarial networks (GANs)
    • Image-to-image translation
    • LSTM

    ASJC Scopus subject areas

    • Computer Science Applications
    • Cognitive Neuroscience
    • Artificial Intelligence

    Fingerprint

    Dive into the research topics of 'GiGAN: Gate in GAN, could gate mechanism filter the features in image-to-image translation?'. Together they form a unique fingerprint.

    Cite this