TY - JOUR
T1 - GiGAN
T2 - Gate in GAN, could gate mechanism filter the features in image-to-image translation?
AU - Nie, Xuan
AU - Jia, Jianchao
AU - Ding, Haoxuan
AU - Wong, Edward K.
N1 - Publisher Copyright:
© 2021 Elsevier B.V.
PY - 2021/10/28
Y1 - 2021/10/28
N2 - Image-to-image translation techniques have been used in many different fields and have obtained remarkable performance in recent years. However, in many image-to-image translation tasks, only certain parts of the image need to be converted instead of the whole image. Traditional GAN-based methods often reconstruct the entire image, which may lead to artifacts and low-quality results. To address this issue, we propose a novel model, named GiGAN: Gate in GAN, which utilizes special Residual Blocks embedded with Gate Cells to filter and extract the features for facial attribute transfer and facial expression synthesis tasks. Specifically, we treat the intermediate feature from source domain to target domain as a sequence, and introduce the gate mechanism into this sequential task. To achieve this, we introduce the convolutional layers into gate cell and modify the stream in traditional gate cell to suit for image-to-image translation task. We designed two types of methods based on reusing parameters in the residual blocks or not, namely GiGAN-reuse and GiGAN-non-reuse. Experimental results and quantitative evaluations show that our model has superior performance against state-of-the-arts. And ablation studies demonstrate the effectiveness of our method. Furthermore, visualization of the features in Gate Cells shows that Gate Mechanism can filter the features in image-to-image translation effectively.
AB - Image-to-image translation techniques have been used in many different fields and have obtained remarkable performance in recent years. However, in many image-to-image translation tasks, only certain parts of the image need to be converted instead of the whole image. Traditional GAN-based methods often reconstruct the entire image, which may lead to artifacts and low-quality results. To address this issue, we propose a novel model, named GiGAN: Gate in GAN, which utilizes special Residual Blocks embedded with Gate Cells to filter and extract the features for facial attribute transfer and facial expression synthesis tasks. Specifically, we treat the intermediate feature from source domain to target domain as a sequence, and introduce the gate mechanism into this sequential task. To achieve this, we introduce the convolutional layers into gate cell and modify the stream in traditional gate cell to suit for image-to-image translation task. We designed two types of methods based on reusing parameters in the residual blocks or not, namely GiGAN-reuse and GiGAN-non-reuse. Experimental results and quantitative evaluations show that our model has superior performance against state-of-the-arts. And ablation studies demonstrate the effectiveness of our method. Furthermore, visualization of the features in Gate Cells shows that Gate Mechanism can filter the features in image-to-image translation effectively.
KW - Gate mechanism
KW - Generative adversarial networks (GANs)
KW - Image-to-image translation
KW - LSTM
UR - http://www.scopus.com/inward/record.url?scp=85112807068&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85112807068&partnerID=8YFLogxK
U2 - 10.1016/j.neucom.2021.07.085
DO - 10.1016/j.neucom.2021.07.085
M3 - Article
AN - SCOPUS:85112807068
SN - 0925-2312
VL - 462
SP - 376
EP - 388
JO - Neurocomputing
JF - Neurocomputing
ER -