TY - GEN
T1 - Text-to-Painting on a Large Variance Dataset with Sequential Generative Adversarial Networks
AU - Ozgen, Azmi Can
AU - Aghdam, Omid Abdollahi
AU - Ekenel, Hazim Kemal
N1 - Publisher Copyright:
© 2020 IEEE.
PY - 2020/10/5
Y1 - 2020/10/5
N2 - Converting text descriptions to images using Generative Adversarial Networks has become a popular research area. Visually appealing images were generated in recent years successfully. We investigated the generation of artistic images on a custom-built large variance dataset, which includes training images with variations, for example, in shape, color, and content. These variations in images provide originality, which is an important factor for artistic essence. One major characteristic of our work is that we used keywords as image descriptions, instead of sentences. As a network architecture, we proposed a sequential Generative Adversarial Network model, which utilizes several techniques like Wasserstein loss, spectral normalization, and minibatch discrimination to have stable training curves. Ultimately, we were able to generate painting images, which have a variety of styles. We evaluated the quality of generated paintings by using Fréchet Inception Distance score.
AB - Converting text descriptions to images using Generative Adversarial Networks has become a popular research area. Visually appealing images were generated in recent years successfully. We investigated the generation of artistic images on a custom-built large variance dataset, which includes training images with variations, for example, in shape, color, and content. These variations in images provide originality, which is an important factor for artistic essence. One major characteristic of our work is that we used keywords as image descriptions, instead of sentences. As a network architecture, we proposed a sequential Generative Adversarial Network model, which utilizes several techniques like Wasserstein loss, spectral normalization, and minibatch discrimination to have stable training curves. Ultimately, we were able to generate painting images, which have a variety of styles. We evaluated the quality of generated paintings by using Fréchet Inception Distance score.
KW - Generative Adversarial Networks (GANs)
KW - Painting generation
KW - Sequential GANs
KW - Text-to-Image synthesis
UR - http://www.scopus.com/inward/record.url?scp=85100305294&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85100305294&partnerID=8YFLogxK
U2 - 10.1109/SIU49456.2020.9302112
DO - 10.1109/SIU49456.2020.9302112
M3 - Conference contribution
AN - SCOPUS:85100305294
T3 - 2020 28th Signal Processing and Communications Applications Conference, SIU 2020 - Proceedings
BT - 2020 28th Signal Processing and Communications Applications Conference, SIU 2020 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 28th Signal Processing and Communications Applications Conference, SIU 2020
Y2 - 5 October 2020 through 7 October 2020
ER -