TY - GEN
T1 - Metric-based generative adversarial network
AU - Dai, Guoxian
AU - Xie, Jin
AU - Fang, Yi
N1 - Funding Information:
This material is partly based upon work supported by New York University Abu Dhabi institute (AD131 and REF131).
Publisher Copyright:
© 2017 Association for Computing Machinery.
PY - 2017/10/23
Y1 - 2017/10/23
N2 - Existing methods of generative adversarial network (GAN) use different criteria to distinguish between real and fake samples, such as probability [9], energy [44] or other losses [30]. In this paper, by employing the merits of deep metric learning, we propose a novel metric-based generative adversarial network (MBGAN), which uses the distance-criteria to distinguish between real and fake samples. Specifically, the discriminator of MBGAN adopts a triplet structure and learns a deep nonlinear transformation, which maps input samples into a new feature space. In the transformed space, the distance between real samples is minimized, while the distance between real sample and fake sample is maximized. Similar to the adversarial procedure of existing GANs, a generator is trained to produce synthesized examples, which are close to real examples, while a discriminator is trained to maximize the distance between real and fake samples to a large margin. Meanwhile, instead of using a fixed margin, we adopt a data-dependent margin [30], so that the generator could focus on improving the synthesized samples with poor quality, instead of wasting energy on well-produce samples. Our proposed method is verified on various benchmarks, such as CIFAR-10, SVHN and CelebA, and generates high-quality samples.
AB - Existing methods of generative adversarial network (GAN) use different criteria to distinguish between real and fake samples, such as probability [9], energy [44] or other losses [30]. In this paper, by employing the merits of deep metric learning, we propose a novel metric-based generative adversarial network (MBGAN), which uses the distance-criteria to distinguish between real and fake samples. Specifically, the discriminator of MBGAN adopts a triplet structure and learns a deep nonlinear transformation, which maps input samples into a new feature space. In the transformed space, the distance between real samples is minimized, while the distance between real sample and fake sample is maximized. Similar to the adversarial procedure of existing GANs, a generator is trained to produce synthesized examples, which are close to real examples, while a discriminator is trained to maximize the distance between real and fake samples to a large margin. Meanwhile, instead of using a fixed margin, we adopt a data-dependent margin [30], so that the generator could focus on improving the synthesized samples with poor quality, instead of wasting energy on well-produce samples. Our proposed method is verified on various benchmarks, such as CIFAR-10, SVHN and CelebA, and generates high-quality samples.
KW - Data-dependent margin
KW - Deep metric learning
KW - Generative adversarial network
UR - http://www.scopus.com/inward/record.url?scp=85035208332&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85035208332&partnerID=8YFLogxK
U2 - 10.1145/3123266.3123334
DO - 10.1145/3123266.3123334
M3 - Conference contribution
AN - SCOPUS:85035208332
T3 - MM 2017 - Proceedings of the 2017 ACM Multimedia Conference
SP - 672
EP - 680
BT - MM 2017 - Proceedings of the 2017 ACM Multimedia Conference
PB - Association for Computing Machinery, Inc
T2 - 25th ACM International Conference on Multimedia, MM 2017
Y2 - 23 October 2017 through 27 October 2017
ER -