Synthetic Aerial Dataset for UAV Detection via Text-to-Image Diffusion Models

Daitao Xing, Anthony Tzes

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In this work, we present an approach to generate a synthetic aerial dataset for efficient Unmanned Aerial Vehicle (UAV) detection. We propose controlling the output of a text-to-image diffusion model by applying additional input conditions. Specifically, we train a diffusion model that enables conditional inputs, i.e., binary masks that specify all tractable parameters, including quantity, scale, pose, color, background, etc. Diverse photorealistic images with corresponding ground truth bounding boxes are generated automatically in an end-to-end manner. Without any interference, the dataset can be scaled to a large magnitude to facilitate the training process of UAV detection. Experimental results of YOLOv7 trained on the synthetic dataset demonstrate an extensive precision increment on unseen datasets of real images.

Original languageEnglish (US)
Title of host publicationProceedings - 2023 IEEE Conference on Artificial Intelligence, CAI 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages51-52
Number of pages2
ISBN (Electronic)9798350339840
DOIs
StatePublished - 2023
Event2023 IEEE Conference on Artificial Intelligence, CAI 2023 - Santa Clara, United States
Duration: Jun 5 2023Jun 6 2023

Publication series

NameProceedings - 2023 IEEE Conference on Artificial Intelligence, CAI 2023

Conference

Conference2023 IEEE Conference on Artificial Intelligence, CAI 2023
Country/TerritoryUnited States
CitySanta Clara
Period6/5/236/6/23

Keywords

  • Generative Model
  • Text-to-Image Diffusion
  • UAV detection

ASJC Scopus subject areas

  • Computer Science Applications
  • Computer Vision and Pattern Recognition
  • Modeling and Simulation
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Synthetic Aerial Dataset for UAV Detection via Text-to-Image Diffusion Models'. Together they form a unique fingerprint.

Cite this