Is this real? Generating synthetic data that looks real

Miro Mannino, Azza Abouzied

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Synner is a tool that helps users generate real-looking synthetic data by visually and declaratively specifying the properties of the dataset such as each field's statistical distribution, its domain, and its relationship to other fields. It provides instant feedback on every user interaction by updating multiple visualizations of the generated dataset and even suggests data generation specifications from a few user examples and interactions. Synner visually communicates the inherent randomness of statistical data generation. Our evaluation of Synner demonstrates its effectiveness at generating realistic data when compared with Mockaroo, a popular data generation tool, and with hired developers who coded data generation scripts for a fee.

Original languageEnglish (US)
Title of host publicationUIST 2019 - Proceedings of the 32nd Annual ACM Symposium on User Interface Software and Technology
PublisherAssociation for Computing Machinery, Inc
Pages549-561
Number of pages13
ISBN (Electronic)9781450368162
DOIs
StatePublished - Oct 17 2019
Event32nd Annual ACM Symposium on User Interface Software and Technology, UIST 2019 - New Orleans, United States
Duration: Oct 20 2019Oct 23 2019

Publication series

NameUIST 2019 - Proceedings of the 32nd Annual ACM Symposium on User Interface Software and Technology

Conference

Conference32nd Annual ACM Symposium on User Interface Software and Technology, UIST 2019
CountryUnited States
CityNew Orleans
Period10/20/1910/23/19

Keywords

  • Data generation
  • Mixed-initiative UI
  • Uncertainty visualization

ASJC Scopus subject areas

  • Human-Computer Interaction
  • Software

Fingerprint Dive into the research topics of 'Is this real? Generating synthetic data that looks real'. Together they form a unique fingerprint.

  • Cite this

    Mannino, M., & Abouzied, A. (2019). Is this real? Generating synthetic data that looks real. In UIST 2019 - Proceedings of the 32nd Annual ACM Symposium on User Interface Software and Technology (pp. 549-561). (UIST 2019 - Proceedings of the 32nd Annual ACM Symposium on User Interface Software and Technology). Association for Computing Machinery, Inc. https://doi.org/10.1145/3332165.3347866