TRAPDOOR: Repurposing neural network backdoors to detect dataset bias in machine learning-based genomic analysis

Esha Sarkar, Constantine Doumanidis, Michail Maniatakos

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Use of Machine Learning (ML) to understand underlying patterns in gene mutations (genomics) has far-reaching results in diagnosis and treatment for life-threatening diseases like cancer. Success and sustainability of ML algorithms depends on the quality and diversity of training data, and under-representation of groups (gender, race, etc.) can lead to exacerbation of systemic discrimination issues. In this work, we propose TRAPDOOR, a methodology for the identification of biased datasets by repurposing, otherwise malicious, neural backdoors. Our methodology can leak potential bias information about the cloud's dataset which is collected in a collaborative setting, without hampering the genuine performance. Using a real-world cancer genomics dataset, we analyze feasibility of leaking bias for gender and race attributes. Our experimental results show that TRAPDOOR can detect the presence of dataset bias with 100% accuracy, and furthermore can also extract the extent of bias by recovering the percentage with a small error.

Original languageEnglish (US)
Title of host publication2023 IFIP/IEEE 31st International Conference on Very Large Scale Integration, VLSI-SoC 2023
PublisherIEEE Computer Society
ISBN (Electronic)9798350325997
DOIs
StatePublished - 2023
Event31st IFIP/IEEE International Conference on Very Large Scale Integration, VLSI-SoC 2023 - Dubai, United Arab Emirates
Duration: Oct 16 2023Oct 18 2023

Publication series

NameIEEE/IFIP International Conference on VLSI and System-on-Chip, VLSI-SoC
ISSN (Print)2324-8432
ISSN (Electronic)2324-8440

Conference

Conference31st IFIP/IEEE International Conference on Very Large Scale Integration, VLSI-SoC 2023
Country/TerritoryUnited Arab Emirates
CityDubai
Period10/16/2310/18/23

ASJC Scopus subject areas

  • Hardware and Architecture
  • Software
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'TRAPDOOR: Repurposing neural network backdoors to detect dataset bias in machine learning-based genomic analysis'. Together they form a unique fingerprint.

Cite this