Measures of Disparity and their Efficient Estimation

Harvineet Singh, Rumi Chunara

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Quantifying disparities, that is differences in outcomes among population groups, is an important task in public health, economics, and increasingly in machine learning. In this work, we study the question of how to collect data to measure disparities. The field of survey statistics provides extensive guidance on sample sizes necessary to accurately estimate quantities such as averages. However, there is limited guidance for estimating disparities. We consider a broad class of disparity metrics including those used in machine learning for measuring fairness of model outputs. For each metric, we derive the number of samples to be collected per group that increases the precision of disparity estimates given a fixed data collection budget. We also provide sample size calculations for hypothesis tests that check for significant disparities. Our methods can be used to determine sample sizes for fairness evaluations. We validate the methods on two nationwide surveys, used for understanding population-level attributes like employment and health, and a prediction model. Absent a priori information on the groups, we find that equally sampling the groups typically performs well.

Original languageEnglish (US)
Title of host publicationAIES 2023 - Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society
PublisherAssociation for Computing Machinery, Inc
Pages927-938
Number of pages12
ISBN (Electronic)9798400702310
DOIs
StatePublished - Aug 8 2023
Event2023 AAAI / ACM Conference on Artificial Intelligence, Ethics, and Society, AIES 2023 - Montreal, Canada
Duration: Aug 8 2023Aug 10 2023

Publication series

NameAIES 2023 - Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society

Conference

Conference2023 AAAI / ACM Conference on Artificial Intelligence, Ethics, and Society, AIES 2023
Country/TerritoryCanada
CityMontreal
Period8/8/238/10/23

Keywords

  • AI
  • Social Sciences
  • and well-being
  • disparity estimation
  • fairness metrics
  • health
  • optimal data collection

ASJC Scopus subject areas

  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Measures of Disparity and their Efficient Estimation'. Together they form a unique fingerprint.

Cite this