TY - GEN
T1 - Measures of Disparity and their Efficient Estimation
AU - Singh, Harvineet
AU - Chunara, Rumi
N1 - Publisher Copyright:
© 2023 ACM.
PY - 2023/8/8
Y1 - 2023/8/8
N2 - Quantifying disparities, that is differences in outcomes among population groups, is an important task in public health, economics, and increasingly in machine learning. In this work, we study the question of how to collect data to measure disparities. The field of survey statistics provides extensive guidance on sample sizes necessary to accurately estimate quantities such as averages. However, there is limited guidance for estimating disparities. We consider a broad class of disparity metrics including those used in machine learning for measuring fairness of model outputs. For each metric, we derive the number of samples to be collected per group that increases the precision of disparity estimates given a fixed data collection budget. We also provide sample size calculations for hypothesis tests that check for significant disparities. Our methods can be used to determine sample sizes for fairness evaluations. We validate the methods on two nationwide surveys, used for understanding population-level attributes like employment and health, and a prediction model. Absent a priori information on the groups, we find that equally sampling the groups typically performs well.
AB - Quantifying disparities, that is differences in outcomes among population groups, is an important task in public health, economics, and increasingly in machine learning. In this work, we study the question of how to collect data to measure disparities. The field of survey statistics provides extensive guidance on sample sizes necessary to accurately estimate quantities such as averages. However, there is limited guidance for estimating disparities. We consider a broad class of disparity metrics including those used in machine learning for measuring fairness of model outputs. For each metric, we derive the number of samples to be collected per group that increases the precision of disparity estimates given a fixed data collection budget. We also provide sample size calculations for hypothesis tests that check for significant disparities. Our methods can be used to determine sample sizes for fairness evaluations. We validate the methods on two nationwide surveys, used for understanding population-level attributes like employment and health, and a prediction model. Absent a priori information on the groups, we find that equally sampling the groups typically performs well.
KW - AI
KW - Social Sciences
KW - and well-being
KW - disparity estimation
KW - fairness metrics
KW - health
KW - optimal data collection
UR - http://www.scopus.com/inward/record.url?scp=85173629292&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85173629292&partnerID=8YFLogxK
U2 - 10.1145/3600211.3604697
DO - 10.1145/3600211.3604697
M3 - Conference contribution
AN - SCOPUS:85173629292
T3 - AIES 2023 - Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society
SP - 927
EP - 938
BT - AIES 2023 - Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society
PB - Association for Computing Machinery, Inc
T2 - 2023 AAAI / ACM Conference on Artificial Intelligence, Ethics, and Society, AIES 2023
Y2 - 8 August 2023 through 10 August 2023
ER -