TY - GEN
T1 - Provable Detection of Propagating Sampling Bias in Prediction Models
AU - Ravishankar, Pavan
AU - Mo, Qingyu
AU - McFowland, Edward
AU - Neill, Daniel B.
N1 - Funding Information:
This work was partially supported by the National Science Foundation Program on Fairness in Artificial Intelligence in Collaboration with Amazon, grant IIS-2040898. We gratefully acknowledge input from Prof. Ravi Shroff for designing experiments on the NYPD Stop and Frisk Data.
Publisher Copyright:
Copyright © 2023, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.
PY - 2023/6/27
Y1 - 2023/6/27
N2 - With an increased focus on incorporating fairness in machine learning models, it becomes imperative not only to assess and mitigate bias at each stage of the machine learning pipeline but also to understand the downstream impacts of bias across stages. Here we consider a general, but realistic, scenario in which a predictive model is learned from (potentially biased) training data, and model predictions are assessed post-hoc for fairness by some auditing method. We provide a theoretical analysis of how a specific form of data bias, differential sampling bias, propagates from the data stage to the prediction stage. Unlike prior work, we evaluate the downstream impacts of data biases quantitatively rather than qualitatively and prove theoretical guarantees for detection. Under reasonable assumptions, we quantify how the amount of bias in the model predictions varies as a function of the amount of differential sampling bias in the data, and at what point this bias becomes provably detectable by the auditor. Through experiments on two criminal justice datasets- the well-known COMPAS dataset and historical data from NYPD's stop and frisk policy- we demonstrate that the theoretical results hold in practice even when our assumptions are relaxed.
AB - With an increased focus on incorporating fairness in machine learning models, it becomes imperative not only to assess and mitigate bias at each stage of the machine learning pipeline but also to understand the downstream impacts of bias across stages. Here we consider a general, but realistic, scenario in which a predictive model is learned from (potentially biased) training data, and model predictions are assessed post-hoc for fairness by some auditing method. We provide a theoretical analysis of how a specific form of data bias, differential sampling bias, propagates from the data stage to the prediction stage. Unlike prior work, we evaluate the downstream impacts of data biases quantitatively rather than qualitatively and prove theoretical guarantees for detection. Under reasonable assumptions, we quantify how the amount of bias in the model predictions varies as a function of the amount of differential sampling bias in the data, and at what point this bias becomes provably detectable by the auditor. Through experiments on two criminal justice datasets- the well-known COMPAS dataset and historical data from NYPD's stop and frisk policy- we demonstrate that the theoretical results hold in practice even when our assumptions are relaxed.
UR - http://www.scopus.com/inward/record.url?scp=85168248653&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85168248653&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85168248653
T3 - Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023
SP - 9562
EP - 9569
BT - AAAI-23 Technical Tracks 8
A2 - Williams, Brian
A2 - Chen, Yiling
A2 - Neville, Jennifer
PB - AAAI press
T2 - 37th AAAI Conference on Artificial Intelligence, AAAI 2023
Y2 - 7 February 2023 through 14 February 2023
ER -