TY - GEN
T1 - On the Praxes and Politics of AI Speech Emotion Recognition
AU - Kang, Edward B.
N1 - Funding Information:
I am grateful for the conversations I had with friends, colleagues, and mentors in the writing and editing of this paper. These include, but are not limited to: Kate Crawford, Larry Gross, Josh Kun, Mike Ananny, John Cheney-Lippold, Amy Lee, and fellow members of the Sloan-funded research collective "Knowing Machines." I also very much thank the three anonymous reviewers and program chairs of FAccT'23 for their feedback.
Publisher Copyright:
© 2023 Owner/Author.
PY - 2023/6/12
Y1 - 2023/6/12
N2 - There is no scientific consensus on what is meant by "emotion"- researchers have examined various phenomena spanning brain modes, feelings, sensations, and cognitive structures, among others, in their study of emotional experiences. For the purposes of developing an AI speech emotion recognition (SER) system, however, emotion must be defined, bounded, and instantiated as ground truth in the training data. This means practical choices must be made in which particular emotional ontologies are prioritized over others in the construction of SER datasets. In this paper, I explore these tensions around fairness, accountability, and transparency by analyzing open-source datasets used for SER applications along with their accompanying methodology papers. Specifically, I critique the centrality of discrete emotion theory in SER applications as a contestable emotional framework that is invoked primarily for its practical utility and alignment - as opposed to scientific rigor - with machine learning epistemologies. In so doing, I also shed light on the role of the dataset creators as emotional designers in their attempt to produce, elicit, record, and index emotional expressions for the purposes of crafting SER training datasets. Ultimately, by further querying SER through the aperture of Critical Disability Studies, I use this empirical work to examine the sociopolitical stakes of SER as a normative and regulatory technology that siphons emotion into a broader agenda of capitalistic productivity in the context of call center optimization.
AB - There is no scientific consensus on what is meant by "emotion"- researchers have examined various phenomena spanning brain modes, feelings, sensations, and cognitive structures, among others, in their study of emotional experiences. For the purposes of developing an AI speech emotion recognition (SER) system, however, emotion must be defined, bounded, and instantiated as ground truth in the training data. This means practical choices must be made in which particular emotional ontologies are prioritized over others in the construction of SER datasets. In this paper, I explore these tensions around fairness, accountability, and transparency by analyzing open-source datasets used for SER applications along with their accompanying methodology papers. Specifically, I critique the centrality of discrete emotion theory in SER applications as a contestable emotional framework that is invoked primarily for its practical utility and alignment - as opposed to scientific rigor - with machine learning epistemologies. In so doing, I also shed light on the role of the dataset creators as emotional designers in their attempt to produce, elicit, record, and index emotional expressions for the purposes of crafting SER training datasets. Ultimately, by further querying SER through the aperture of Critical Disability Studies, I use this empirical work to examine the sociopolitical stakes of SER as a normative and regulatory technology that siphons emotion into a broader agenda of capitalistic productivity in the context of call center optimization.
KW - affective computing
KW - critical study of AI
KW - disability and AI
KW - Emotion AI
KW - social critique of AI
KW - speech emotion recognition
UR - http://www.scopus.com/inward/record.url?scp=85163620673&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85163620673&partnerID=8YFLogxK
U2 - 10.1145/3593013.3594011
DO - 10.1145/3593013.3594011
M3 - Conference contribution
AN - SCOPUS:85163620673
T3 - ACM International Conference Proceeding Series
SP - 455
EP - 466
BT - Proceedings of the 6th ACM Conference on Fairness, Accountability, and Transparency, FAccT 2023
PB - Association for Computing Machinery
T2 - 6th ACM Conference on Fairness, Accountability, and Transparency, FAccT 2023
Y2 - 12 June 2023 through 15 June 2023
ER -