TY - GEN
T1 - How Intense Are Your Words? Understanding Emotion Intensity from Speech
AU - Mukherjee, Himadri
AU - Salam, Hanan
AU - Othmani, Alice
AU - Santosh, K. C.
N1 - Funding Information:
* The first two authors contributed equally to this research. This research is supported by New York University Abu Dhabi internal research grant.
Publisher Copyright:
© 2021 IEEE.
PY - 2021
Y1 - 2021
N2 - Speech-based emotion recognition has emerged as an important research area in the field of affective computing. Despite the multifarious developments in emotion recognition, the analysis of the intensity of the expressed emotion has mostly been unexplored. As compared to distinguishing emotions, understanding their intensities is a herculean task. The spectral characteristics of a speech signal portray important information which is necessary to distinguish emotional intensity. Line Spectral Frequency (LSF) features offer a spectral representation of the speech signal in addition to modeling the formant structure. Such features have been explored for modeling emotions from speech. However, they have not been explored for emotion intensity detection. In this paper, we explore LSF features for emotion intensity characterization. In order to make the proposed approach appropriate for low computational resources settings, we present a low dimensional version of LSF: the Band Dominance and Dynamics LSF (BDD-LSF). The proposed BDD-LSF is based on a frequency bands dominance and dynamics analysis technique of the LSF features. Such feature is capable of handling intra-class variation between different intensities involving multifarious emotional states. Using the publicly available RAVDESS dataset, we achieved the highest accuracy of 75.75% for distinguishing emotional intensities. Our system also outperforms reported works which use deep learning-based techniques.
AB - Speech-based emotion recognition has emerged as an important research area in the field of affective computing. Despite the multifarious developments in emotion recognition, the analysis of the intensity of the expressed emotion has mostly been unexplored. As compared to distinguishing emotions, understanding their intensities is a herculean task. The spectral characteristics of a speech signal portray important information which is necessary to distinguish emotional intensity. Line Spectral Frequency (LSF) features offer a spectral representation of the speech signal in addition to modeling the formant structure. Such features have been explored for modeling emotions from speech. However, they have not been explored for emotion intensity detection. In this paper, we explore LSF features for emotion intensity characterization. In order to make the proposed approach appropriate for low computational resources settings, we present a low dimensional version of LSF: the Band Dominance and Dynamics LSF (BDD-LSF). The proposed BDD-LSF is based on a frequency bands dominance and dynamics analysis technique of the LSF features. Such feature is capable of handling intra-class variation between different intensities involving multifarious emotional states. Using the publicly available RAVDESS dataset, we achieved the highest accuracy of 75.75% for distinguishing emotional intensities. Our system also outperforms reported works which use deep learning-based techniques.
KW - Affective Computing
KW - Line Spectral Frequency
KW - Random Forest
KW - SER
KW - Speech Signal Processing
KW - Speech-based Emotional Intensity Recognition
UR - http://www.scopus.com/inward/record.url?scp=85124403029&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85124403029&partnerID=8YFLogxK
U2 - 10.1109/ICCT52962.2021.9658078
DO - 10.1109/ICCT52962.2021.9658078
M3 - Conference contribution
AN - SCOPUS:85124403029
T3 - International Conference on Communication Technology Proceedings, ICCT
SP - 1280
EP - 1286
BT - 2021 IEEE 21st International Conference on Communication Technology, ICCT 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 21st IEEE International Conference on Communication Technology, ICCT 2021
Y2 - 13 October 2021 through 16 October 2021
ER -