TY - JOUR
T1 - Multi-facial patches aggregation network for facial expression recognition and facial regions contributions to emotion display
AU - Hazourli, Ahmed Rachid
AU - Djeghri, Amine
AU - Salam, Hanan
AU - Othmani, Alice
N1 - Publisher Copyright:
© 2021, The Author(s), under exclusive licence to Springer Science+Business Media, LLC part of Springer Nature.
PY - 2021/4
Y1 - 2021/4
N2 - In this paper, an approach for Facial Expressions Recognition (FER) based on a multi-facial patches (MFP) aggregation network is proposed. Deep features are learned from facial patches using convolutional neural sub-networks and aggregated within one architecture for expression classification. Besides, a framework based on two data augmentation techniques is proposed to expand FER labels training datasets. Consequently, the proposed shallow convolutional neural networks (CNN) based approach does not need large datasets for training. The proposed framework is evaluated on three FER datasets. Results show that the proposed approach achieves state-of-art FER deep learning approaches performance when the model is trained and tested on images from the same dataset. Moreover, the proposed data augmentation techniques improve the expression recognition rate, and thus can be a solution for training deep learning FER models using small datasets. The accuracy degrades significantly when testing for dataset bias. A fine-tuning can overcome the problem of transition from laboratory-controlled conditions to in-the-wild conditions. Finally, the emotional face is mapped using the MFP-CNN and the contribution of the different facial areas in displaying emotion as well as their importance in the recognition of each facial expression are studied.
AB - In this paper, an approach for Facial Expressions Recognition (FER) based on a multi-facial patches (MFP) aggregation network is proposed. Deep features are learned from facial patches using convolutional neural sub-networks and aggregated within one architecture for expression classification. Besides, a framework based on two data augmentation techniques is proposed to expand FER labels training datasets. Consequently, the proposed shallow convolutional neural networks (CNN) based approach does not need large datasets for training. The proposed framework is evaluated on three FER datasets. Results show that the proposed approach achieves state-of-art FER deep learning approaches performance when the model is trained and tested on images from the same dataset. Moreover, the proposed data augmentation techniques improve the expression recognition rate, and thus can be a solution for training deep learning FER models using small datasets. The accuracy degrades significantly when testing for dataset bias. A fine-tuning can overcome the problem of transition from laboratory-controlled conditions to in-the-wild conditions. Finally, the emotional face is mapped using the MFP-CNN and the contribution of the different facial areas in displaying emotion as well as their importance in the recognition of each facial expression are studied.
KW - Conditional generative adversarial network
KW - Deep visual learning
KW - Facial expression recognition
KW - Human-computer interaction
KW - multi-facial patches
UR - http://www.scopus.com/inward/record.url?scp=85100209032&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85100209032&partnerID=8YFLogxK
U2 - 10.1007/s11042-020-10332-7
DO - 10.1007/s11042-020-10332-7
M3 - Article
AN - SCOPUS:85100209032
SN - 1380-7501
VL - 80
SP - 13639
EP - 13662
JO - Multimedia Tools and Applications
JF - Multimedia Tools and Applications
IS - 9
ER -