Facial Expression Recognition Using Light Field Cameras: A Comparative Study of Deep Learning Architectures
authors
keywords
- Facial Expression Recognition
- Light Field Camera
- Convolutional Neural Networks
- Recurrent Neural Networks
document type
COMMabstract
This paper presents our contribution to facial expression recognition using images obtained from the Light Field Face Dataset (LF). We compare several variants of neural network architectures to demonstrate the potential benefits of using this relatively new optical system in the field of facial expression recognition. We propose the use of the EfficientNetV2-S convolutional neural network as the base architecture, combined with various recurrent neural networks (LSTM, GRU, BiLSTM, and BiGRU) in our experiments. Furthermore, we investigate different sets of sub-aperture images, each varying in terms of the number of images and virtual position. The results demonstrate a significant improvement in accuracy for two specific configurations, depending on the sets of sub-aperture images used. The first configuration involves using the EfficientNetV2-S model in a two-branch configuration combined with an LSTM. The second configuration uses a single branch model with a BiLSTM.