Jack of many Faces: A Step Towards Facial Expression and Physiological State Analysis with a Single Network

Abdullah Tariq*, Martin Mesak, R. Muhammad Atif Azad, Zulqarnain Gilani

*Corresponding author for this work

    Research output: Contribution to conferencePaperpeer-review

    Abstract

    Facial feature analysis, particularly dynamic facial expression recognition, is essential in computer vision for understanding human emotions, behaviors, and physiological states. However, existing approaches often exhibit limited performance, stemming from inadequate modelling of facial dynamics, noise sensitivity, ambiguous expression semantics, and are generally specific to single-task scenarios. To address these issues, we propose a compact 3D spatio-temporal network capable of handling both expression recognition and physiological state analysis. Our network includes two custom modules: (1) Contrastive Adversarial Efficient Local Channel Attention (ConAdv-ELCA), which extracts and disentangles fine-grained local facial features, and (2) Efficient Global Channel Attention (EGCA), to capture local-global interactions. Unlike prior work, which predominantly evaluates models on similar datasets within single-task domains, our work has demonstrated the ability to generalize across different tasks that are based on facial analysis. Experimental results demonstrate that our model consistently achieves state-ofthe-art or near-state-of-the-art performance on blood alcohol concentration estimation, dynamic facial expression recognition, and driver fatigue detection.
    Original languageEnglish
    Publication statusAccepted/In press (AAM) - 1 Aug 2025

    Fingerprint

    Dive into the research topics of 'Jack of many Faces: A Step Towards Facial Expression and Physiological State Analysis with a Single Network'. Together they form a unique fingerprint.

    Cite this