Abstract
Face presentation attack detection (FacePAD) is critical for securing face recognition systems against attacks such as printed photos, videos, and 3D masks. Existing methods often struggle with generalizability, computational efficiency, and handling sophisticated attacks, particularly in resource-constrained environments. To address these challenges, this study proposes a lightweight CNN-based architecture, MobileNetV3, integrated with spatio-temporal feature extraction. The proposed method effectively captures both dynamic and static characteristics and achieves state-of-the-art performance, including an Equal Error Rate (EER) of 0.0% on the Replay-Attack and Replay-Mobile datasets, and 0.83% on the challenging ROSE-Youtu dataset. With real-time efficiency, processing 256 samples in 11 ms, the model is suitable for deployment on mobile and embedded platforms. This work demonstrates that lightweight architectures with spatio-temporal features can balance computational efficiency and accuracy, setting a benchmark for practical FacePAD systems in applications like mobile authentication, surveillance, and access control, enhancing biometric security.
| Original language | English |
|---|---|
| Article number | 113059 |
| Journal | Knowledge-Based Systems |
| Volume | 311 |
| DOIs | |
| State | Published - 28 Feb 2025 |
Bibliographical note
Publisher Copyright:© 2025 Elsevier B.V.
Keywords
- Deep learning
- Face anti-spoofing
- Face presentation attack detection
- MobileNetV3
- Spatio-temporal
ASJC Scopus subject areas
- Software
- Management Information Systems
- Information Systems and Management
- Artificial Intelligence