Lightweight Deepfake Detection Based on Multi-Feature Fusion
Siddiqui Muhammad Yasir, Hyun Kim
TL;DR
This work addresses the need for real-time, resource-efficient deepfake detection by fusing handcrafted texture (LBP, HOG) and keypoint (KAZE) features within a lightweight ML framework. Keyframes are selectively extracted to minimize data, and the combined feature vector $\mathbf{F}_{\text{combined}}$ blends $\mathbf{F}_{\text{LBP}}$ and $\mathbf{F}_{\text{KAZE}}$ for classification by RF, Extra Trees, SVC, or XGBoost. Empirical results on FaceForensics++ and Celeb-DF demonstrate that the HOG+KAZE fusion achieves strong accuracy (up to $92.12\%$ on FaceForensics++ with SVC) while maintaining competitive inference and training times, offering a practical alternative to heavier CNN-based detectors. The approach emphasizes robustness to compression and limited computational resources, highlighting potential for real-time media authentication on mobile and edge devices. Overall, the study provides a scalable, interpretable pathway for enhancing digital content security against evolving deepfake threats.
Abstract
Deepfake technology utilizes deep learning based face manipulation techniques to seamlessly replace faces in videos creating highly realistic but artificially generated content. Although this technology has beneficial applications in media and entertainment misuse of its capabilities may lead to serious risks including identity theft cyberbullying and false information. The integration of DL with visual cognition has resulted in important technological improvements particularly in addressing privacy risks caused by artificially generated deepfake images on digital media platforms. In this study we propose an efficient and lightweight method for detecting deepfake images and videos making it suitable for devices with limited computational resources. In order to reduce the computational burden usually associated with DL models our method integrates machine learning classifiers in combination with keyframing approaches and texture analysis. Moreover the features extracted with a histogram of oriented gradients (HOG) local binary pattern (LBP) and KAZE bands were integrated to evaluate using random forest extreme gradient boosting extra trees and support vector classifier algorithms. Our findings show a feature-level fusion of HOG LBP and KAZE features improves accuracy to 92% and 96% on FaceForensics++ and Celeb-DFv2 respectively.
