Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Veritas: Generalizable Deepfake Detection via Pattern-Aware Reasoning

Created by
  • Haebom

Author

Hao Tan, Jun Lan, Zichang Tan, Ajian Liu, Chuanbiao Song, Senyuan Shi, Huijia Zhu, Weiqiang Wang, Jun Wan, Zhen Lei

Outline

This paper highlights the challenging nature of deepfake content in real-world scenarios due to its complex and evolving nature. Existing academic benchmarks typically feature homogeneous training sources and low-quality test images, significantly discouraging real-world deployment of current detectors. To address this gap, we present the HydraFake dataset, which simulates real-world challenges through hierarchical generalization testing. HydraFake encompasses a variety of deepfake techniques, field forgeries, rigorous training and evaluation protocols, and covers unseen model architectures, novel forgery techniques, and novel data domains. Building on these resources, we propose Veritas, a multimodal large-scale language model (MLLM)-based deepfake detector. Unlike conventional thought processes (CoT), we introduce pattern recognition inference, which incorporates key inference patterns such as "planning" and "self-reflection," to mimic human forensic processes. We also propose a two-stage training pipeline to seamlessly integrate these deepfake inference capabilities into existing MLLMs. Experiments on the HydraFake dataset demonstrate that previous detectors demonstrate excellent generalization performance in cross-model scenarios, but fall short in unseen forgery and data domains. Veritas achieves significant performance improvements across a variety of OOD scenarios, delivering transparent and accurate detection results.

Takeaways, Limitations

Takeaways:
We present HydraFake, a new dataset that reflects real-world deepfake detection challenges.
We propose Veritas, a novel deepfake detector based on a multimodal large-scale language model (MLLM).
Improve detection performance by mimicking human forensic reasoning processes through pattern recognition inference.
Veritas outperforms existing detectors in a variety of Out-of-Distribution (OOD) scenarios.
Provides transparent and reliable detection results.
Limitations:
The HydraFake dataset may not perfectly reflect all real-world deepfake threats.
Veritas's performance may depend on training and evaluation on the HydraFake dataset. Further research is needed to determine generalization performance on other datasets.
MLLM-based approaches can be computationally expensive.
As new deepfake technologies continue to emerge, Veritas requires ongoing updates and improvements.
👍