Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

STOPA: A Database of Systematic Variation Of DeePfake Audio for Open-Set Source Tracing and Attribution

Created by
  • Haebom

Author

Anton Firc, Manasi Chhibber, Jagabandhu Mishra, Vishwanath Pratap Singh, Tomi Kinnunen, Kamil Malinka

STOPA: Deepfake Speech Source Tracing Dataset

Outline

This paper introduces STOPA, a novel dataset aimed at addressing the lack of systematically curated datasets for source attribution of synthesized speech, a key area of deepfake voice detection research. STOPA comprises 700,000 samples generated from eight acoustic models (AMs), six vocoder models (VMs), and 13 different synthesizers, systematically addressing a wide range of parameter settings. Unlike existing datasets, STOPA provides a systematically controlled framework that encompasses various generative elements, such as vocoder models, acoustic models, and pre-trained weight selection, thereby improving attribution confidence.

Takeaways, Limitations

Contributing to the advancement of deepfake voice source tracking research: The STOPA dataset contributes to the study of voice synthesis source tracking by covering a wide range of generative elements.
Improve attribution accuracy: Systematic controls improve attribution accuracy to enhance forensic analysis, deepfake detection, and generative model transparency.
Complexity of the dataset: STOPA has a large dataset with a variety of settings, including 8 AMs, 6 VMs, 13 synthesizers, and 700,000 samples, which may require significant computational resources for analysis.
Difficulty in dataset creation: STOPA requires collecting and managing a variety of models and parameter settings for systematic variation, requiring significant effort and expertise to create and maintain the dataset.
Generalization issues: The specific models and settings included in STOPA may not perfectly represent all real-world deepfake scenarios, and the dataset's generalization ability may be limited.
👍