Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

Fact-R1: Towards Explainable Video Misinformation Detection with Deep Reasoning

Created by
  • Haebom

Author

Fanrui Zhang, Dian Li, Qiang Zhang, Jun Chen, Gang Liu, Junxiong Lin, Jiahong Yan, Jiawei Liu, Zheng-Jun Zha

Outline

The rapid proliferation of multimodal misinformation on social media has fueled the growing need for research on video misinformation detection, but the lack of large-scale datasets has hampered research. In this paper, we introduce FakeVV, a large-scale benchmark consisting of over 100,000 video-text pairs, and propose Fact-R1, a novel framework that integrates deep inference and collaborative rule-based reinforcement learning. Trained through misinformation CoT instruction tuning, preference alignment via DPO, and GRPO with a verifiable reward function, Fact-R1 exhibits inference behavior similar to advanced text-based reinforcement learning systems in complex multimodal misinformation settings. This study presents a new paradigm for misinformation detection by combining large-scale video understanding, inference-based alignment, and interpretable verification.

Takeaways, Limitations

Takeaways:
Laying the foundation for research on video misinformation detection through the large-scale video-to-text dataset FakeVV.
Improving false information detection performance by leveraging deep inference and reinforcement learning through the Fact-R1 framework.
Simultaneously improving model interpretability and performance through a learning process utilizing CoT, DPO, and GRPO.
Demonstrating inference capabilities similar to text-based reinforcement learning systems in multimodal environments.
Limitations:
Lack of detailed information on specific model performance metrics and comparison results.
The need to verify the generalizability of various types of misinformation.
Further research is needed on its application and effectiveness in real-world social media environments.
👍