Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

MythTriage: Scalable Detection of Opioid Use Disorder Myths on a Video-Sharing Platform

Created by
  • Haebom

Author

Hayoung Jung, Shravika Mittal, Ananya Aatreya, Navreet Kaur, Munmun De Choudhury, Tanushree Mitra

Outline

This paper presents the first large-scale study of the prevalence of misinformation related to opioid use disorder (OUD) on YouTube. In collaboration with clinical experts, we validated eight common misinformation categories and released a dataset of expert-labeled videos. To achieve efficient labeling, we introduced a classification pipeline called MythTriage, which uses lightweight models to handle common cases and delegates difficult cases to a high-performance, yet expensive, large-scale language model (LLM). MythTriage achieved a macro F1 score of up to 0.86 and is estimated to reduce annotation time and financial costs by over 76% compared to expert and full LLM labeling. By analyzing 2,900 search results and 343,000 recommendations, we uncover how misinformation persists on YouTube, providing actionable insights for public health and platform moderation.

Takeaways, Limitations

Takeaways:
Presents the first large-scale study on the prevalence of OUD-related misinformation on YouTube.
A dataset of eight widely-circulated misinformation topics related to OUD, validated by experts, is now available.
Development and performance validation of the MythTriage pipeline for efficient labeling (up to 0.86 macro F1 score, 76% cost reduction).
Provides insights into the mechanisms by which misinformation persists on YouTube and actionable suggestions for public health and platform moderation.
Limitations:
The study was limited to YouTube and may not reflect the situation on other platforms.
The performance of MythTriage depends on the performance of the LLM and lightweight models used.
We only cover a limited number of misinformation topics, which may not encompass all misinformation related to OUD.
Because the research results are based on data from a specific point in time, they may not reflect changes over time.
👍