Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

HateClipSeg: A Segment-Level Annotated Dataset for Fine-Grained Hate Video Detection

Created by
  • Haebom

Author

Han Wang, Zhuoran Wang, Roy Ka-Wei Lee

Outline

We present a large-scale, multimodal dataset called HateClipSeg. This dataset contains over 11,714 video segments, each labeled with five offensive categories: normal or hateful, insulting, sexual, violent, and self-harm. It is annotated at both the overall video and segment levels, and includes victim information. We benchmark model performance and demonstrate the limitations of existing models by presenting three tasks: (1) classifying edited hate videos, (2) temporally localizing hate videos, and (3) classifying online hate videos. The dataset is publicly available.

Takeaways, Limitations

Takeaways:
Contributing to research on detecting hate speech in videos by providing a large-scale multi-modal video dataset, HateClipSeg.
Enables more sophisticated analysis with video-level and segment-level annotations and victim information.
The three tasks presented clearly reveal the limitations of existing models and suggest future research directions.
High reliability annotation (Krippendorff's alpha = 0.817).
Improving research accessibility by providing open datasets.
Limitations:
The dataset may not be large enough yet (needs to be expanded in the future).
Potential bias towards certain languages or cultures (efforts to ensure diversity are needed).
Beyond the three presented tasks, various research tasks are needed (requiring diverse analyses and approaches).
👍