[공지사항]을 빙자한 안부와 근황 
Show more

Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Hands-On: Segmenting Individual Signs from Continuous Sequences

Created by
  • Haebom

Author

JianHe Low, Harry Walsh, Ozge Mercanoglu Sincan, Richard Bowden

Outline

This paper addresses the problem of continuous sign language segmentation, which is a critical task for sign language translation and data annotation. We propose a Transformer-based architecture that models temporal dynamics, and define frame segmentation as a sequence labeling problem using the Begin-In-Out (BIO) tagging technique. We leverage HaMeR hand features and 3D angles, and demonstrate that our approach achieves state-of-the-art results on the DGS Corpus and outperforms existing benchmarks on BSLCorpus.

Takeaways, Limitations

Takeaways:
We present a novel approach to the sign language segmentation problem by leveraging a Transformer-based architecture.
Effective frame segmentation using BIO tagging technique.
Improved performance by combining HaMeR hand features and 3D angles.
Demonstrated excellent performance on DGS Corpus and BSLCorpus.
Limitations:
Only performance evaluations on specific sign language datasets (DGS Corpus, BSLCorpus) are presented, requiring further research on generalizability.
Lack of analysis of the computational cost and complexity of the proposed model.
Lack of scalability validation across different sign languages and datasets.
👍