Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Direct Video-Based Spatiotemporal Deep Learning for Cattle Lameness Detection

Created by
  • Haebom

Author

Md Fahimuzzman Sohan, Raid Alzubi, Hadeel Alzoubi, Eid Albalawi, AH Abdul Hafez

Outline

This paper proposes a spatiotemporal deep learning framework for detecting gait abnormalities in cattle using publicly available video data. We build and release a balanced dataset consisting of 50 video clips of 42 cattle, and employ data augmentation techniques to train and evaluate two deep learning models: a 3D CNN and a ConvLSTM2D model. The 3D CNN model achieves 90% video-level classification accuracy and 90.9% precision, recall, and F1 score, outperforming the ConvLSTM2D model (85% accuracy). Unlike existing methods that rely on multi-stage pipelines for object detection and pose estimation, this study demonstrates the effectiveness of a direct end-to-end video classification approach. This approach effectively extracts and learns spatiotemporal features from diverse video sources, enabling scalable and efficient detection of cattle gait abnormalities in real-world farm environments.

Takeaways, Limitations

Takeaways:
An effective end-to-end deep learning framework for detecting gait abnormalities in cattle is presented.
Achieving high accuracy (90%) through 3D CNN model.
Increased efficiency by omitting object detection and pose estimation preprocessing steps.
Facilitating further research by providing open datasets.
Limitations:
The dataset is relatively small (50 video clips).
Generalization performance validation is needed for cattle of different breeds, ages, and environmental conditions.
Real-time processing performance evaluation in actual farm environments is required.
The performance of the ConvLSTM2D model is lower than that of the 3D CNN model.
👍