Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Direct Video-Based Spatiotemporal Deep Learning for Cattle Lameness Detection

Created by
  • Haebom

Author

Md Fahimuzzman Sohan, Raid Alzubi, Hadeel Alzoubi, Eid Albalawi, AH Abdul Hafez

Outline

This paper proposes a spatiotemporal deep learning framework for automatic detection of lameness in cattle. Using 50 publicly available video clips (42 cattle), we train and evaluate two deep learning architectures: a 3D CNN and a ConvLSTM2D. Data augmentation techniques are applied to improve generalization performance, and the 3D CNN achieves 90% accuracy (90.9% precision, recall, and F1 score, respectively), outperforming ConvLSTM2D (85% accuracy). Unlike existing multi-stage pipelined approaches that incorporate object detection and pose estimation, this study demonstrates the effectiveness of a direct end-to-end video classification approach. By successfully extracting and learning spatiotemporal features from diverse video sources, we enable scalable and efficient cattle lameness detection in real-world farm environments.

Takeaways, Limitations

Takeaways:
An Effective Spatiotemporal Deep Learning Framework for Automatic Crippling Detection in Cattle
Excellent performance validation of an end-to-end video classification approach using 3D CNN (90% accuracy)
Improved efficiency compared to existing multi-stage pipeline methods and omission of the posture estimation preprocessing step.
Supporting follow-up research by providing open datasets
Limitations:
The dataset used was relatively small (50 video clips, 42 cows).
May have limited variety in breeds, ages, types of lameness, etc.
There is a possibility that it may not fully reflect the complexity of the actual farm environment.
The performance of the ConvLSTM2D model is relatively low compared to the 3D CNN model.
👍