Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

RGB-Event based Pedestrian Attribute Recognition: A Benchmark Dataset and An Asymmetric RWKV Fusion Framework

Created by
  • Haebom

Author

Xiao Wang, Haiyang Wang, Shiao Wang, Qiang Chen, Jiandong Jin, Haoyu Song, Bo Jiang, Chenglong Li

Outline

Existing pedestrian attribute recognition methods have been developed based on RGB cameras, but they are vulnerable to lighting conditions and motion blur and have limitations in considering emotional aspects. This paper proposes a multimodal RGB-event pedestrian attribute recognition task utilizing an event camera, which boasts low-light performance, high speed, and low power consumption. We release EventPAR, a large-scale multimodal pedestrian attribute recognition dataset containing 100K RGB-event samples and covering 50 attributes related to appearance and six emotions. We retrain and evaluate existing PAR models to establish a benchmark, and propose a multimodal pedestrian attribute recognition framework based on RWKV. State-of-the-art results are achieved through experiments on the proposed dataset, MARS-Attribute, and DukeMTMC-VID-Attribute simulation datasets. The source code and dataset will be made available on GitHub.

Takeaways, Limitations

Takeaways:
Presenting new possibilities for multi-modal pedestrian attribute recognition using event cameras.
Release and Benchmark of EventPAR, a Large-Scale Multimodal Pedestrian Attribute Recognition Dataset
Proposal and Performance Verification of a Novel Multimodal Pedestrian Attribute Recognition Framework Based on RWKV
Laying the foundation for research on pedestrian attribute recognition that considers not only appearance but also emotions.
Limitations:
Further validation of the diversity and generalization performance of the EventPAR dataset is needed.
The proposed RWKV-based framework requires enhanced comparative analysis with other models.
Performance evaluation and robustness analysis in real-world environments are needed.
👍