Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Vision Backbone Efficient Selection for Image Classification in Low-Data Regimes

Created by
  • Haebom

Author

Joris Guerin, Shray Bansal, Amirreza Shaban, Paulo Mann, Harshvardhan Gazula

Outline

This paper highlights the importance of transfer learning, which leverages the pre-trained weights of backbones, for training successful models using limited annotated data. Selecting an appropriate backbone is crucial, especially on small datasets, where final performance heavily relies on the quality of the initial feature representation. While previous research has benchmarked backbones across diverse datasets to identify universally best-performing backbones, this paper demonstrates that the effectiveness of backbones is highly dataset-dependent, particularly in low-data scenarios where no single backbone consistently performs well. To overcome these limitations, this paper proposes a new research direction for dataset-specific backbone selection and investigates its practicality in low-data environments. Because a full evaluation of a large pool of backbones is computationally impractical, this paper formulates Vision Backbone Efficient Selection (VIBES), a problem for searching for high-performance backbones under computational constraints. We define a solution space, propose several heuristics, and conduct experiments on four diverse datasets to demonstrate the feasibility of VIBES for low-data image classification. Our experiments demonstrate that even a simple search strategy can find a suitable backbone from a pool of over 1,300 pre-trained models, outperforming common benchmark recommendations with a search time of less than 10 minutes on a single GPU (NVIDIA RTX A5000).

Takeaways, Limitations

Takeaways: Demonstrates the importance and practicality of dataset-specific backbone selection in low-data image classification problems. We present an efficient backbone search method called VIBES, which finds the optimal backbone while reducing computational costs. Experimentally, we demonstrate that even a simple search strategy can achieve significant performance improvements.
Limitations: The dataset used in this paper may be limited in diversity. Further experiments on diverse datasets and tasks are needed. Further research is needed to evaluate the generalization performance of the proposed heuristic. Scalability evaluation on more complex and larger backbone pools is also needed.
👍