[공지사항]을 빙자한 안부와 근황 
Show more

Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

SemiOccam: A Robust Semi-Supervised Image Recognition Network Using Sparse Labels

Created by
  • Haebom

Author

Rui Yann, Tianshuo Zhang, Xianglei Xing

Outline

SemiOccam is an efficient semi-supervised learning-based image recognition network using limited label data. To overcome the limitations of the complex structure and training process of existing methods, we build a hierarchical mixed density classification mechanism that optimizes the mutual information between feature representations and target classes, thereby removing unnecessary information and retaining important discriminant elements. Experimental results show that it achieves state-of-the-art performance on three common datasets, and achieves over 95% accuracy on two datasets using only four labeled samples per class. The simple structure allows the training time to be shortened to a few minutes. In addition, we uncovered the data leakage problem of the STL-10 dataset, which was overlooked in previous studies, and removed redundant data to obtain reliable experimental results, and released the refined CleanSTL-10 dataset.

Takeaways, Limitations

Takeaways:
We present an efficient semi-supervised learning method using limited label data.
Much faster training time (on the order of minutes) compared to existing methods.
Achieves state-of-the-art performance on three datasets (>95% accuracy on two datasets).
Discover and resolve data leakage issues in the STL-10 dataset, and improve research reproducibility by releasing the CleanSTL-10 dataset.
Limitations:
Further research is needed on the generalization performance of the proposed method.
Additional experiments on different datasets and tasks are needed.
Further comparative performance analysis on datasets other than the CleanSTL-10 dataset may be needed.
👍