Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Addressing Concept Mislabeling in Concept Bottleneck Models Through Preference Optimization

Created by
  • Haebom

Author

Emiliano Penaloza, Tianyue H. Zhang, Laurent Charlin, Mateo Espinosa Zarlenga

Outline

Concept Bottleneck Models (CBMs) are proposed to increase the reliability of AI systems by restricting their decisions to a set of concepts that humans can understand. However, CBMs typically assume that datasets contain accurate concept labels, a practice often violated, leading to significant performance degradation (up to 25% in some cases). In this paper, we propose a novel loss function, the Concept Preference Optimization (CPO) objective, that effectively mitigates the negative impact of concept mislabeling. We analyze the key characteristics of the CPO objective and demonstrate that it directly optimizes the posterior distribution of concepts. Compared to Binary Cross Entropy (BCE), we demonstrate that CPO is inherently less sensitive to concept noise. Experimentally, we demonstrate that CPO consistently outperforms BCE on three real-world datasets, both with and without additional label noise. The code is available on GitHub.

Takeaways, Limitations

Takeaways: To improve the reliability of concept bottleneck models, we propose CPO, a new loss function that addresses concept misrepresentation problems, and experimentally validate its effectiveness. CPO demonstrates lower sensitivity to concept noise and performs better on real-world datasets.
Limitations: The type and scope of the dataset used in this study may be limited. Further research on diverse dataset types and concepts is needed. Furthermore, further verification is needed to determine whether CPO's performance improvements are consistent across all situations.
👍