Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Measuring Leakage in Concept-Based Methods: An Information Theoretic Approach

Created by
  • Haebom

Authors

Mikael Makonnen, Moritz Vandenhirtz, Sonia Laguna, Julia E Vogt

Outline

Concept Bottleneck Models (CBMs) aim to enhance interpretability by structuring predictions around human-understandable concepts. However, unintended information leakage, in which predictive signals bypass the concept bottleneck, undermines this transparency. In this paper, we present an information-theoretic measure that quantifies leakage in CBMs by capturing the extent to which concept embeddings encode unintended information beyond the specified concepts. We validate the measure in controlled synthetic experiments and show that it detects leakage trends across a variety of configurations. We find that feature and concept dimensionality significantly influence leakage, and that the choice of classifier affects measurement stability, with XGBoost emerging as the most stable estimator. Our preliminary investigations also show that the measure behaves as expected when applied to soft-joint CBMs, suggesting that leakage quantification remains reliable beyond fully synthetic environments. Although this study rigorously evaluates the measure in controlled synthetic experiments, extending it to real-world datasets is left to future work.
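To make "information beyond the given concepts" concrete, here is a minimal toy sketch. It is not the paper's measure, whose exact definition this summary does not give. As an assumption for illustration, it treats leakage as the conditional mutual information I(Y; Z | C) = H(Y | C) - H(Y | C, Z) between a label Y and an embedding Z, given a concept C. It estimates the entropies by simple counting on discrete data, whereas the summary indicates that the paper uses classifiers such as XGBoost as estimators. All function and variable names are hypothetical.

```python
import math
import random
from collections import Counter


def conditional_entropy(targets, conditions):
    """Plug-in estimate of H(target | condition) in bits from paired samples."""
    n = len(targets)
    joint = Counter(zip(conditions, targets))
    marginal = Counter(conditions)
    return -sum(
        count / n * math.log2(count / marginal[cond])
        for (cond, _), count in joint.items()
    )


def leakage(labels, concepts, embeddings):
    """Illustrative leakage score: H(Y | C) - H(Y | C, Z), in bits."""
    h_given_concepts = conditional_entropy(labels, concepts)
    h_given_both = conditional_entropy(labels, list(zip(concepts, embeddings)))
    return h_given_concepts - h_given_both


# Toy data (assumed setup): the label depends on an annotated concept
# and on a hidden feature that no concept describes.
rng = random.Random(0)
n = 20000
concepts = [rng.randint(0, 1) for _ in range(n)]
hidden = [rng.randint(0, 1) for _ in range(n)]
labels = [c ^ h for c, h in zip(concepts, hidden)]

clean_embedding = concepts                     # encodes the concept only
leaky_embedding = list(zip(concepts, hidden))  # also encodes the hidden feature

clean_leak = leakage(labels, concepts, clean_embedding)
leaky_leak = leakage(labels, concepts, leaky_embedding)
print(f"clean embedding leakage: {clean_leak:.3f} bits")
print(f"leaky embedding leakage: {leaky_leak:.3f} bits")
```

In this toy setup the clean embedding yields a score of approximately zero, while the leaky embedding yields a score close to one bit, because it lets the label be recovered from information the concept alone does not carry. Counting works only for small discrete spaces. For continuous or high-dimensional embeddings, the entropies would need a learned estimator.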

Takeaways, Limitations

Takeaways: We present a new information-theoretic measure that quantifies information leakage in CBMs and validate it on synthetic datasets. We show that feature and concept dimensionality significantly influence leakage, and that the choice of classifier affects the stability of the measurement, with XGBoost being the most stable estimator. Initial results also indicate that the measure behaves as expected on soft-joint CBMs.
Limitations: The study is limited to controlled synthetic experiments, and application to real-world datasets remains a task for future research.