Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Probabilistic Stability Guarantees for Feature Attributions

Created by
  • Haebom

Author

Helen Jin, Anton Xue, Weiqiu You, Surbhi Goel, Eric Wong

Outline

This paper presents stability guarantees as a principled method for evaluating feature attribution. It points out the limitations of existing authentication methods, which use overly smoothed classifiers and provide only conservative guarantees. To address this, we introduce "soft stability" and propose a simple, model-independent, and sample-efficient stability authentication algorithm (SCA) that provides non-obvious and interpretable guarantees for all attribution methods. Furthermore, we demonstrate that mild smoothing offers a better trade-off between accuracy and stability, avoiding the excessive trade-offs inherent in existing authentication methods. To account for this phenomenon, we derive novel features for stability under smoothing using Boolean functional analysis. We demonstrate the effectiveness of soft stability in measuring the robustness of explanation methods by evaluating SCA on vision and language tasks.

Takeaways, Limitations

Takeaways:
By introducing the concept of soft stability, we overcome the limitations of existing stability guarantees and enable more practical stability certification.
We propose a model-independent and sample-efficient stability authentication algorithm (SCA) to enhance its applicability to various attribution methods.
We present a method to find the optimal balance between accuracy and stability through mild smoothing.
Boolean function analysis provides new insights into stability under smoothing.
Validating the effectiveness of SCA through vision and language tasks.
Limitations:
Further analysis of the computational complexity and scalability of the proposed SCA algorithm is needed.
Further research may be needed to define and measure soft stability.
More extensive experiments with different types of models and datasets are needed.
👍