Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Sum-of-Parts: Self-Attributing Neural Networks with End-to-End Learning of Feature Groups

Created by
  • Haebom

Author

Weiqiu You, Helen Qu, Marco Gatti, Bhuvnesh Jain, Eric Wong

Outline

This paper addresses the issue that self-attributing neural networks (SANNs), which are presented as a potential path for interpretable models for high-dimensional problems, often face significant trade-offs in terms of poor performance. This paper formally proves a lower bound on the error of feature-wise SANNs, while group-based SANNs can achieve zero error and thus high performance. Based on this insight, in this paper, we propose a Sum-of-Parts (SOP) framework that transforms any differentiable model into a group-based SANN that learns feature groups end-to-end without group supervision. SOP achieves state-of-the-art performance for SANNs on vision and language tasks, and verifies that the groups are interpretable for a variety of quantitative and semantic metrics. We also demonstrate the utility of SOP explanations in model debugging and cosmological scientific discovery. The code is available at https://github.com/BrachioLab/sop .

Takeaways, Limitations

Takeaways:
We mathematically prove that group-based SANN can achieve higher performance than feature-wise SANN.
We present a SOP framework that can transform any differentiable model into a group-based SANN.
SANN achieves state-of-the-art performance on vision and language tasks.
Validation of the interpretability of the learned group using quantitative and semantic indicators.
Validation of the usefulness of SOP descriptions for model debugging and scientific discovery (cosmology).
Limitations:
Further research is needed on the general applicability of the SOP framework.
Further research is needed to determine optimal group size or group composition for specific problems.
Consideration should be given to the limitations of quantitative and semantic indicators used to assess interpretability.
👍