This is a page that curates AI-related papers published worldwide. All content here is summarized using Google Gemini and operated on a non-profit basis. Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.
Sum-of-Parts: Self-Attributing Neural Networks with End-to-End Learning of Feature Groups
Created by
Haebom
Author
Weiqiu You, Helen Qu, Marco Gatti, Bhuvnesh Jain, Eric Wong
Outline
This paper addresses the issue that self-attributing neural networks (SANNs), which are presented as a potential path for interpretable models for high-dimensional problems, often face significant trade-offs in terms of poor performance. This paper formally proves a lower bound on the error of feature-wise SANNs, while group-based SANNs can achieve zero error and thus high performance. Based on this insight, in this paper, we propose a Sum-of-Parts (SOP) framework that transforms any differentiable model into a group-based SANN that learns feature groups end-to-end without group supervision. SOP achieves state-of-the-art performance for SANNs on vision and language tasks, and verifies that the groups are interpretable for a variety of quantitative and semantic metrics. We also demonstrate the utility of SOP explanations in model debugging and cosmological scientific discovery. The code is available at https://github.com/BrachioLab/sop .