Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

The FIX Benchmark: Extracting Features Interpretable to eXperts

Created by
  • Haebom

Author

Helen Jin, Shreya Havaldar, Chaehyeon Kim, Anton Xue, Weiqiu You, Helen Qu, Marco Gatti, Daniel A Hashimoto, Bhuvnesh Jain, Amin Madani, Masao Sako, Lyle Ungar, Eric Wong

Outline

This paper points out that feature-based methods commonly used to explain model predictions in high-dimensional data implicitly assume the availability of interpretable features. However, in high-dimensional data, it is often difficult even for experts to mathematically specify important features. To address this problem, this paper presents FIX (Features Interpretable to eXperts), a benchmark that measures how well features align with expert knowledge. In collaboration with experts from diverse fields such as cosmology, psychology, and medicine, we propose FIXScore, a unified expert alignment measure applicable to a variety of real-world settings, including vision, language, and time-series data modalities. Using FIXScore, we find that popular feature-based explanation methods lack alignment with expert-specified knowledge, highlighting the need for new methods to better identify features that are interpretable to experts.

Takeaways, Limitations

Takeaways: We clearly present the limitations of feature-based explanation methods in high-dimensional data, and propose a new benchmark, FIX, and a measurement index, FIXScore, to measure the alignment with expert knowledge, thereby suggesting future research directions. We emphasize the need to develop new interpretable feature extraction methods that take expert knowledge into account.
Limitations: The number and diversity of experts involved in the development of FIXScore may be insufficient. Further validation is needed to determine whether FIXScore can be generalized to all types of expert knowledge and data modalities. Further research is needed to determine how much the currently presented method can actually contribute to the development of new interpretable feature extraction methods.
👍