Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Explanations are a meaning to an end

Created by
  • Haebom

Author

Jessica Hullman, Ziyang Guo, Berk Ustun

Outline

This paper points out that existing explainable machine learning (XAI) methodologies focus only on explaining the process of mapping inputs and outputs of models, and lack consideration of how they are actually used. Explanations should be designed and evaluated with specific purposes, and we present a method to formalize these purposes through a framework based on statistical decision theory. We demonstrate how this feature-oriented approach can be applied to a variety of use cases, such as clinical decision support, providing remedies, or debugging, and we use it to characterize the maximum performance gain that an ideal decision maker can obtain on a specific task, thereby avoiding misuse due to ambiguity. Researchers should specify specific use cases and analyze them considering the expected usage models of explanations. Finally, we present an evaluation method that integrates theoretical and empirical perspectives on the value of explanations, and a definition that encompasses these perspectives.

Takeaways, Limitations

Takeaways:
We highlight the importance of a goal-driven approach in evaluating and designing explainable machine learning (XAI).
We present a feature-centric XAI framework based on statistical decision theory.
Provides a practical XAI methodology applicable to a wide range of use cases.
We provide suggestions on how to avoid misuse of descriptions and maximize performance gains.
We present XAI evaluation criteria that integrate theoretical and empirical perspectives.
Limitations:
Further research is needed to determine the practical applicability of the proposed framework.
Generalizability across a variety of use cases needs to be verified.
Further research is needed to determine the objectivity and reliability of the proposed evaluation criteria.
👍