Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Overcoming label shift with target-aware federated learning

Created by
  • Haebom

Author

Edvin Listo Zec, Adam Breitholtz, Fredrik D. Johansson

Outline

This paper proposes a novel model aggregation method, FedPALS, to address the performance degradation caused by label shift between client and target domains in federated learning. FedPALS leverages label distribution information from a central server to adjust model aggregation based on the target domain, achieving robust generalization across diverse client data with label shift. FedPALS guarantees distortion-free updates under federated stochastic gradient descent (SGD), and extensive experiments on image classification tasks demonstrate its superior performance over existing methods. Specifically, we demonstrate that existing federated learning methods suffer severe performance degradation when client labels are severely insufficient, highlighting the importance of target domain-aware aggregation proposed by FedPALS.

Takeaways, Limitations

Takeaways:
Empirically demonstrating the severity of the label shift problem in federated learning.
Proposal of FedPALS, an effective model aggregation method for solving the label shift problem.
Improved target domain adaptability by utilizing label distribution information from the central server.
Emphasizes the need to consider label sparsity in client data.
A method to overcome the limitations of existing federated learning methods and improve their performance is presented.
Limitations:
Further validation of the proposed method's generalization performance is needed (experiments on various datasets and scenarios are required).
Dependence on the accuracy of label distribution information from the central server (requires analysis of the impact of erroneous label distribution information).
Analysis of computational cost and communication overhead is required.
Further research is needed on the applicability and practicality in real-world application environments.
👍