Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Revisiting Clustering of Neural Bandits: Selective Reinitialization for Mitigating Loss of Plasticity

Created by
  • Haebom

Author

Zhiyuan Su, Sunhao Dai, Xiao Zhang

Outline

This paper proposes the Selective Reinitialization (SeRe) framework to address the "plasticity loss" problem of the Clustering of Neural Bandits (CNB) algorithm, a neural network-based extension of the clustering technique (CB) of the bandit algorithm. While CNB improves performance by clustering similar bandits, its fixed neural network parameters struggle to adapt to abnormal environments over time. SeRe mitigates plasticity loss and achieves stable knowledge retention by selectively reinitializing underutilized units using a contribution utility metric. Furthermore, it ensures effective adaptation without unnecessary reinitialization through an adaptive change detection mechanism that adjusts the reinitialization frequency based on the degree of abnormality. Theoretically, SeRe achieves sublinear cumulative regret in interval-normal environments. Experiments on six real-world recommendation datasets demonstrate lower regret, improved adaptability, and robustness compared to the existing CNB algorithm.

Takeaways, Limitations

Takeaways:
We present the SeRe framework to effectively address the plasticity loss problem of the CNB algorithm.
Achieving both stable knowledge retention and improved adaptability through selective reinitialization using contribution utility metrics.
Efficient adaptation to abnormal environments through adaptive change detection mechanisms.
Verification of the superiority of SeRe through theoretical analysis and experimental results.
Contributes to improving the performance of bandit algorithms in dynamic environments such as actual recommendation systems.
Limitations:
Further research is needed on defining and optimizing contribution utility metrics.
There is a need to evaluate the generalization performance of SeRe for various types of abnormalities.
The scalability and computational cost of SeRe for high-dimensional data needs to be analyzed.
Further analysis is needed to determine the impact of experimental dataset features on SeRe's performance.
👍