[공지사항]을 빙자한 안부와 근황 
Show more

Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

How to Leverage Predictive Uncertainty Estimates for Reducing Catastrophic Forgetting in Online Continual Learning

Created by
  • Haebom

Author

Giuseppe Serra, Ben Werner, Florian Buettner

Outline

This paper addresses the problem of real-world machine learning applications that deal with abnormal data distributions and require long-term unsupervised learning. In particular, we focus on the catastrophic forgetting (CF) problem that occurs in online learning environments. CF is a phenomenon in which a model focuses on recent tasks and its prediction performance on previous tasks deteriorates. Existing solutions use a fixed-size memory buffer to store previous samples and reuse them when learning new tasks, but there is a lack of clear guidance on how to effectively utilize prediction uncertainty information in memory management and on the strategy for filling the memory. Based on the intuition that prediction uncertainty represents the location of a sample in the decision space, this paper deeply analyzes various uncertainty estimation and memory filling strategies. We understand the characteristics of data points that are effective in alleviating CF, propose a prediction uncertainty estimation method using the generalized variance induced by the negative log-likelihood, and experimentally demonstrate the effectiveness of prediction uncertainty measures in reducing CF in various environments.

Takeaways, Limitations

Takeaways:
Presentation of effective design and utilization method of memory management strategy using prediction uncertainty information
Provides a deeper understanding of data point features for CF mitigation
Proposing a new prediction uncertainty estimation method based on generalized variance
Verification of CF reduction effect through prediction uncertainty measurement in various experimental environments
Limitations:
Additional validation of the generalization performance of the proposed method is needed.
Need to evaluate versatility for different types of data and tasks
Further research is needed on optimizing memory buffer size.
Possibility of analysis being biased towards estimating certain types of uncertainty
👍