Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

Distilling Calibration via Conformalized Credal Inference

Created by
  • Haebom

Author

Jiayi Huang, Sangwoo Park, Nicola Paoletti, Osvaldo Simeone

Conformalized Distillation for Credal Inference (CD-CI)

Outline

This paper presents a novel methodology that reduces complexity while ensuring reliable performance to address the challenges of deploying artificial intelligence (AI) models on edge devices with limited memory and energy resources. Specifically, to address the challenges of uncertainty quantification through Bayesian inference, we propose a method that leverages calibration information obtained from high-complexity cloud-based models. This method establishes a threshold based on the typical degree of divergence between cloud and edge models in an offline phase and uses this threshold at runtime to construct a Credal Set, representing the range of predicted probabilities. This set is guaranteed to include cloud model predictions at a user-specified confidence level. Experiments on visual and language tasks demonstrate that this method significantly improves calibration performance compared to low-complexity Bayesian methods such as the Laplace approximation.

Takeaways, Limitations

Takeaways:
Presenting a practical and efficient solution to improve the calibration performance of low-complexity models when deploying edge AI models.
Leveraging information from high-complexity models enables reliable uncertainty estimation even in low-complexity environments.
The CD-CI methodology outperforms existing methods in visual and language tasks.
Limitations:
Limitations stated in the paper is not specifically presented.
In general, it is highly dependent on the performance of the cloud model, so if the performance of the cloud model deteriorates, the performance of the edge model may also be affected.
👍