Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

MetaLLMix: An XAI Aided LLM-Meta-learning Based Approach for Hyper-parameters Optimization

Created by
  • Haebom

Author

Mohammed Tiouti, Mohamed Bal-Ghaoui

Outline

MetaLLMiX is a zero-shot hyperparameter optimization framework that combines meta-learning, explainable AI, and efficient LLM inference. To address the trial-and-error and high-cost API challenges of existing AutoML and LLM-based approaches, it leverages SHAP explanations to recommend optimal hyperparameters and pre-trained models based on past experimental results without additional trials. LLMs are used as judges to control output format, accuracy, and completeness. In experiments using eight medical image datasets and nine open-source lightweight LLMs, it achieves competitive or superior performance to existing HPO methods while significantly reducing computational costs. It outperforms existing API-based approaches, achieving optimal results on five out of eight tasks, reducing response times by 99.6-99.9%, and achieving the fastest training times (2.4-15.7x) on six datasets. Accuracy remains within 1-5% of the best-performing benchmark model.

Takeaways, Limitations

Takeaways:
Zero-shot hyperparameter optimization overcomes the trial-and-error and high costs of AutoML and LLM-based approaches.
We increased explainability by leveraging SHAP descriptions.
We achieved superior or competitive performance over existing HPO methods in a shorter time.
Local deployment has been shown to perform more efficiently than API-based approaches.
Limitations:
Generalizability validation is required for the eight proposed medical image datasets and nine open-source lightweight LLMs.
Further experiments with different types of datasets and models are needed.
Further research may be needed to interpret the SHAP description.
👍