ai-newsbits


#AI #LLM

Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch

• Language models (LMs) can acquire new capabilities by absorbing the parameters of homologous models, without retraining or any GPU-based work
🔗 https://news.ycombinator.com/item?id=39952826
🔗 https://arxiv.org/abs/2311.03099
Language Models are Super Mario: Absorbing Abilities from...
In this paper, we unveil that Language Models (LMs) can acquire new capabilities by assimilating parameters from homologous models without retraining or GPUs. We first introduce DARE to set most...
arxiv.org
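
The abstract preview above cuts off just as it introduces DARE. As a rough illustration only, here is a minimal sketch of the drop-and-rescale idea as it is commonly described: take the delta between a fine-tuned model and its base, randomly zero out a fraction of those deltas, and rescale the survivors by 1 / (1 - drop rate). The function name `dare`, the flat lists of floats standing in for model weights, and the `drop_rate` parameter are all assumptions made for this example, not the paper's code or any library's API.

```python
import random


def dare(base, finetuned, drop_rate, rng):
    """Sketch of drop-and-rescale on delta parameters (finetuned - base).

    base, finetuned: equal-length lists of floats (hypothetical stand-ins
    for real weight tensors). drop_rate: fraction of deltas to zero out.
    rng: a random.Random instance, passed in so results are reproducible.
    """
    if not 0.0 <= drop_rate < 1.0:
        raise ValueError("drop_rate must be in [0, 1)")
    # Surviving deltas are scaled up so their expected value is unchanged.
    scale = 1.0 / (1.0 - drop_rate)
    result = []
    for b, f in zip(base, finetuned):
        delta = f - b
        keep = rng.random() >= drop_rate
        result.append(b + (delta * scale if keep else 0.0))
    return result


if __name__ == "__main__":
    base = [1.0, 2.0, 3.0, 4.0]
    finetuned = [1.5, 2.5, 2.0, 4.0]
    # Each output entry is either the base value (delta dropped)
    # or base + 2 * delta (delta kept and rescaled at drop_rate=0.5).
    print(dare(base, finetuned, 0.5, random.Random(0)))
```

With `drop_rate=0.0` the function returns the fine-tuned weights unchanged. Merging several fine-tuned models would then combine their sparsified deltas on top of the shared base; how that combination is weighted is left out of this sketch.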