
#AI #LLM

Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch

• Language models (LMs) can acquire new capabilities by absorbing the parameters of homologous models, with no retraining and no GPU work.

🔗 https://news.ycombinator.com/item?id=39952826

🔗 https://arxiv.org/abs/2311.03099


In this paper, we unveil that Language Models (LMs) can acquire new capabilities by assimilating parameters from homologous models without retraining or GPUs. We first introduce DARE to set most...
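
To make the idea concrete, here is a minimal sketch of how DARE works as described in the linked arXiv paper: take the delta between a fine-tuned model and its shared base, randomly zero out most of that delta, rescale the survivors by 1 / (1 - drop rate), then add the sparse deltas from several models back onto the base. The function names `dare` and `merge`, the flat parameter lists, the 0.9 drop rate, and the plain additive merge are all illustrative assumptions, not the authors' code.

```python
import random


def dare(base, finetuned, drop_rate, rng):
    """Drop-and-rescale sketch (hypothetical helper, not the paper's code).

    Computes delta = finetuned - base, zeroes each delta entry with
    probability `drop_rate`, and rescales the kept entries by
    1 / (1 - drop_rate) so the expected delta is unchanged.
    """
    if not 0.0 <= drop_rate < 1.0:
        raise ValueError("drop_rate must be in [0, 1)")
    scale = 1.0 / (1.0 - drop_rate)
    sparse_delta = []
    for b, f in zip(base, finetuned):
        delta = f - b
        keep = rng.random() >= drop_rate
        sparse_delta.append(delta * scale if keep else 0.0)
    return sparse_delta


def merge(base, deltas_per_model, weight=1.0):
    """Add each model's sparse delta onto the shared base parameters.

    Assumption: a simple weighted sum, one of several possible merge rules.
    """
    merged = list(base)
    for deltas in deltas_per_model:
        for i, d in enumerate(deltas):
            merged[i] += weight * d
    return merged


# Toy stand-ins for two models fine-tuned from the same base.
rng = random.Random(0)
base = [0.0] * 1000
math_model = [0.01] * 1000
code_model = [-0.02] * 1000

math_delta = dare(base, math_model, 0.9, rng)
code_delta = dare(base, code_model, 0.9, rng)
merged = merge(base, [math_delta, code_delta])
```

Everything here is elementwise arithmetic on parameters, which is why this kind of merge needs neither retraining nor a GPU. Real checkpoints would use tensors per layer rather than flat lists, but the per-parameter logic is the same.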

