Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

VectorFit: Adaptive Singular & Bias Vector Fine-Tuning of Pre-trained Foundation Models

Created by
  • Haebom

Author

Suhas G Hegde, Shilpy Kaur, Aruna Tiwari

Outline

This paper highlights the problem that existing parameter-efficient fine-tuning (PEFT) methods learn new low-rank or sparse weights in parallel with pre-trained weights ($W$), but learn these weights from scratch, resulting in a performance gap. To address this, we propose a novel parameterization method, VectorFit. VectorFit efficiently leverages existing knowledge inherent in $W$ to adaptively learn singular vectors and biases, thereby generating a high-rank incremental weight matrix $\Delta W$, similar to full fine-tuning. Through experiments on 19 diverse language and vision tasks (including natural language understanding and generation, question answering, image classification, and image generation), we demonstrate that VectorFit achieves superior performance compared to existing PEFT methods with nine times fewer learnable parameters.

Takeaways, Limitations

Takeaways:
We present VectorFit, a new parameterization method that overcomes the performance limitations of existing PEFT methods.
We significantly improve parameter efficiency by leveraging the structural and transformational properties of pre-trained weights.
It outperforms existing methods in a variety of language and vision tasks.
It can maintain high performance even in resource-constrained environments.
Limitations:
The experimental results presented in this paper may be limited to specific datasets and tasks. More diverse and extensive experiments are needed.
Further research is needed to determine whether the performance improvements of VectorFit generalize to all types of models and datasets.
There is a lack of detailed analysis of the computational complexity and memory requirements of VectorFit.
👍