Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

MaCP: Minimal yet Mighty Adaptation via Hierarchical Cosine Projection

Created by
  • Haebom

Author

Yixian Shen, Qi Bi, Jia-Hong Huang, Hongyi Zhu, Andy D. Pimentel, Anuj Pathania

Outline

MaCP (Minimal yet Mighty Adaptive Cosine Projection) is a novel adaptive method for fine-tuning large-scale base models. It achieves superior performance while using fewer parameters and memory compared to existing methods. Its core idea is to leverage the superior energy compression and de-correlation properties of cosine projection to improve model efficiency and accuracy. It projects the weight changes from low-dimensional adaptation into the discrete cosine space, partitions the weight changes across multiple levels of the discrete cosine spectrum, and selects the most significant frequency components from each partition. It demonstrates effectiveness across a wide range of tasks, including unimodal tasks such as natural language understanding, natural language generation, and text summarization, as well as multimodal tasks such as image classification and video understanding. It offers higher accuracy and significantly lower computational complexity and memory requirements than existing methods.

Takeaways, Limitations

Takeaways:
Presenting the possibility of efficient fine-tuning of large-scale basic models with few parameters and memory.
A novel method is presented that simultaneously improves the accuracy and efficiency of models by utilizing cosine projection.
Excellent performance in both single-mode and multi-mode operations.
Reduced computational complexity and memory requirements compared to existing methods.
Limitations:
The paper does not explicitly mention the specific Limitations. Experimental results for various tasks are presented, but further research is needed to determine whether the method is vulnerable to specific tasks or datasets.
Further validation of MaCP's generalization performance may be needed.
👍