Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

MaCP: Minimal yet Mighty Adaptation via Hierarchical Cosine Projection

Created by
  • Haebom

Author

Yixian Shen, Qi Bi, Jia-Hong Huang, Hongyi Zhu, Andy D. Pimentel, Anuj Pathania

Outline

This paper presents MaCP (Minimal yet Mighty Adaptive Cosine Projection), a novel adaptive method for fine-tuning large-scale base models. MaCP aims to achieve superior performance while using minimal parameters and memory. It is based on the idea of leveraging the superior energy compression and de-correlation properties of cosine projection to improve both model efficiency and accuracy. Specifically, we project the weight changes from low-dimensional adaptation into the discrete cosine space, partition the weight changes across different levels of the discrete cosine spectrum, and then select the most significant frequency components from each partition. Through experiments on a wide range of unimodal tasks (e.g., natural language understanding, natural language generation, text summarization) and multimodal tasks (e.g., image classification, video understanding), we demonstrate that MaCP consistently delivers superior accuracy, significantly reduced computational complexity, and lower memory requirements compared to existing alternatives.

Takeaways, Limitations

Takeaways:
Presenting the possibility of efficient fine-tuning of large-scale base models with minimal parameters and memory.
Improving model efficiency and accuracy simultaneously by leveraging cosine projection.
Proven performance in a variety of single and multi-mode tasks
Improved accuracy, reduced computational complexity, and memory requirements compared to existing methods.
Limitations:
The paper lacks specific references to Limitations or limitations.
Further research is needed on generalization performance for specific tasks or datasets.
A more in-depth analysis of the optimization possibilities and limitations of MaCP's cosine projection method is needed.
👍