Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

HNote: Extending YNote with Hexadecimal Encoding for Fine-Tuning LLMs in Music Modeling

Created by
  • Haebom

Author

Hung-Ying Chu, Shao-Yu Wei, Guan-Wei Chen, Tzu-Wei Hung, ChengYang Tsai, Yu-Cheng Lin

Outline

To open up new opportunities for symbolic music generation using large-scale language models (LLMs), we propose HNote to address the complexity and structural consistency of existing formats. HNote is a hexadecimal-based notation that extends YNote and encodes pitch and duration within a fixed 32-unit measurement framework, providing direct compatibility with the LLM architecture. We converted 12,300 Gangnam Style folk songs into HNote and fine-tuned LLaMA-3.1(8B) using LoRA. Experimental results show that HNote achieves 82.5% syntactic accuracy and generates stylistically coherent compositions, demonstrating strong symbolic and structural similarity in BLEU and ROUGE evaluations.

Takeaways, Limitations

Takeaways:
Presenting an effective framework for cultural music modeling using LLM.
HNote demonstrates that the LLM can be successfully applied to cultural music composition.
HNote's performance excellence is proven through high syntactic accuracy and BLEU and ROUGE scores.
Limitations:
The specific Limitations is not specified in the paper.
👍