Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

Towards Secure and Explainable Smart Contract Generation with Security-Aware Group Relative Policy Optimization

Created by
  • Haebom

Author

Lei Yu, Jingyuan Zhang, Xin Wang, Jiajia Ma, Li Yang, and Fengjun Zhang

Outline

This paper proposes a novel framework, SmartCoder-R1, based on Qwen2.5-Coder-7B, to address vulnerabilities arising during smart contract generation. SmartCoder-R1 is trained to generate secure and explainable smart contracts through a series of steps: Continual Pretraining (CPT), Long Chain-of-Thought Supervised Fine-Tuning (L-CoT SFT), and Security-Aware Group Relative Policy Optimization (S-GRPO). It outperforms 17 existing models on 756 real-world function benchmarks, and the quality of the generated inferences is also highly evaluated.

Takeaways, Limitations

Takeaways:
A new framework is presented to address security vulnerabilities in smart contract creation.
Utilizing Continual Pre-training (CPT) for model specialization.
Applying Long Chain-of-Thought Supervised Fine-Tuning (L-CoT SFT) to train to mimic human security analysis.
Introducing Security-Aware Group Relative Policy Optimization (S-GRPO) to mitigate vulnerabilities.
Demonstrated superior performance compared to existing models.
High quality evaluation of generated inferences.
Limitations:
The specific Limitations is not specified in the paper.
👍