Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

AI Agentic Vulnerability Injection And Transformation with Optimized Reasoning

Created by
  • Haebom

Author

Amine Lbath, Massih-Reza Amini, Aurelien Delaitre, Vadim Okun

Outline

This paper emphasizes the importance of automated vulnerability detection and repair systems and points out the limitations of existing static program analysis methods. To overcome the scalability and adaptability issues of existing methods, as well as their high false positive and false negative rates, we propose an AI approach based on machine learning and deep learning. However, AI-based approaches suffer from a significant dependence on the quality and quantity of training data. Therefore, we present a novel framework for generating datasets by automatically introducing realistic, categorical vulnerabilities into secure C/C++ codebases. We coordinate multiple AI agents, functional agents, and existing code analysis tools that simulate expert reasoning. We leverage Retrieval-Augmented Generation to establish a contextual foundation, and perform efficient model fine-tuning using low-rank approximation. Experimental results on 116 code samples across three benchmarks demonstrate that our proposed approach successfully injects vulnerabilities at the function level with a success rate of 89% to 95%, outperforming competing techniques in terms of dataset accuracy.

Takeaways, Limitations

Takeaways:
Presenting an effective framework for automatically injecting realistic vulnerabilities into secure C/C++ codebases.
Attempting to overcome the limitations of existing methods through an AI agent-based approach.
Efficient model learning using retrieval-augmented generation and low-rank approximation.
Successful vulnerability injection with high accuracy (89% to 95%)
Limitations:
Using experimental data of limited scale (116 code samples)
Further research is needed to determine generalizability across different types of C/C++ code and vulnerabilities.
Need to verify applicability to complex software systems in the real world
👍