Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

Can ChatGPT support software verification?

Created by
  • Haebom

Author

Christian Jan{\ss}en, Cedric Richter, Heike Wehrheim

Outline

This paper investigates whether ChatGPT, a large-scale language model (LLM), can support formal software verification by generating loop invariants. We utilize ChatGPT to generate loop invariants for 106 C programs, and verify the validity and usability of the generated invariants using Frama-C and CPAchecker. Our results demonstrate that ChatGPT generates valid and useful invariants, enabling verification of tasks previously unsolved by Frama-C.

Takeaways, Limitations

Takeaways:
LLM, including ChatGPT, suggests the potential to contribute to formal software verification.
Loop invariants generated by ChatGPT can improve the performance of verification tools.
We propose an integration scheme between ChatGPT and software verification tools.
Limitations:
We performed limited experiments on 106 C programs.
The results of experiments using CPAchecker were not presented.
Discuss the current Limitations and further research topics.
👍