Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Lemmanaid: Neuro-Symbolic Lemma Conjecturing

Created by
  • Haebom

Author

Yousef Alhessi, S olr un Halla Einarsd ottir, George Granberry, Emily First, Moa Johansson, Sorin Lerner, Nicholas Smallbone

Outline

In this paper, we present a novel and interesting method for automatically guessing lemmas, in order to improve the performance of automatic lemmas and lower the entry barrier for formalizing mathematics in proof aids. To overcome the limitations of existing neural networks and symbolic approaches, we propose a novel neural symbolic lemma guesser, Lemmanaid, which combines large-scale language models (LLMs) and symbolic methods. Lemmanaid generates templates describing the form of lemmas using LLMs and fills in the details using symbolic methods. Using the proof library of the Isabelle proof aid, we compare Lemmanaid with LLMs trained to generate complete lemma sentences and existing fully symbolic guessing methods. As a result, Lemmanaid outperforms existing methods on the test set of Isabelle’s HOL library and formal proof archive, and discovers 29-39.5% of the gold standard lemmas written by humans, which is 8-15% more lemmas than pure neural network methods. This study promotes computer-aided theory development and formalization by combining the strengths of neural networks and symbolic methods to generate useful auxiliary theorems for a variety of input domains.

Takeaways, Limitations

Takeaways:
We show that a neurosymbolic approach combining LLM and symbolic methods can generate more useful auxiliary theorems than conventional methods.
To present a practical tool that can contribute to the development and formalization of computer-aided theory.
We present a method for generating auxiliary theorems that works effectively across a variety of input domains.
Limitations:
Not all auxiliary theorems written by humans have been discovered yet (29-39.5% discovered).
Further research is needed to improve the performance of Lemmanaid.
The evaluation results are limited to a specific proof assistant tool (Isabelle). Further research is needed on the generalizability to other tools and mathematical fields.
👍