Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Auto-Formulating Dynamic Programming Problems with Large Language Models

Created by
  • Haebom

Author

Chenyu Zhou, Jingyuan Yang, Linwei Xin, Yitian Chen, Ziyan He, Dongdong Ge

Outline

This paper presents a novel methodology based on large-scale language models (LLMs) for automating dynamic programming (DP) modeling. Traditional DP modeling requires expert knowledge, but LLMs have the potential to automate this process. However, the probabilistic nature of DP problems and limited training data make direct application of traditional LLM-based models difficult. Therefore, in this paper, we introduce DP-Bench, a benchmark that covers a variety of DP problems, and present DPLM, a specialized model with 7 billion parameters. DPLM extends the training data from limited initial examples by utilizing DualReflect, a synthetic data generation pipeline. DualReflect combines forward generation for diversity and backward generation for reliability, and shows that backward generation is more effective in low-data environments and forward generation is more effective in large-data environments. DPLM achieves performance comparable to state-of-the-art LLMs such as OpenAI's o1 and DeepSeek-R1, and outperforms them on difficult problems.

Takeaways, Limitations

Takeaways:
Presenting the possibility of automating dynamic programming modeling using LLM.
Proposing an effective synthetic data generation technique (DualReflect) in a limited data environment.
Revealing the complementary strengths of forward and reverse data generation.
Achieving state-of-the-art performance with DPLM, a specialized LLM with 7 billion parameters.
Limitations:
The scope of the DP-Bench benchmark is limited to textbook-level problems.
The generalization performance of DPLM to real-world problems requires further study.
The effectiveness of DualReflect may vary depending on the nature of the problem.
Resource consumption problem of 7 billion parameter model.
👍