Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Data Dependency Inference for Industrial Code Generation Based on UML Sequence Diagrams

Created by
  • Haebom

Author

Wenxin Mao, Zhitao Wang, Long Wang, Sirong Chen, Cuiyun Gao, Luyang Cao, Ziming Liu, Qiming Zhang, Jun Zhou, Zhi Jin

Outline

UML2Dep is a step-by-step code generation framework that overcomes the ambiguity of natural language descriptions and satisfies complex system requirements. It uses extended UML sequence diagrams to clearly formalize the complex requirements of service-oriented architectures. These diagrams eliminate linguistic ambiguity by explicitly formalizing the structural relationships and business logic flow of service interactions by integrating decision tables and API specifications. Furthermore, recognizing the importance of data flow, it introduces a dedicated Data Dependency Inference (DDI) task. DDI systematically constructs an explicit data dependency graph prior to code synthesis and is formalized as a constrained mathematical inference task using a novel prompting strategy, leveraging the mathematical strengths of LLMs. Additional static parsing and dependency pruning reduce the contextual complexity and cognitive load associated with complex specifications, thereby improving inference accuracy and efficiency.

Takeaways, Limitations

Takeaways:
Solving the difficulty of code generation due to the ambiguity of natural language through a formal specification based on UML.
Effectively solving data dependency issues in service-oriented architectures through DDI.
Leveraging the mathematical capabilities of LLMs to improve the reliability of data dependency inference.
Improving the inference accuracy and efficiency of LLMs through static parsing and dependency pruning.
Limitations:
Further verification of the scalability and practical applicability of the proposed UML sequence diagram is needed.
The accuracy and efficiency of DDI can be affected by prompting strategies and data characteristics.
Applicability and performance evaluation for complex systems is required.
Research is needed to determine the generalizability of the proposed methodology and its applicability to other architectures.
👍