[공지사항]을 빙자한 안부와 근황 
Show more

Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

The Rise of AI Teammates in Software Engineering (SE) 3.0: How Autonomous Coding Agents Are Reshaping Software Engineering

Created by
  • Haebom

Author

Hao Li, Haoxiang Zhang, Ahmed E. Hassan

Outline

This paper introduces AIDev, a large-scale dataset that captures the real-world operation of AI-based autonomous coding agents. It contains over 456,000 pull requests (PRs) generated by 47,000 developers across 61,000 repositories by five major agents: OpenAI Codex, Devin, GitHub Copilot, Cursor, and Claude Code. Unlike previous theoretical studies, AIDev provides structured open data for benchmarking, readiness, optimization, collaborative modeling, and AI governance research of autonomous agents. The dataset contains rich metadata about PRs, authors, review periods, code changes, and integration results, and provides insights such as the tendency for agents to be fast but have low PR adoption rates and low code complexity. AIDev is expected to contribute to the study of AI-based software engineering workflows and the advancement of human-AI collaboration as a scalable, analyzable, and vivid resource for the SE and AI communities.

Takeaways, Limitations

Takeaways:
Provides operational data from real-world AI-based autonomous coding agents, providing a solid foundation for AI-native software engineering research.
We analyze the correlation between agent speed, PR adoption rate, and code complexity to reveal the strengths and weaknesses of AI agents.
It builds a new model of human-AI collaboration and provides important data for AI governance research.
It enables real-data-based research that goes beyond existing synthetic benchmarks.
Limitations:
The type and number of agents included in the dataset may be limited.
The scope of the dataset may be biased towards a specific platform or development method.
Continuous updating of data is needed to capture long-term trends and changes.
Consideration must be given to data bias and ethical issues.
👍