Generalist Software Agents to Solve Soware Engineering Tasks
-
Updated
Dec 10, 2024 - Python
Generalist Software Agents to Solve Soware Engineering Tasks
Repo-level benchmark for real-world Code Agents: from repo understanding → env setup → incremental dev/bug-fixing → task delivery, with cost-aware α metric.
Gemini CLI for repo-level operations
Add a description, image, and links to the repo-level topic page so that developers can more easily learn about it.
To associate your repository with the repo-level topic, visit your repo's landing page and select "manage topics."