Google's AI Agent 'Passerine' Fixes 73% of Machine-Reported Bugs, Outperforming on Human-Reported Issues in Enterprise Test

Evaluating Agent-based Program Repair at Google

View PDF HTML (experimental) Abstract:Agent-based program repair offers to automatically resolve complex bugs end-to-end by combining the planning, tool use, and code generation abilities of modern LLMs. Recent work has explored the use of agent-based repair approaches on the popular open-source SWE-Bench, a collection of bugs from highly-rated GitHub Python projects. In addition, various agentic approaches such as SWE-Agent have been proposed to solve bugs in this benchmark. This paper explores...