News Score: Score the News, Sort the News, Rewrite the Headlines

Evaluating Agent-based Program Repair at Google

View PDF HTML (experimental) Abstract:Agent-based program repair offers to automatically resolve complex bugs end-to-end by combining the planning, tool use, and code generation abilities of modern LLMs. Recent work has explored the use of agent-based repair approaches on the popular open-source SWE-Bench, a collection of bugs from highly-rated GitHub Python projects. In addition, various agentic approaches such as SWE-Agent have been proposed to solve bugs in this benchmark. This paper explores...

Read more at arxiv.org

© News Score  score the news, sort the news, rewrite the headlines