"GPT-4o Dominates Aider’s LLM Code Editing and Refactoring Leaderboards, Outperforming Opus and Turbo Models"

GPT-4o takes #1 & #2 on the Aider LLM leaderboards

Aider works best with LLMs which are good at editing code, not just good at writing code. To evaluate an LLM’s editing skill, aider uses a pair of benchmarks that assess a model’s ability to consistently follow the system prompt to successfully edit code. The leaderboards below report the results from a number of popular LLMs. While aider can connect to almost any LLM, it works best with models that score well on the benchmarks. GPT-4o GPT-4o tops the aider LLM code editing leaderboard at 72.9%,...