GPT-4o takes #1 & #2 on the Aider LLM leaderboards
Aider works best with LLMs which are good at editing code, not just good at writing
code.
To evaluate an LLM’s editing skill, aider uses a pair of benchmarks that
assess a model’s ability to consistently follow the system prompt
to successfully edit code.
The leaderboards below report the results from a number of popular LLMs.
While aider can connect to almost any LLM,
it works best with models that score well on the benchmarks.
GPT-4o
GPT-4o tops the aider LLM code editing leaderboard at 72.9%,...
Read more at aider.chat