Claude 3 beats GPT-4 on Aider’s code editing benchmark
Anthropic just released their new Claude 3 models
with evals showing better performance on coding tasks.
With that in mind, I’ve been benchmarking the new models
using Aider’s code editing benchmark suite.
Claude 3 Opus outperforms all of OpenAI’s models,
making it the best available model for pair programming with AI.
Aider currently supports Claude 3 Opus via
OpenRouter:
# Install aider
pip install aider-chat
# Setup OpenRouter access
export OPENAI_API_KEY=<your-openrouter-key>
export OPENAI_A...
Read more at aider.chat