"Anthropic's Claude 3 Opus Surpasses OpenAI's GPT-4 In Aider's Code Editing Benchmark, Becomes Top AI for Pair Programming"

Claude 3 beats GPT-4 on Aider’s code editing benchmark

Anthropic just released their new Claude 3 models with evals showing better performance on coding tasks. With that in mind, I’ve been benchmarking the new models using Aider’s code editing benchmark suite. Claude 3 Opus outperforms all of OpenAI’s models, making it the best available model for pair programming with AI.

Aider currently supports Claude 3 Opus via OpenRouter:

    # Install aider
    pip install aider-chat

    # Setup OpenRouter access
    export OPENAI_API_KEY=<your-openrouter-key>
    export OPENAI_A...