News Score: Score the News, Sort the News, Rewrite the Headlines

Claude 3 beats GPT-4 on Aider’s code editing benchmark

Anthropic just released their new Claude 3 models with evals showing better performance on coding tasks. With that in mind, I’ve been benchmarking the new models using Aider’s code editing benchmark suite. Claude 3 Opus outperforms all of OpenAI’s models, making it the best available model for pair programming with AI.

Aider currently supports Claude 3 Opus via OpenRouter:

    # Install aider
    pip install aider-chat

    # Setup OpenRouter access
    export OPENAI_API_KEY=<your-openrouter-key>
    export OPENAI_A...

Read more at aider.chat
