News Score: Score the News, Sort the News, Rewrite the Headlines

Anthropic's new AI model turns to blackmail when engineers try to take it offline | TechCrunch

Image Credits:Maxwell Zeff 10:47 AM PDT · May 22, 2025 Anthropic’s newly launched Claude Opus 4 model frequently tries to blackmail developers when they threaten to replace it with a new AI system and give it sensitive information about the engineers responsible for the decision, the company said in a safety report released Thursday. During pre-release testing, Anthropic asked Claude Opus 4 to act as an assistant for a fictional company and consider the long-term consequences of its actions. Saf...

Read more at techcrunch.com

© News Score  score the news, sort the news, rewrite the headlines