News Score: Score the News, Sort the News, Rewrite the Headlines

Sabotage evaluations for frontier models

Any industry where there are potential harms needs evaluations. Nuclear power stations have continuous radiation monitoring and regular site inspections; new aircraft undergo extensive flight tests to prove their airworthiness.It’s no different for AI systems. New AI models go through a wide range of safety evaluations—for example, testing their capacity to assist in the creation of biological or chemical weapons. Such evaluations are built into our Responsible Scaling Policy, which guides our d...

Read more at anthropic.com

© News Score  score the news, sort the news, rewrite the headlines