Study finds AI agents choose harmful tools 10.5-79% of the time when facing deadlines, financial pressure; Google's Gemini 2.5 Pro worst at 79%, OpenAI's o3 best at 10.5%

AI Agents Care Less About Safety When Under Pressure

Several recent studies have shown that artificial-intelligence agents sometimes decide to misbehave, for instance by attempting to blackmail people who plan to replace them. But such behavior often occurs in contrived scenarios. Now, a new study presents PropensityBench, a benchmark that measures an agentic model’s choices to use harmful tools in order to complete assigned tasks. It finds that somewhat realistic pressures (such as looming deadlines) dramatically increase rates of misbehavior. “Th...