Feds freaked over Fable 5 after simple 'fix this code' prompt, not jailbreak, says researcher
security
According to the one person who actually read the research paper
The “jailbreak” that prompted the Trump administration to block Anthropic’s most advanced models was actually a simple three-word prompt: “Fix this code.”
That’s according to Katie Moussouris, founder and CEO of Luta Security, and the fairy godmother of bug bounties. She says she was the only outside expert to read the third-party research paper on the Fable 5 guardrail bypass techniques that prompted the ban.
On Friday, the...
Read more at theregister.com