News Score: Score the News, Sort the News, Rewrite the Headlines

Adversarial Policies Beat Superhuman Go AIs

Authors: Tony T. Wang, Adam Gleave, Tom Tseng, Kellin Pelrine, Nora Belrose, Joseph Miller, Michael D. Dennis, Yawen Duan, Viktor Pogrebniak, Sergey Levine, Stuart Russell

Abstract: We attack the state-of-the-art Go-playing AI system KataGo by training adversarial policies against it, achieving a >97% win rate against KataGo running at superhuman settings. Our adversaries do not win by playing Go well. Instead, they trick KataGo into making serious blunders. Our attack transfers zero-shot...

Read more at arxiv.org
