News Score: Score the News, Sort the News, Rewrite the Headlines

Cerebras beats NVIDIA Blackwell: Llama 4 Maverick Inference

Cerebras Breaks the 2,500 Tokens Per Second Barrier with Llama 4 Maverick 400B

SUNNYVALE, CA – May 28, 2025 -- Last week, Nvidia announced that 8 Blackwell GPUs in a DGX B200 could demonstrate 1,000 tokens per second (TPS) per user on Meta’s Llama 4 Maverick. Today, the same independent benchmark firm Artificial Analysis measured Cerebras at more than 2,500 TPS/user, more than doubling the performance of Nvidia’s flagship solution.

“Cerebras has beaten the Llama 4 Maverick inference speed record se...

Read more at cerebras.ai
