Cerebras beats NVIDIA Blackwell: Llama 4 Maverick Inference
Cerebras Breaks the 2,500 Tokens Per Second Barrier with Llama 4 Maverick 400B

SUNNYVALE, CA – May 28, 2025 -- Last week, Nvidia announced that 8 Blackwell GPUs in a DGX B200 could demonstrate 1,000 tokens per second (TPS) per user on Meta’s Llama 4 Maverick. Today, the same independent benchmark firm Artificial Analysis measured Cerebras at more than 2,500 TPS/user, more than doubling the performance of Nvidia’s flagship solution.

“Cerebras has beaten the Llama 4 Maverick inference speed record se...
Read more at cerebras.ai