News Score: Score the News, Sort the News, Rewrite the Headlines

What is currently the best LLM for consumer-grade hardware? Is it phi-4?

> DeepSeek-R1-0528-Qwen3-8B https://huggingface.co/deepseek-ai/DeepSeek-R1-0528-Qwen3-8B ... Released today; probably the best reasoning model in 8B size. ... we distilled the chain-of-thought from DeepSeek-R1-0528 to post-train Qwen3-8B Base, obtaining DeepSeek-R1-0528-Qwen3-8B. This model achieves state-of-the-art performance ... on AIME 2024, surpassing Qwen3-8B by +10.0% & matching the performance of Qwen3-235B-thinking. Wild how effective distillation is turning out to be. No wonder, most ...
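
The quoted model card describes distilling a teacher's chain-of-thought into a smaller student through post-training. As a rough illustration only, the data-preparation side of that idea could look like the sketch below: teacher traces are filtered and turned into prompt/completion pairs for supervised fine-tuning of the student. Everything here is an assumption made for illustration, not DeepSeek's actual pipeline: the `TeacherTrace` type, the `to_sft_example` and `build_dataset` helpers, the `<think>` wrapper format, and the correctness filter are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Iterable


@dataclass
class TeacherTrace:
    """One sample generated by the teacher model (hypothetical schema)."""
    question: str
    reasoning: str  # the teacher's chain-of-thought
    answer: str     # the teacher's final answer


def to_sft_example(trace: TeacherTrace) -> dict:
    """Format a teacher trace as a prompt/completion pair for the student.

    The <think> wrapper is an assumed convention for marking the reasoning.
    """
    completion = f"<think>\n{trace.reasoning}\n</think>\n{trace.answer}"
    return {"prompt": trace.question, "completion": completion}


def build_dataset(
    traces: Iterable[TeacherTrace],
    is_correct: Callable[[TeacherTrace], bool],
) -> list[dict]:
    """Keep only traces that pass the check, then format them for training."""
    return [to_sft_example(t) for t in traces if is_correct(t)]
```

The student is then fine-tuned to reproduce each `completion` given its `prompt`, so it learns to imitate the teacher's reasoning as well as its final answers. Filtering out traces with wrong answers is a design choice assumed here; the source does not say how the training data was selected.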

Read more at news.ycombinator.com
