DeepSeek v3.1 Is Not Having a Moment
What if DeepSeek released a model claiming 66 on SWE and almost no one tried using it? Would it be any good? Would you be able to tell? Or would we get the shortest post of the year?
Why We Haven’t Seen v4 or r2
Why are we settling for v3.1 and have yet to see DeepSeek release v4 or r2 yet?
Eleanor Olcott and Zijing Wu: Chinese artificial intelligence company DeepSeek delayed the release of its new model after failing to train it using Huawei’s chips, highlighting the limits of Beijing’s push to...
Read more at thezvi.wordpress.com