News Score: Score the News, Sort the News, Rewrite the Headlines

deepseek-ai/DeepSeek-V3-0324

Chinese AI lab DeepSeek just released the latest version of their enormous DeepSeek v3 model, baking the release date into the name DeepSeek-V3-0324. The license is MIT (that's new: the previous DeepSeek v3 had a custom license), the README is empty, and the release adds up to a total of 641 GB of files, mostly of the form model-00035-of-000163.safetensors. The model only came out a few hours ago and MLX developer Awni Hannun already has it running at >20 tokens/secon...

Read more at simonwillison.net
