DeepSeek Debuts ‘Sparse Attention’ Method in Next-Gen AI Model
DeepSeek updated an experimental AI model Monday in what it called a step toward next-generation artificial intelligence. The secretive Chinese startup outlined the DeepSeek-V3.2-Exp platform, explaining that it uses a new technique it calls DeepSeek Sparse Attention, or DSA, according to a post on its Hugging Face page. The latest version marked “an intermediate step toward our next-generation architecture,” the Hangzhou-based startup said, also indicating it was working with Chinese chipmakers on th...
Read more at bloomberg.com