Chinese AI start-up DeepSeek unveils new reasoning method for LLMs; next-gen model R2 anticipated

DeepSeek unveils new AI reasoning method amid anticipation for R2 model

Chinese artificial intelligence (AI) start-up DeepSeek has introduced a novel approach to improving the reasoning capabilities of large language models (LLMs), as the public awaits the release of the company’s next-generation model.

In collaboration with researchers from Tsinghua University, DeepSeek developed a technique that combines methods referred to as generative reward modelling (GRM) and self-principled critique tuning, according to a paper published on Friday. The dual approach aims to e...