Researchers Develop 'Coconut' System: LLMs Reason in Continuous Space, Outperforming Chain-of-Thought in Complex Tasks

Training Large Language Models to Reason in a Continuous Latent Space

View PDF HTML (experimental) Abstract:Large language models (LLMs) are restricted to reason in the "language space", where they typically express the reasoning process with a chain-of-thought (CoT) to solve a complex reasoning problem. However, we argue that language space may not always be optimal for reasoning. For example, most word tokens are primarily for textual coherence and not essential for reasoning, while some critical tokens require complex planning and pose huge challenges to LLMs. ...