News Score: Score the News, Sort the News, Rewrite the Headlines

Training Large Language Models to Reason in a Continuous Latent Space

View PDF HTML (experimental) Abstract:Large language models (LLMs) are restricted to reason in the "language space", where they typically express the reasoning process with a chain-of-thought (CoT) to solve a complex reasoning problem. However, we argue that language space may not always be optimal for reasoning. For example, most word tokens are primarily for textual coherence and not essential for reasoning, while some critical tokens require complex planning and pose huge challenges to LLMs. ...

Read more at arxiv.org

© News Score  score the news, sort the news, rewrite the headlines