News Score: Score the News, Sort the News, Rewrite the Headlines

Beyond Semantics: The Unreasonable Effectiveness of Reasonless Intermediate Tokens

View PDF HTML (experimental) Abstract:Recent impressive results from large reasoning models have been interpreted as a triumph of Chain of Thought (CoT), and especially of the process of training on CoTs sampled from base LLMs in order to help find new reasoning patterns. In this paper, we critically examine that interpretation by investigating how the semantics of intermediate tokens-often anthropomorphized as "thoughts" or reasoning traces and which are claimed to display behaviors like backtr...

Read more at arxiv.org

© News Score  score the news, sort the news, rewrite the headlines