News Score: Score the News, Sort the News, Rewrite the Headlines

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought

Authors:Violet Xiang, Charlie Snell, Kanishk Gandhi, Alon Albalak, Anikait Singh, Chase Blagden, Duy Phung, Rafael Rafailov, Nathan Lile, Dakota Mahan, Louis Castricato, Jan-Philipp Franken, Nick Haber, Chelsea Finn View PDF Abstract:We propose a novel framework, Meta Chain-of-Thought (Meta-CoT), which extends traditional Chain-of-Thought (CoT) by explicitly modeling the underlying reasoning required to arrive at a particular CoT. We present empirical evidence from state-of-the-art models exhibi...

Read more at arxiv.org

© News Score  score the news, sort the news, rewrite the headlines