News Score: Score the News, Sort the News, Rewrite the Headlines

LIMO: Less is More for Reasoning

View PDF HTML (experimental) Abstract:We present a fundamental discovery that challenges our understanding of how complex reasoning emerges in large language models. While conventional wisdom suggests that sophisticated reasoning tasks demand extensive training data (>100,000 examples), we demonstrate that complex mathematical reasoning abilities can be effectively elicited with surprisingly few examples. Through comprehensive experiments, our proposed model LIMO demonstrates unprecedented perfo...

Read more at arxiv.org

© News Score  score the news, sort the news, rewrite the headlines