Study Reveals: LLMs Use Procedural Knowledge, Not Retrieval, for Reasoning Tasks—Insights from 2.5B Pretraining Tokens

Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models

Abstract: The capabilities and limitations of Large Language Models have been sketched out in great detail in recent years, providing an intriguing yet conflicting picture. On the one hand, LLMs demonstrate a general ability to solve problems. On the other hand, they show surprising reasoning gaps when compared to humans, casting doubt on the robustness of their generalisation strategies. The sheer volume of data used in the design of LLMs has precluded us from applyi...