"Boosting the Reasoning Abilities of Large Language Models Through Geometric Understanding, Finds Study"

Reasoning in Large Language Models: A Geometric Perspective

View PDF HTML (experimental) Abstract:The advancement of large language models (LLMs) for real-world applications hinges critically on enhancing their reasoning capabilities. In this work, we explore the reasoning abilities of large language models (LLMs) through their geometrical understanding. We establish a connection between the expressive power of LLMs and the density of their self-attention graphs. Our analysis demonstrates that the density of these graphs defines the intrinsic dimension o...