"Understanding Hierarchical and Categorical Concepts in Language Models: Study Explores Encoding of Semantic Meaning in Gemma Model Using WordNet Data"

The Geometry of Categorical and Hierarchical Concepts in Large Language Models

View PDF HTML (experimental) Abstract:Understanding how semantic meaning is encoded in the representation spaces of large language models is a fundamental problem in interpretability. In this paper, we study the two foundational questions in this area. First, how are categorical concepts, such as {'mammal', 'bird', 'reptile', 'fish'}, represented? Second, how are hierarchical relations between concepts encoded? For example, how is the fact that 'dog' is a kind of 'mammal' encoded? We show how to...