News Score: Score the News, Sort the News, Rewrite the Headlines

The Geometry of Categorical and Hierarchical Concepts in Large Language Models

Abstract: Understanding how semantic meaning is encoded in the representation spaces of large language models is a fundamental problem in interpretability. In this paper, we study the two foundational questions in this area. First, how are categorical concepts, such as {'mammal', 'bird', 'reptile', 'fish'}, represented? Second, how are hierarchical relations between concepts encoded? For example, how is the fact that 'dog' is a kind of 'mammal' encoded? We show how to...

Read more at arxiv.org
