News Score: Score the News, Sort the News, Rewrite the Headlines

How much do language models memorize?

View PDF HTML (experimental) Abstract:We propose a new method for estimating how much a model ``knows'' about a datapoint and use it to measure the capacity of modern language models. Prior studies of language model memorization have struggled to disentangle memorization from generalization. We formally separate memorization into two components: \textit{unintended memorization}, the information a model contains about a specific dataset, and \textit{generalization}, the information a model contai...

Read more at arxiv.org

© News Score  score the news, sort the news, rewrite the headlines