News Score: Score the News, Sort the News, Rewrite the Headlines

LM2: Large Memory Models

View PDF HTML (experimental) Abstract:This paper introduces the Large Memory Model (LM2), a decoder-only Transformer architecture enhanced with an auxiliary memory module that aims to address the limitations of standard Transformers in multi-step reasoning, relational argumentation, and synthesizing information distributed over long contexts. The proposed LM2 incorporates a memory module that acts as a contextual representation repository, interacting with input tokens via cross attention and up...

Read more at arxiv.org

© News Score  score the news, sort the news, rewrite the headlines