Researchers develop δ-mem, compact 8x8 memory system for AI language models; boosts performance 1.31x on memory tasks without costly context expansion or model retraining.

$δ$-mem: Efficient Online Memory for Large Language Models

View PDF Abstract:Large language models increasingly need to accumulate and reuse historical information in long-term assistants and agent systems. Simply expanding the context window is costly and often fails to ensure effective context utilization. We propose $\delta$-mem, a lightweight memory mechanism that augments a frozen full-attention backbone with a compact online state of associative memory. $\delta$-mem compresses past information into a fixed-size state matrix updated by delta-rule l...