New AI Method 'InfiniRetri' Enables LLMs to Process Infinite-Length Inputs, Achieves 100% Accuracy on 1M Token Test

Infinite Retrieval: Attention Enhanced LLMs in Long-Context Processing

View PDF HTML (experimental) Abstract:Limited by the context window size of Large Language Models(LLMs), handling various tasks with input tokens exceeding the upper limit has been challenging, whether it is a simple direct retrieval task or a complex multi-hop reasoning task. Although various methods have been proposed to enhance the long-context processing capabilities of LLMs, they either incur substantial post-training costs, or require additional tool modules(e.g.,RAG), or have not shown si...