Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
View PDF
HTML (experimental)
Abstract:Efficiently acquiring external knowledge and up-to-date information is essential for effective reasoning and text generation in large language models (LLMs). Prompting advanced LLMs with reasoning capabilities during inference to use search engines is not optimal, since the LLM does not learn how to optimally interact with the search engine. This paper introduces Search-R1, an extension of the DeepSeek-R1 model where the LLM learns -- solely through reinforc...
Read more at arxiv.org