WebRL: New AI Framework Boosts Open LLMs' Web Agent Performance, Outperforming GPT-4 in Web Tasks

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning

Authors:Zehan Qi, Xiao Liu, Iat Long Iong, Hanyu Lai, Xueqiao Sun, Xinyue Yang, Jiadai Sun, Yu Yang, Shuntian Yao, Tianjie Zhang, Wei Xu, Jie Tang, Yuxiao Dong View PDF HTML (experimental) Abstract:Large language models (LLMs) have shown remarkable potential as autonomous agents, particularly in web-based tasks. However, existing LLM web agents heavily rely on expensive proprietary LLM APIs, while open LLMs lack the necessary decision-making capabilities. This paper introduces WebRL, a self-evol...