News Score: Score the News, Sort the News, Rewrite the Headlines

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning

Authors: Zehan Qi, Xiao Liu, Iat Long Iong, Hanyu Lai, Xueqiao Sun, Xinyue Yang, Jiadai Sun, Yu Yang, Shuntian Yao, Tianjie Zhang, Wei Xu, Jie Tang, Yuxiao Dong

Abstract: Large language models (LLMs) have shown remarkable potential as autonomous agents, particularly in web-based tasks. However, existing LLM web agents heavily rely on expensive proprietary LLM APIs, while open LLMs lack the necessary decision-making capabilities. This paper introduces WebRL, a self-evol...

Read more at arxiv.org
