AI Model Outperforms gcc -O3, Boosts Assembly Code Speed 1.47x Using Reinforcement Learning

Improving Assembly Code Performance with Large Language Models via Reinforcement Learning

View PDF HTML (experimental) Abstract:Large language models (LLMs) have demonstrated strong performance across a wide range of programming tasks, yet their potential for code optimization remains underexplored. This work investigates whether LLMs can optimize the performance of assembly code, where fine-grained control over execution enables improvements that are difficult to express in high-level languages. We present a reinforcement learning framework that trains LLMs using Proximal Policy Opt...