News Score: Score the News, Sort the News, Rewrite the Headlines

LLMs Can Teach Themselves to Better Predict the Future

View PDF Abstract:We present an outcome-driven fine-tuning framework that enhances the forecasting capabilities of large language models (LLMs) without relying on human-curated reasoning samples. Our method leverages model self-play to generate pairs of diverse reasoning trajectories and probabilistic forecasts for a set of diverse questions that resolve after the models' knowledge cutoff date. We then rank pairs of these reasoning traces by their distance to the actual outcomes before fine-tuni...

Read more at arxiv.org

© News Score  score the news, sort the news, rewrite the headlines