News Score: Score the News, Sort the News, Rewrite the Headlines

Universal pre-training by iterated random computation

View PDF HTML (experimental) Abstract:We investigate the use of randomly generated data for the sake of pre-training a model. We justify this approach theoretically from the perspective of algorithmic complexity, building on recent research that shows that sequence models can be trained to approximate Solomonoff induction. We derive similar, but complementary theoretical results. We show empirically that synthetically generated data can be used to pre-train a model before the data is seen. We re...

Read more at arxiv.org

© News Score  score the news, sort the news, rewrite the headlines