What's the strongest AI model you can train on a laptop in five minutes?
What’s the strongest model I can train on my MacBook Pro in five minutes?
I’ll give the answer upfront: the best 5-minute model I could train was a ~1.8M-param GPT-style transformer trained on ~20M TinyStories tokens, reaching ~9.6 perplexity on a held-out split. Here’s an example of the output, with the prompt bolded:
Once upon a time, there was a little boy named Tim. Tim had a small box that he liked to play with. He would push the box to open. One day, he found a big red ball in his yard. Ti...
Read more at seangoedecke.com