Hugging Face Releases SmolLM2: Compact AI Models Run On-Device, Trained on 11 Trillion Tokens

SmolLM2

SmolLM2 (via) New from Loubna Ben Allal and her research team at Hugging Face: SmolLM2 is a family of compact language models available in three size: 135M, 360M, and 1.7B parameters. They are capable of solving a wide range of tasks while being lightweight enough to run on-device. [...] It was trained on 11 trillion tokens using a diverse dataset combination: FineWeb-Edu, DCLM, The Stack, along with new mathematics and coding datasets that we curated and will release soon. The model weights are...