News Score: Score the News, Sort the News, Rewrite the Headlines

GitHub - microsoft/BitNet: Official inference framework for 1-bit LLMs

bitnet.cpp Try it out via this demo, or build and run it on your own CPU or GPU. bitnet.cpp is the official inference framework for 1-bit LLMs (e.g., BitNet b1.58). It offers a suite of optimized kernels, that support fast and lossless inference of 1.58-bit models on CPU and GPU (NPU support will coming next). The first release of bitnet.cpp is to support inference on CPUs. bitnet.cpp achieves speedups of 1.37x to 5.07x on ARM CPUs, with larger models experiencing greater performance gains. Addi...

Read more at github.com

© News Score  score the news, sort the news, rewrite the headlines