GitHub - mukel/llama3.java: Practical Llama 3 inference in Java
Llama3.java
Practical Llama 3 inference implemented in a single Java file.
This project is the successor to llama2.java, based on llama2.c by Andrej Karpathy and his excellent educational videos.
Besides the educational value, this project will be used to test and tune compiler optimizations and features on the JVM, particularly for the Graal compiler.
Features
Single file, no dependencies
GGUF format parser
Llama 3 tokenizer based on minbpe
Llama 3 inference with Grouped-Query Attention
Support ...
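As a rough illustration of the Grouped-Query Attention feature listed above, the sketch below shows how a query head can be mapped to the key/value head it shares. This is not taken from the project's source: the class name `GqaSketch`, the method `kvHeadFor`, and its parameter names are hypothetical, and it assumes the number of query heads is an exact multiple of the number of key/value heads.

```java
// Hypothetical sketch, not the project's code: head mapping in Grouped-Query Attention.
final class GqaSketch {

    /**
     * Returns the index of the key/value head used by the given query head.
     * Assumes nHeads is an exact multiple of nKvHeads, so consecutive query
     * heads form groups of equal size that share one key/value head.
     */
    static int kvHeadFor(int queryHead, int nHeads, int nKvHeads) {
        if (nKvHeads <= 0 || nHeads % nKvHeads != 0) {
            throw new IllegalArgumentException("nHeads must be a positive multiple of nKvHeads");
        }
        if (queryHead < 0 || queryHead >= nHeads) {
            throw new IndexOutOfBoundsException("queryHead out of range: " + queryHead);
        }
        int groupSize = nHeads / nKvHeads; // query heads per key/value head
        return queryHead / groupSize;      // integer division selects the group
    }
}
```

With the illustrative values of 32 query heads and 8 key/value heads, query heads 0 to 3 read from key/value head 0, heads 4 to 7 from head 1, and so on. When the two counts are equal the mapping reduces to ordinary multi-head attention, and with a single key/value head it reduces to multi-query attention.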