"Tencent's ARC Lab Develops LLaMA-Pro-8B: An Advanced AI Model Specializing in Programming and Mathematics"

TencentARC/LLaMA-Pro-8B · Hugging Face

LLaMA-Pro-8B Model Card Model Description LLaMA-Pro is a progressive version of the original LLaMA model, enhanced by the addition of Transformer blocks. It specializes in integrating both general language understanding and domain-specific knowledge, particularly in programming and mathematics. Development and Training Developed by Tencent's ARC Lab, LLaMA-Pro is an 8.3 billion parameter model. It's an expansion of LLaMA2-7B, further trained on code and math corpora totaling 80 billion tokens. I...