Llama 3 is a collection of pre-trained and fine-tuned generative text models ranging in scale starting at 8 billion parameters, including a 70 billion parameter model published in April 2024 by Meta. It was released under a special Llama 3 license.

Llama 2 was pre-trained on 15 trillion tokens of data from publicly available sources.

Supported variants

Airtrain supports two variants of the Llama 3 model.

llama-3-8b-chat

Llama-3-8b-chat is an 8B parameter model optimized for dialogue use cases. It outperforms many similarly sized open-source chat models on most benchmarks.

llama-3-70b-chat

Llama-3-70b-chat is a 70B parameter model optimized for dialogue use cases. It outperforms many similarly sized open-source chat models on most benchmarks, and even several popular proprietary models (Gemini Pro, Claude 3 Sonnet).