Documentation

Inference pricing

Once your free credits expire, usage of the LLM Playground is charged per inference tokens. Find below the pricing per model.

Model$ per million input tokens$ per million output tokens
open-mistral-7b$0.28$0.28
open-mixtral-8x7b$1.50$1.50
mistral-large-latest$11$33
mistral-medium-latest$3$9
mistral-small-latest$3$9
gpt-4$33$66
gpt-3.5-turbo$1.50$1.50
Gemini Pro$1.50$1.50
Gemini Ultra$4$4
llama-2-7b-chat$0.28$0.28
llama-2-13b-chat$0.28$0.28
llama-2-70b-chat$1.50$1.50
phi-2$0.11$0.11
Claude 3 Opus$17$83
Claude 3 Sonnet$4$17
Gemma 2b$0.11$0.11
Gemma 7b$0.28$0.28