Inference pricing
Once your free credits expire, LLM Playground usage is billed per inference token, with separate rates for input and output tokens. The table below lists the price for each model.
Model | $ per million input tokens | $ per million output tokens |
---|---|---|
open-mistral-7b | $0.28 | $0.28 |
open-mixtral-8x7b | $1.50 | $1.50 |
mistral-large-latest | $11 | $33 |
mistral-medium-latest | $3 | $9 |
mistral-small-latest | $3 | $9 |
gpt-4 | $33 | $66 |
gpt-3.5-turbo | $1.50 | $1.50 |
Gemini Pro | $1.50 | $1.50 |
Gemini Ultra | $4 | $4 |
llama-2-7b-chat | $0.28 | $0.28 |
llama-2-13b-chat | $0.28 | $0.28 |
llama-2-70b-chat | $1.50 | $1.50 |
phi-2 | $0.11 | $0.11 |
Claude 3 Opus | $17 | $83 |
Claude 3 Sonnet | $4 | $17 |
Gemma 2b | $0.11 | $0.11 |
Gemma 7b | $0.28 | $0.28 |
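
As a rough illustration of how these rates translate into a bill, the sketch below estimates the cost of a single request. It assumes cost scales linearly with token counts and that no rounding or minimum charge applies; neither is stated on this page. The `PRICES` dictionary and the `estimate_cost` function are hypothetical names used only for this example, not part of any API, and only a few rows of the table are copied in.

```python
# Prices in USD per million tokens (input, output), copied from a subset
# of the table above. PRICES and estimate_cost are illustrative names.
PRICES = {
    "open-mistral-7b": (0.28, 0.28),
    "mistral-large-latest": (11.0, 33.0),
    "gpt-4": (33.0, 66.0),
    "Claude 3 Opus": (17.0, 83.0),
}


def estimate_cost(model, input_tokens, output_tokens):
    """Estimate the USD cost of one request.

    Assumes simple linear per-token billing with no rounding.
    """
    input_price, output_price = PRICES[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Example: 2,000 input tokens and 500 output tokens on mistral-large-latest.
cost = estimate_cost("mistral-large-latest", 2_000, 500)
print(f"${cost:.4f}")  # prints $0.0385
```

Under these assumptions, the example request costs 2,000 × $11 / 1,000,000 + 500 × $33 / 1,000,000 = $0.0385. Output tokens dominate the cost for models where the output rate is higher than the input rate.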