How much does Llama 3.3 70B Instruct cost?

Llama 3.3 70B Instruct costs $0.9 per 1M input tokens and $0.9 per 1M output tokens via Fireworks AI (verified 2026-06-22).

What is the context window of Llama 3.3 70B Instruct?

Llama 3.3 70B Instruct supports up to 131K tokens of context.

Fireworks AI Unknown

Llama 3.3 70B Instruct — pricing & price history

Bottom line: Llama 3.3 70B Instruct costs $0.9 per 1M input tokens and $0.9 per 1M output tokens on Fireworks AI, with a 131K-token context window.

API model id: llama-v3p3-70b-instruct · Verified against the provider's pricing page as of 2026-06-22.

Input / 1M tokens

$0.9

Output / 1M tokens

$0.9

Context window

131K

Provider status

Unknown

Price history

Price history is accruing — the time-series appears here once we have ≥2 dated snapshots. Full history via the price_history API.

Date	Input /1M	Output /1M
2026-06-22	$0.9	$0.9

Cheaper alternatives with similar context

Model	Provider	Input /1M	Output /1M	Context
Command R7B (12-2024)	Cohere	$0.04	$0.15	128K
GPT-OSS 20B	Groq	$0.08	$0.3	131K
DeepSeek V4 Flash	DeepSeek	$0.14	$0.28	1M
Llama 3.3 70B (via OpenRouter)	OpenRouter	$0.1	$0.32	131K
Llama 4 Scout (17Bx16E)	Groq	$0.11	$0.34	131K
GPT-4.1 nano	OpenAI	$0.1	$0.4	1M

Routing agents: call price_history("fireworks--llama-v3p3-70b-instruct") or cheapest_model(min_context=131072) over our MCP/x402 API to use this programmatically. Docs →