What is the context window of Llama 3.3 70B Versatile?

Llama 3.3 70B Versatile supports up to 131K tokens of context.

Groq Operational

Llama 3.3 70B Versatile — pricing & price history

Q: How much does Llama 3.3 70B Versatile cost?

Llama 3.3 70B Versatile costs $0.59 per 1M input tokens and $0.79 per 1M output tokens via Groq (verified 2026-06-22).

Bottom line: Llama 3.3 70B Versatile costs $0.59 per 1M input tokens and $0.79 per 1M output tokens on Groq, with a 131K-token context window.

API model id: llama-3.3-70b-versatile · Verified against the provider's pricing page as of 2026-06-22.

Input / 1M tokens

$0.59

Output / 1M tokens

$0.79

Context window

131K

Provider status

Operational

Price history

Price history is accruing — the time-series appears here once we have ≥2 dated snapshots. Full history via the price_history API.

Date	Input /1M	Output /1M
2026-06-22	$0.59	$0.79

Cheaper alternatives with similar context

Model	Provider	Input /1M	Output /1M	Context
Command R7B (12-2024)	Cohere	$0.04	$0.15	128K
DeepSeek V4 Flash	DeepSeek	$0.14	$0.28	1M
Llama 3.3 70B (via OpenRouter)	OpenRouter	$0.1	$0.32	131K
GPT-4.1 nano	OpenAI	$0.1	$0.4	1M
Gemini 2.5 Flash-Lite	Google Gemini	$0.1	$0.4	1.048576M
GPT-4o mini (legacy)	OpenAI	$0.15	$0.6	128K

Routing agents: call price_history("groq--llama-3-3-70b-versatile") or cheapest_model(min_context=131072) over our MCP/x402 API to use this programmatically. Docs →