What is the context window of Llama 4 Scout (17Bx16E)?

Llama 4 Scout (17Bx16E) supports up to 131K tokens of context.

Groq Operational

Llama 4 Scout (17Bx16E) — pricing & price history

Q: How much does Llama 4 Scout (17Bx16E) cost?

Llama 4 Scout (17Bx16E) costs $0.11 per 1M input tokens and $0.34 per 1M output tokens via Groq (verified 2026-06-22).

Bottom line: Llama 4 Scout (17Bx16E) costs $0.11 per 1M input tokens and $0.34 per 1M output tokens on Groq, with a 131K-token context window.

API model id: llama-4-scout-17b · Verified against the provider's pricing page as of 2026-06-22.

Input / 1M tokens

$0.11

Output / 1M tokens

$0.34

Context window

131K

Provider status

Operational

Price history

Price history is accruing — the time-series appears here once we have ≥2 dated snapshots. Full history via the price_history API.

Date	Input /1M	Output /1M
2026-06-22	$0.11	$0.34

Cheaper alternatives with similar context

Model	Provider	Input /1M	Output /1M	Context
Command R7B (12-2024)	Cohere	$0.04	$0.15	128K
DeepSeek V4 Flash	DeepSeek	$0.14	$0.28	1M
Llama 3.3 70B (via OpenRouter)	OpenRouter	$0.1	$0.32	131K
GPT-4.1 nano	OpenAI	$0.1	$0.4	1M
Gemini 2.5 Flash-Lite	Google Gemini	$0.1	$0.4	1.048576M
GPT-4o mini (legacy)	OpenAI	$0.15	$0.6	128K

Routing agents: call price_history("groq--llama-4-scout-17b") or cheapest_model(min_context=131072) over our MCP/x402 API to use this programmatically. Docs →