📡 AI API Radar

Groq · Operational

Llama 4 Scout (17Bx16E) — pricing & price history

Bottom line: Llama 4 Scout (17Bx16E) costs $0.11 per 1M input tokens and $0.34 per 1M output tokens on Groq, with a 131K-token context window.

API model id: llama-4-scout-17b · Verified against the provider's pricing page as of 2026-06-22.

Input / 1M tokens: $0.11
Output / 1M tokens: $0.34
Context window: 131K tokens
Provider status: Operational
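
To make the per-million rates concrete, here is a minimal sketch of estimating the cost of a single request from its token counts. The helper name `request_cost` and the example token counts are illustrative assumptions; only the two rates come from this page.

```python
from decimal import Decimal

# Rates from this page (USD per 1M tokens), as of 2026-06-22.
INPUT_PER_M = Decimal("0.11")
OUTPUT_PER_M = Decimal("0.34")
TOKENS_PER_M = Decimal(1_000_000)


def request_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimated USD cost of one request (hypothetical helper, not a provider API)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return (input_tokens * INPUT_PER_M + output_tokens * OUTPUT_PER_M) / TOKENS_PER_M


# Example: a request with 100,000 input tokens and 2,000 output tokens.
cost = request_cost(100_000, 2_000)
print(f"${cost:.5f}")  # prints "$0.01168"
```

Decimal arithmetic is used here so that small per-request amounts do not pick up floating-point rounding noise when summed over many requests.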

Price history

Price history is still accruing: the time series appears here once at least two dated snapshots exist. The full history is available via the price_history API.

Date        Input / 1M  Output / 1M
2026-06-22  $0.11       $0.34

Cheaper alternatives with similar context

Model                           Provider       Input / 1M  Output / 1M  Context
Command R7B (12-2024)           Cohere         $0.04       $0.15        128K
DeepSeek V4 Flash               DeepSeek       $0.14       $0.28        1M
Llama 3.3 70B (via OpenRouter)  OpenRouter     $0.10       $0.32        131K
GPT-4.1 nano                    OpenAI         $0.10       $0.40        1M
Gemini 2.5 Flash-Lite           Google Gemini  $0.10       $0.40        1M (1,048,576)
GPT-4o mini (legacy)            OpenAI         $0.15       $0.60        128K

Routing agents: call price_history("groq--llama-4-scout-17b") or cheapest_model(min_context=131072) over our MCP/x402 API to use this data programmatically.
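
The request and response shapes of the remote calls are not documented on this page, so what follows is only a local sketch of what a `cheapest_model(min_context=...)` style lookup might do, using the rows listed above. Several things here are assumptions: ranking by the cost of a given input/output token mix, the `input_tokens` and `output_tokens` parameters, the `Model` record, and the conversion of context labels to token counts (128K as 128,000, 131K as 131,072, 1M as 1,000,000). The real API's ranking rule and return value may differ.

```python
from dataclasses import dataclass
from decimal import Decimal


@dataclass(frozen=True)
class Model:
    name: str
    provider: str
    input_per_m: Decimal   # USD per 1M input tokens
    output_per_m: Decimal  # USD per 1M output tokens
    context: int           # tokens; conversions from K/M labels are assumptions


def _m(name: str, provider: str, inp: str, out: str, ctx: int) -> Model:
    return Model(name, provider, Decimal(inp), Decimal(out), ctx)


# Prices copied from this page; context token counts are assumed conversions.
MODELS = [
    _m("Llama 4 Scout (17Bx16E)", "Groq", "0.11", "0.34", 131_072),
    _m("Command R7B (12-2024)", "Cohere", "0.04", "0.15", 128_000),
    _m("DeepSeek V4 Flash", "DeepSeek", "0.14", "0.28", 1_000_000),
    _m("Llama 3.3 70B (via OpenRouter)", "OpenRouter", "0.10", "0.32", 131_072),
    _m("GPT-4.1 nano", "OpenAI", "0.10", "0.40", 1_000_000),
    _m("Gemini 2.5 Flash-Lite", "Google Gemini", "0.10", "0.40", 1_048_576),
    _m("GPT-4o mini (legacy)", "OpenAI", "0.15", "0.60", 128_000),
]


def workload_cost(m: Model, input_tokens: int, output_tokens: int) -> Decimal:
    """USD cost of a workload with the given token counts."""
    return (input_tokens * m.input_per_m + output_tokens * m.output_per_m) / 1_000_000


def cheapest_model(min_context: int, input_tokens: int = 1_000_000,
                   output_tokens: int = 200_000) -> Model:
    """Local stand-in: cheapest listed model whose context is >= min_context."""
    eligible = [m for m in MODELS if m.context >= min_context]
    if not eligible:
        raise LookupError(f"no listed model has context >= {min_context}")
    return min(eligible, key=lambda m: workload_cost(m, input_tokens, output_tokens))


best = cheapest_model(min_context=131_072)
print(best.name, "via", best.provider)
```

Under these assumptions the answer depends on the token mix: the input-heavy default above selects a different row than an output-only workload such as `cheapest_model(131_072, 0, 1_000_000)`. If 128K really does mean 128,000 tokens, the two 128K rows are excluded at `min_context=131072`.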