📡 AI API Radar

Groq · Operational

Llama 3.3 70B Versatile — pricing & price history

Bottom line: Llama 3.3 70B Versatile costs $0.59 per 1M input tokens and $0.79 per 1M output tokens on Groq, with a 131K-token context window.

API model id: llama-3.3-70b-versatile · Verified against the provider's pricing page as of 2026-06-22.

Input / 1M tokens: $0.59
Output / 1M tokens: $0.79
Context window: 131K
Provider status: Operational
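
As a rough illustration of how the per-million rates above translate into the cost of a single request, here is a minimal sketch. The function name and the example token counts are hypothetical; only the two prices come from this page, and actual billing may differ from this simple linear estimate.

```python
from decimal import Decimal

# Prices listed on this page for llama-3.3-70b-versatile on Groq (USD per 1M tokens).
INPUT_USD_PER_M = Decimal("0.59")
OUTPUT_USD_PER_M = Decimal("0.79")
TOKENS_PER_M = Decimal(1_000_000)


def request_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request, assuming purely linear per-token pricing."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / TOKENS_PER_M


# Hypothetical request: 10,000 prompt tokens and 2,000 completion tokens.
cost = request_cost_usd(10_000, 2_000)
print(f"Estimated cost: ${cost}")
```

Decimal is used instead of float so that small per-request amounts add up without binary rounding noise.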

Price history

Price history is still accruing: the time series will appear here once we have at least two dated snapshots. The full history is available via the price_history API.

Date        Input / 1M  Output / 1M
2026-06-22  $0.59       $0.79

Cheaper alternatives with similar context

Model                           Provider       Input / 1M  Output / 1M  Context
Command R7B (12-2024)           Cohere         $0.04       $0.15        128K
DeepSeek V4 Flash               DeepSeek       $0.14       $0.28        1M
Llama 3.3 70B (via OpenRouter)  OpenRouter     $0.10       $0.32        131K
GPT-4.1 nano                    OpenAI         $0.10       $0.40        1M
Gemini 2.5 Flash-Lite           Google Gemini  $0.10       $0.40        1.05M
GPT-4o mini (legacy)            OpenAI         $0.15       $0.60        128K
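
Which alternative is actually cheapest depends on the ratio of input to output tokens in a given workload. The sketch below ranks the models from the table above, plus the Groq listing as a baseline, for an assumed workload of 3M input tokens and 1M output tokens. The 3:1 mix and the helper names are illustrative assumptions; the prices are copied from this page.

```python
from decimal import Decimal

# (model, provider, input USD per 1M tokens, output USD per 1M tokens), as listed on this page.
PRICES = [
    ("Llama 3.3 70B Versatile", "Groq", "0.59", "0.79"),
    ("Command R7B (12-2024)", "Cohere", "0.04", "0.15"),
    ("DeepSeek V4 Flash", "DeepSeek", "0.14", "0.28"),
    ("Llama 3.3 70B (via OpenRouter)", "OpenRouter", "0.10", "0.32"),
    ("GPT-4.1 nano", "OpenAI", "0.10", "0.40"),
    ("Gemini 2.5 Flash-Lite", "Google Gemini", "0.10", "0.40"),
    ("GPT-4o mini (legacy)", "OpenAI", "0.15", "0.60"),
]


def workload_cost_usd(input_price: str, output_price: str,
                      input_millions: int, output_millions: int) -> Decimal:
    """Cost of a workload measured in millions of input and output tokens."""
    return Decimal(input_price) * input_millions + Decimal(output_price) * output_millions


def rank_by_cost(input_millions: int, output_millions: int):
    """Return (cost, model, provider) tuples sorted from cheapest to most expensive."""
    costs = [
        (workload_cost_usd(inp, out, input_millions, output_millions), model, provider)
        for model, provider, inp, out in PRICES
    ]
    return sorted(costs, key=lambda row: row[0])


# Assumed workload: 3M input tokens and 1M output tokens.
for cost, model, provider in rank_by_cost(3, 1):
    print(f"${cost:.2f}  {model} ({provider})")
```

This compares list prices only. It does not account for differences in model quality, latency, rate limits, or context size.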

Routing agents: call price_history("groq--llama-3-3-70b-versatile") or cheapest_model(min_context=131072) over our MCP/x402 API to query this data programmatically. Docs →
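
For agents that speak MCP directly, the two calls named above could be wrapped as JSON-RPC 2.0 "tools/call" requests, which is the general shape MCP uses for tool invocation. The sketch below only builds the request bodies; it does not send them, and it does not cover the x402 payment step. The tool names and the min_context argument come from this page, while the "model" argument key for price_history and the helper name are assumptions, so check the docs for the real schema and endpoint.

```python
import json


def mcp_tool_call(name: str, arguments: dict, request_id: int = 1) -> str:
    """Build a JSON-RPC 2.0 request body for an MCP tools/call invocation."""
    payload = {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    }
    return json.dumps(payload)


# min_context=131072 is the value shown on this page.
cheapest_request = mcp_tool_call("cheapest_model", {"min_context": 131072})

# ASSUMPTION: the argument key "model" is hypothetical; the page only shows a positional slug.
history_request = mcp_tool_call(
    "price_history", {"model": "groq--llama-3-3-70b-versatile"}, request_id=2
)

print(cheapest_request)
print(history_request)
```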