llama-3.3-70b-versatile — Free API
Created by Meta ⭐ Score: 45groq/llama-3-3-70b-versatile What is llama-3.3-70b-versatile?
Llama 3.3 70B on Groq delivers Meta's flagship 70B model with Groq's ultra-fast LPU inference — expect dramatically lower latency compared to GPU-based providers. With 131K context, 32K output, and OpenAI SDK compatibility, it is one of the fastest ways to access a proven 70B-class model for interactive applications. The free tier is generous: 14,400 requests per day at 30 RPM, making it viable for moderate production workloads. Registration is required but no credit card is needed. If your application values response speed above all else, this Groq + Llama 3.3 combination is hard to beat.
llama-3.3-70b-versatile API Code Example
Paste your API key and run. See the config generator for Claude Code, Cursor, and more tools.
Other Free Models from Groq
More About Groq
How to get an API key, rate limits, platform limitations, and tool configuration — everything you need to set up Groq as a free LLM API backend.
View Groq full guide →