llama-4-maverick-17b-128e-instruct — Free API
groq/llama-4-maverick-17b-128e-instruct What is llama-4-maverick-17b-128e-instruct?
Llama 4 Maverick 17B on Groq is Meta's highest-expert-count MoE model with 128 active experts, running on Groq's LPU hardware for fast inference. The large expert count gives it broader knowledge and stronger instruction-following than the Scout variant, making it the better Groq option for complex tasks. Rate limits are notably tighter than Groq's other endpoints at 15 RPM and 500 requests per day, so it is best reserved for evaluations and high-value queries rather than high-volume traffic. OpenAI SDK compatible; registration required, no credit card.
llama-4-maverick-17b-128e-instruct API Code Example
Paste your API key and run. See the config generator for Claude Code, Cursor, and more tools.
Other Free Models from Groq
More About Groq
How to get an API key, rate limits, platform limitations, and tool configuration — everything you need to set up Groq as a free LLM API backend.
View Groq full guide →