llama-4-maverick-17b-128e-instruct — Free API

groq/llama-4-maverick-17b-128e-instruct
chat

What is llama-4-maverick-17b-128e-instruct?

Llama 4 Maverick 17B on Groq is Meta's highest-expert-count MoE model with 128 active experts, running on Groq's LPU hardware for fast inference. The large expert count gives it broader knowledge and stronger instruction-following than the Scout variant, making it the better Groq option for complex tasks. Rate limits are notably tighter than Groq's other endpoints at 15 RPM and 500 requests per day, so it is best reserved for evaluations and high-value queries rather than high-volume traffic. OpenAI SDK compatible; registration required, no credit card.

Model ID
llama-4-maverick-17b-128e-instruct
Base URL
https://api.groq.com/openai/v1

llama-4-maverick-17b-128e-instruct API Code Example

Paste your API key and run. See the config generator for Claude Code, Cursor, and more tools.

from openai import OpenAI

client = OpenAI(
    base_url="https://api.groq.com/openai/v1",
    api_key="YOUR_API_KEY"
)

response = client.chat.completions.create(
    model="llama-4-maverick-17b-128e-instruct",
    messages=[{"role": "user", "content": "Hello!"}]
)
print(response.choices[0].message.content)
import OpenAI from "openai";

const openai = new OpenAI({
  baseURL: "https://api.groq.com/openai/v1",
  apiKey: "YOUR_API_KEY",
});

const completion = await openai.chat.completions.create({
  model: "llama-4-maverick-17b-128e-instruct",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);
curl https://api.groq.com/openai/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "llama-4-maverick-17b-128e-instruct",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

Other Free Models from Groq

More About Groq

How to get an API key, rate limits, platform limitations, and tool configuration — everything you need to set up Groq as a free LLM API backend.

View Groq full guide →