llama-4-maverick-17b-128e-instruct — Free API

groq/llama-4-maverick-17b-128e-instruct

chat

Get API key → Save Key to Vault Test in Playground →

What is llama-4-maverick-17b-128e-instruct?

Llama 4 Maverick 17B on Groq is Meta's highest-expert-count MoE model with 128 active experts, running on Groq's LPU hardware for fast inference. The large expert count gives it broader knowledge and stronger instruction-following than the Scout variant, making it the better Groq option for complex tasks. Rate limits are notably tighter than Groq's other endpoints at 15 RPM and 500 requests per day, so it is best reserved for evaluations and high-value queries rather than high-volume traffic. OpenAI SDK compatible; registration required, no credit card.

Model ID

llama-4-maverick-17b-128e-instruct

Base URL

https://api.groq.com/openai/v1

llama-4-maverick-17b-128e-instruct API Code Example

Paste your API key and run. See the config generator for Claude Code, Cursor, and more tools.

from openai import OpenAI

client = OpenAI(
    base_url="https://api.groq.com/openai/v1",
    api_key="YOUR_API_KEY"
)

response = client.chat.completions.create(
    model="llama-4-maverick-17b-128e-instruct",
    messages=[{"role": "user", "content": "Hello!"}]
)
print(response.choices[0].message.content)

import OpenAI from "openai";

const openai = new OpenAI({
  baseURL: "https://api.groq.com/openai/v1",
  apiKey: "YOUR_API_KEY",
});

const completion = await openai.chat.completions.create({
  model: "llama-4-maverick-17b-128e-instruct",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);

curl https://api.groq.com/openai/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "llama-4-maverick-17b-128e-instruct",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

Test in Playground → Generate IDE Config →

More About Groq

How to get an API key, rate limits, platform limitations, and tool configuration — everything you need to set up Groq as a free LLM API backend.

View Groq full guide →

llama-4-maverick-17b-128e-instruct — Free API

What is llama-4-maverick-17b-128e-instruct?

llama-4-maverick-17b-128e-instruct API Code Example

Other Free Models from Groq

llama-3.3-70b-versatile

llama-3.1-8b-instant

llama-4-scout-17b-16e-instruct

qwen3-32b

kimi-k2-instruct

deepseek-r1-distill-70b

More About Groq

llama-4-maverick-17b-128e-instruct — Free API

What is llama-4-maverick-17b-128e-instruct?

llama-4-maverick-17b-128e-instruct API Code Example

Other Free Models from Groq

llama-3.3-70b-versatile

llama-3.1-8b-instant

llama-4-scout-17b-16e-instruct

qwen3-32b

kimi-k2-instruct

deepseek-r1-distill-70b

More About Groq

Export to Chat Client 🚀