Llama-4-Maverick-17B-128E — Free API
⭐ Score: 53github-models/llama-4-maverick-17b-128e What is Llama-4-Maverick-17B-128E?
Llama 4 Maverick 17B is Meta's high-expert-count MoE model with 128 active experts, available for free on GitHub Models. Despite its compact 17B active parameter footprint, the large expert pool gives it broad knowledge coverage — making it a strong general-purpose chat and instruction-following model that rivals much larger dense architectures. The free tier limits output to 4K tokens per request with 10 RPM and 50 requests per day, so it is best suited for interactive chat and short-form generation rather than long-form writing. Fully OpenAI SDK-compatible; requires only a GitHub account to start using.
Llama-4-Maverick-17B-128E API Code Example
Paste your API key and run. See the config generator for Claude Code, Cursor, and more tools.
Other Free Models from GitHub Models
More About GitHub Models
How to get an API key, rate limits, platform limitations, and tool configuration — everything you need to set up GitHub Models as a free LLM API backend.
View GitHub Models full guide →