Llama-4-Scout-17B-16E — Free API
⭐ Score: 54github-models/llama-4-scout-17b-16e What is Llama-4-Scout-17B-16E?
Llama 4 Scout 17B is Meta's efficient long-context model with 16 active experts (MoE), available free on GitHub Models. With a 512K context window — far beyond most free models — it can process entire novels, full code repositories, or multi-hour transcripts in a single request. The 17B total parameter footprint keeps inference fast and cost-effective, making it practical for retrieval-free document analysis and long-form summarization. Rate limits are 15 RPM and 150 requests per day with per-request output capped at 4K tokens. Fully OpenAI SDK-compatible and requires only a GitHub account.
Llama-4-Scout-17B-16E API Code Example
Paste your API key and run. See the config generator for Claude Code, Cursor, and more tools.
Other Free Models from GitHub Models
More About GitHub Models
How to get an API key, rate limits, platform limitations, and tool configuration — everything you need to set up GitHub Models as a free LLM API backend.
View GitHub Models full guide →