44 models verified via live API · refreshed Jun 4, 2026how we verify
Provider Model Context Max Output Modality Rate Limit Released Weekly Tokens Status
OpenRouter MiniMax: MiniMax M3 Paid Verified 1.0M 512K textimage See provider page May 31, 2026 784.0B Online Details
OpenRouter inclusionAI: Ring-2.6-1T Paid Verified 262K 66K text See provider page May 8, 2026 23.2B Online Details
OpenRouter Owl Alpha Verified 1.0M 262K text See provider page Apr 28, 2026 2.0T Online Details
OpenRouter NVIDIA: Nemotron 3 Nano Omni (free) Verified 256K 66K textimageaudio See provider page Apr 28, 2026 23.6B Online Details
OpenRouter Poolside: Laguna XS.2 (free) Verified 262K 33K text See provider page Apr 28, 2026 95.9B Online Details
OpenRouter Poolside: Laguna M.1 (free) Verified 262K 33K text See provider page Apr 28, 2026 569.3B Online Details
OpenRouter DeepSeek: DeepSeek V4 Flash Paid Verified 1.0M 131K text See provider page Apr 24, 2026 47.2M Online Details
OpenRouter MoonshotAI: Kimi K2.6 (free) Verified 262K 8K textimage See provider page Apr 20, 2026 20.8B Online Details
NVIDIA NIM moonshotai/kimi-k2.6 262K 8K text Up to 40 RPM Apr 20, 2026 20.8B Unavailable Details
OpenRouter Z.ai: GLM 5.1 Paid Verified 203K 8K text See provider page Apr 7, 2026 307.9B Online Details
NVIDIA NIM z-ai/glm-5.1 203K 8K text Up to 40 RPM Apr 7, 2026 307.9B Unavailable Details
OpenRouter Google: Gemma 4 26B A4B (free) Verified 262K 33K textimage See provider page Apr 3, 2026 5.1B Online Details
OpenRouter Google: Gemma 4 31B (free) Verified 262K 33K textimage See provider page Apr 2, 2026 23.7B Online Details
OpenRouter Arcee AI: Trinity Large Thinking Paid Verified 262K 262K textreasoning See provider page Apr 1, 2026 2.7B Online Details
OpenRouter Google: Lyria 3 Pro Preview Verified 1.0M 66K textimage See provider page Mar 30, 2026 15.3M Online Details
OpenRouter Google: Lyria 3 Clip Preview Verified 1.0M 66K textimage See provider page Mar 30, 2026 15.1M Online Details
OpenRouter NVIDIA: Nemotron 3 Super (free) Verified 1.0M 262K text See provider page Mar 11, 2026 672.4B Online Details
NVIDIA NIM qwen/qwen3.5-122b-a10b Verified 262K 262K textimage Up to 40 RPM Feb 25, 2026 6.7B Online Details
OpenRouter NVIDIA: Llama Nemotron Embed VL 1B V2 (free) 131K 8K textimageembeddings See provider page Feb 25, 2026 Unavailable Details
NVIDIA NIM qwen/qwen3.5-397b-a17b Verified 262K 66K textimage Up to 40 RPM Feb 16, 2026 44.5B Online Details
OpenRouter MiniMax: MiniMax M2.5 Paid Verified 205K 197K text See provider page Feb 12, 2026 32.0M Online Details
OpenRouter Qwen: Qwen3 Coder 480B A35B (free) Verified 1.0M 262K textcode See provider page Feb 4, 2026 0 Online Details
OpenRouter Free Models Router Verified 200K 8K textimage See provider page Feb 1, 2026 Online Details
OpenRouter LiquidAI: LFM2.5-1.2B-Thinking (free) Verified 33K 8K textreasoning See provider page Jan 20, 2026 1.6B Online Details
OpenRouter LiquidAI: LFM2.5-1.2B-Instruct (free) Verified 33K 8K text See provider page Jan 20, 2026 979.8M Online Details
OpenRouter NVIDIA: Nemotron 3 Nano 30B A3B (free) Verified 256K 8K text See provider page Dec 14, 2025 36.4B Online Details
OpenRouter OpenAI: gpt-oss-safeguard-20b Paid Verified 131K 66K text See provider page Oct 29, 2025 8.0B Online Details
OpenRouter NVIDIA: Nemotron Nano 12B 2 VL (free) Verified 128K 128K textimage See provider page Oct 28, 2025 10.3B Online Details
NVIDIA NIM nvidia/llama-3.3-nemotron-super-49b-v1.5 Verified 131K 16K text Up to 40 RPM Oct 10, 2025 711.4M Online Details
OpenRouter Qwen: Qwen3 Next 80B A3B Instruct (free) Verified 262K 8K text See provider page Sep 11, 2025 436.0M Online Details
OpenRouter NVIDIA: Nemotron Nano 9B V2 (free) Verified 128K 8K text See provider page Sep 5, 2025 11.8B Online Details
OpenRouter OpenAI: gpt-oss-120b (free) Verified 131K 131K text See provider page Aug 5, 2025 182.9B Online Details
OpenRouter OpenAI: gpt-oss-20b (free) Verified 131K 8K text See provider page Aug 5, 2025 39.1B Online Details
OpenRouter Z.ai: GLM 4.5 Air (free) Verified 131K 96K text See provider page Jul 25, 2025 98.0B Online Details
OpenRouter Meta: Llama 3.3 70B Instruct (free) Verified 131K 8K text See provider page Dec 6, 2024 394.4M Online Details
OpenRouter Meta: Llama 3.2 3B Instruct (free) Verified 131K 8K text See provider page Sep 25, 2024 45.8M Online Details
OpenRouter Nous: Hermes 3 405B Instruct (free) Verified 131K 8K text See provider page Aug 16, 2024 47.1M Online Details
NVIDIA NIM mistralai/mistral-large-2-instruct 131K 8K text Up to 40 RPM Feb 26, 2024 610.8M Unavailable Details
Cloudflare Workers AI `@cf/meta/llama-4-scout-17b-16e-instruct` 10.0M 131K text 10K neurons/day (shared) Online Details
Google Gemini Gemini 2.5 Pro 2.0M 65K text 5 RPM, 100 RPD Online Details
xAI grok-4.1-fast 2.0M 32K text Credit-based Online Details
NVIDIA NIM deepseek-ai/deepseek-v4-flash Verified 1.0M 131K text Up to 40 RPM Online Details
Alibaba Cloud Model Studio Qwen3-Plus 1.0M 32K text Tiered by region Online Details
Google Gemini Gemini 2.5 Flash 1.0M 65K text 10 RPM, 250 RPD Online Details
Google Gemini Gemini 2.5 Flash-Lite 1.0M 65K text 15 RPM, 1,000 RPD Online Details
Google Gemini Gemini 3 Flash (Preview) 1.0M 65K text Preview limits Online Details
xAI grok-4.3 1.0M 32K text Credit-based Online Details
GitHub Models gpt-4.1 1.0M 32K text 10 RPM, 50 RPD Online Details
GitHub Models gpt-4.1-mini 1.0M 32K text 15 RPM, 150 RPD Online Details
GitHub Models Llama-4-Scout-17B-16E 512K 4K text 15 RPM, 150 RPD Online Details
NVIDIA NIM stepfun-ai/step-3.5-flash Verified 262K 16K text Up to 40 RPM Online Details
Groq kimi-k2-instruct 262K 262K text 30 RPM, 14,400 RPD Online Details
Kilo Code `nvidia/nemotron-3-super-120b-a12b:free` 262K 32K text ~200 req/hr Online Details
Ollama Cloud `kimi-k2:1t-cloud` 262K 131K text Session/weekly limits (unpublished) Online Details
AI21 Labs Jamba Large 1.7 256K 4K text 200 RPM, 10 RPS Online Details
AI21 Labs Jamba Mini 2 256K 4K text 200 RPM, 10 RPS Online Details
Alibaba Cloud Model Studio Qwen3-Coder-Plus 256K 8K textcode Tiered by region Online Details
Cohere Command A (111B) 256K 4K text 20 RPM Online Details
Mistral AI Mistral Small 4 256K 256K text ~1 RPS, 500K TPM Online Details
Mistral AI Mistral Large 3 256K 256K text ~1 RPS, 500K TPM Online Details
Mistral AI Codestral 256K 256K textcode ~1 RPS, 500K TPM Online Details
Cloudflare Workers AI `@cf/google/gemma-4-26b-a4b-it` 256K 131K text 10K neurons/day (shared) Online Details
Cloudflare Workers AI `@cf/moonshotai/kimi-k2.5` 256K 131K text 10K neurons/day (shared) Online Details
GitHub Models Llama-4-Maverick-17B-128E 256K 4K text 10 RPM, 50 RPD Online Details
Kilo Code `x-ai/grok-code-fast-1:free` 256K 131K textcode ~200 req/hr Online Details
Nscale Qwen3-Coder-30B-A3B-Instruct 256K 32K textcode Fair-use Online Details
NVIDIA NIM stepfun-ai/step-3.7-flash Verified 256K 256K textimage Up to 40 RPM Online Details
GitHub Models AI21 Jamba 1.5 Large 256K 0 text See provider page Online Details
NVIDIA NIM minimaxai/minimax-m2.7 Verified 205K 131K text Up to 40 RPM Online Details
Z AI (Zhipu AI) GLM-4.7-Flash 200K 128K text 1 concurrent request Online Details
GitHub Models gpt-5 200K 32K text 10 RPM, 50 RPD Online Details
GitHub Models o4-mini 200K 100K text 10 RPM, 50 RPD Online Details
Kilo Code `minimax/minimax-m2.5:free` 196K 8K text ~200 req/hr Online Details
NVIDIA NIM meta/llama-guard-4-12b Verified 164K 16K textimage Up to 40 RPM Online Details
Cohere Embed 4 131K 131K text 2,000 inputs/min Online Details
Cohere Rerank 3.5 131K 131K text 10 RPM Online Details
Groq whisper-large-v3 131K 131K text 20 RPM, 2,000 RPD Online Details
Groq whisper-large-v3-turbo 131K 131K text 20 RPM, 2,000 RPD Online Details
Kilo Code `bytedance-seed/dola-seed-2.0-pro:free` 131K 131K text ~200 req/hr Online Details
Kilo Code `arcee-ai/trinity-large-thinking:free` 131K 131K text ~200 req/hr Online Details
LLM7.io deepseek-r1-0528 131K 131K text 30 RPM (120 with token) Online Details
LLM7.io deepseek-v3-0324 131K 131K text 30 RPM (120 with token) Online Details
LLM7.io gpt-4o-mini 131K 131K text 30 RPM (120 with token) Online Details
LLM7.io qwen2.5-coder-32b 131K 131K textcode 30 RPM (120 with token) Online Details
ModelScope `Qwen/Qwen3.5-35B-A3B` 131K 131K text 2,000 RPD total; <=500 RPD/model (dynamic) Online Details
ModelScope `Qwen/Qwen3.5-27B` 131K 131K text 2,000 RPD total; <=500 RPD/model (dynamic) Online Details
ModelScope `Qwen/Qwen-Image` 131K 131K text 2,000 RPD total; model/AIGC-specific caps Online Details
SiliconFlow `deepseek-ai/DeepSeek-OCR` 131K 8K text 30 RPM, 60K TPM Online Details
SiliconFlow Abbreviation 131K 8K text See provider page Online Details
NVIDIA NIM meta/llama-3.1-70b-instruct Verified 131K 16K text Up to 40 RPM Online Details
NVIDIA NIM meta/llama-3.2-11b-vision-instruct Verified 131K 16K textimage Up to 40 RPM Online Details
NVIDIA NIM meta/llama-3.2-1b-instruct Verified 131K 60K text Up to 40 RPM Online Details
NVIDIA NIM meta/llama-3.2-3b-instruct Verified 131K 8K text Up to 40 RPM Online Details
Chutes.ai DeepSeek-R1 131K 0 text Community-powered, no hard cap Online Details
Chutes.ai Llama 3.1 70B 131K 0 text Community-powered, no hard cap Online Details
Glhf.chat Llama 3.1 70B 131K 0 text Unlimited for free models Online Details
Groq Moonshot Kimi K2 131K 0 text See provider page Online Details
Groq Moonshot Kimi K2 0905 131K 0 text See provider page Online Details
Groq GPT-OSS 120B 131K 0 text See provider page Online Details
Groq GPT-OSS 20B 131K 0 text See provider page Online Details
Groq GPT-OSS Safeguard 20B 131K 0 text See provider page Online Details
GitHub Models Phi-4 131K 0 text See provider page Online Details
GitHub Models Mistral Large (24.11) 131K 0 text See provider page Online Details
Cerebras Llama 3.1 70B 131K 0 text See provider page Online Details
Aion Labs aion-2.0 131K 32K text Daily token allowance Online Details
Aion Labs aion-1.0 131K 32K text Daily token allowance Online Details
Aion Labs aion-1.0-mini 131K 32K text Daily token allowance Online Details
Alibaba Cloud Model Studio QwQ-Plus 131K 32K text Tiered by region Online Details
xAI grok-3-mini 131K 8K text Credit-based Online Details
Cerebras qwen-3-235b-a22b-instruct-2507 131K 8K text 30 RPM, 14,400 RPD, 1M TPD Online Details
Cerebras qwen-3-32b 131K 8K text 30 RPM, 14,400 RPD, 1M TPD Online Details
Cloudflare Workers AI `@cf/meta/llama-3.3-70b-instruct-fp8-fast` 131K 131K text 10K neurons/day (shared) Online Details
Cloudflare Workers AI `@cf/meta/llama-3.1-8b-instruct-fp8-fast` 131K 131K text 10K neurons/day (shared) Online Details
Cloudflare Workers AI `@cf/meta/llama-3.2-11b-vision-instruct` 131K 131K textimage 10K neurons/day (shared) Online Details
GitHub Models Meta-Llama-3.3-70B 131K 4K text 15 RPM, 150 RPD Online Details
Groq llama-3.3-70b-versatile 131K 32K text 30 RPM, 14,400 RPD Online Details
Groq llama-3.1-8b-instant 131K 131K text 30 RPM, 14,400 RPD Online Details
Groq llama-4-maverick-17b-128e-instruct 131K 8K text 15 RPM, 500 RPD Online Details
Groq qwen3-32b 131K 131K text 30 RPM, 14,400 RPD Online Details
Groq deepseek-r1-distill-70b 131K 8K text 30 RPM, 14,400 RPD Online Details
Hugging Face Qwen2.5-7B-Instruct 131K 4K text Credit-metered Online Details
SiliconFlow `deepseek-ai/DeepSeek-R1-Distill-Qwen-7B` 131K 131K text 30 RPM, 60K TPM Online Details
Alibaba Cloud Model Studio Qwen3-Max 128K 32K text Tiered by region Online Details
Alibaba Cloud Model Studio Qwen3-VL-Plus 128K 8K textimage Tiered by region Online Details
Cohere Command R+ 128K 4K text 20 RPM Online Details
Cohere Command R7B 128K 4K text 20 RPM Online Details
DeepSeek deepseek-chat (V3.2) 128K 8K text Dynamic Online Details
DeepSeek deepseek-reasoner (R1) 128K 8K text Dynamic Online Details
Mistral AI Mistral Medium 3 128K 128K text ~1 RPS, 500K TPM Online Details
Mistral AI Mistral Nemo (12B) 128K 128K text ~1 RPS, 500K TPM Online Details
Mistral AI Pixtral Large 128K 128K textimage ~1 RPS, 500K TPM Online Details
Z AI (Zhipu AI) GLM-4.5-Flash 128K 8K text 1 concurrent request Online Details
Z AI (Zhipu AI) GLM-4.6V-Flash 128K 4K text 1 concurrent request Online Details
Cerebras llama-3.3-70b 128K 8K text 30 RPM, 14,400 RPD, 1M TPD Online Details
Cerebras gpt-oss-120b 128K 8K text 30 RPM, 14,400 RPD, 1M TPD Online Details
Cerebras llama-4-scout-17b-16e-instruct 128K 8K text 30 RPM, 14,400 RPD, 1M TPD Online Details
Cerebras zai-glm-4.7 128K 8K text 10 RPM, 100 RPD, 1M TPD Online Details
Cloudflare Workers AI `@cf/mistralai/mistral-small-3.1-24b-instruct` 128K 131K text 10K neurons/day (shared) Online Details
GitHub Models gpt-4o 128K 16K text 10 RPM, 50 RPD Online Details
GitHub Models Mistral-Small-3.1 128K 4K text 15 RPM, 150 RPD Online Details
Hugging Face Meta-Llama-3.1-8B-Instruct 128K 4K text Credit-metered Online Details
Hugging Face Phi-3.5-mini-instruct 128K 4K text Credit-metered Online Details
Nebius Meta-Llama-3.3-70B-Instruct 128K 8K text Tier-based Online Details
Nebius Qwen3-235B-A22B 128K 32K text Tier-based Online Details
Nscale Llama-3.3-70B-Instruct 128K 8K text Fair-use Online Details
Nscale DeepSeek-R1-Distill-Llama-70B 128K 32K text Fair-use Online Details
Ollama Cloud `gpt-oss:120b-cloud` 128K 131K text Session/weekly limits (unpublished) Online Details
Ollama Cloud `deepseek-v3.1:671b-cloud` 128K 131K text Session/weekly limits (unpublished) Online Details
Ollama Cloud `qwen3-coder:480b-cloud` 128K 131K textcode Session/weekly limits (unpublished) Online Details
Ollama Cloud `glm-4.6:cloud` 128K 131K text Session/weekly limits (unpublished) Online Details
Ollama Cloud `deepseek-r1:cloud` 128K 131K text Session/weekly limits (unpublished) Online Details
OVHcloud AI Endpoints Qwen2.5-VL-72B-Instruct 128K 8K textimage 2 RPM (anonymous) Online Details
OVHcloud AI Endpoints Mistral-Nemo-Instruct-2407 128K 4K text 2 RPM (anonymous) Online Details
GitHub Models DeepSeek-R1 64K 8K text 15 RPM, 150 RPD Online Details
OpenRouter Venice: Uncensored (free) Verified 33K 8K text See provider page Online Details
Glhf.chat Mixtral 8x7B 33K 0 text Unlimited for free models Online Details
Mistral AI Mistral 7B 33K 0 text See provider page Online Details
Mistral AI Mixtral 8x7B 33K 0 text See provider page Online Details
Cloudflare Workers AI Mistral 7B 33K 0 text See provider page Online Details
Cloudflare Workers AI Qwen 1.5 7B 33K 0 text See provider page Online Details
Cloudflare Workers AI `@cf/deepseek-ai/deepseek-r1-distill-qwen-32b` 32K 131K text 10K neurons/day (shared) Online Details
Hugging Face Mistral-7B-Instruct-v0.3 32K 4K text Credit-metered Online Details
Hugging Face Mixtral-8x7B-Instruct-v0.1 32K 4K text Credit-metered Online Details
LLM7.io mistral-small-3.1-24b 32K 131K text 30 RPM (120 with token) Online Details
OVHcloud AI Endpoints Qwen3Guard-Gen-8B 32K 4K text 2 RPM (anonymous) Online Details
OVHcloud AI Endpoints Qwen3Guard-Gen-0.6B 32K 4K text 2 RPM (anonymous) Online Details
Grok (xAI) Grok-2 131K 0 text $25/month free credits, resets monthly Online Details
Grok (xAI) Grok-2 Mini 131K 0 text $25/month free credits, resets monthly Online Details
NVIDIA NIM deepseek-ai/deepseek-v4-pro 131K 8K text Up to 40 RPM Unavailable Details
NVIDIA NIM nvidia/llama-3.1-nemotron-ultra-253b-v1 131K 8K text Up to 40 RPM Unavailable Details
OpenRouter Baidu Qianfan: CoBuddy Paid 131K 65K textcode See provider page Unavailable Details

How to Use Free LLM API Resources

  1. Pick a model — Click any model name to see details, rate limits, and API key signup link.
  2. Get your API key — Sign up on the provider's website (most require no credit card).
  3. Copy the config — Go to the Config Generator, pick your tool and backend, copy the ready-to-use snippet.
  4. Test it — Use the Playground to test your API key before integrating.

New to LLM terminology? Check the 📖 Glossary — 22 terms explained in plain English →

See our FAQ for common questions about free LLM APIs