Browse 136+ free LLM models from 25 providers. Filter by provider, modality, rate limit — find the perfect model for your project.

53 models verified via live API · refreshed Jun 19, 2026how we verify
Provider Model Score Context Modality Rate Limit Released Weekly Tokens Status
MiniMax: MiniMax M3 Paid Verified
101 1.0M textimage 200 req/day (free tier) Jun 1, 2026 4.4T Online
MoonshotAI: Kimi K2.6 Paid Verified
97 262K textimage 200 req/day (free tier) Apr 20, 2026 545.7B Online
95 1.0M textimage Up to 40 RPM Jun 1, 2026 Online
Z.ai: GLM 5.1 Paid Verified
94 203K text 200 req/day (free tier) Apr 7, 2026 1.1T Online
91 262K textimage Up to 40 RPM Apr 20, 2026 545.7B Online
Gemini 3.5 Flash
90 1.0M text 15 RPM, 1,500 RPD May 19, 2026 Online
DeepSeek: DeepSeek V4 Flash
89 1.0M text 200 req/day (free tier) Apr 24, 2026 4.6T Online
z-ai/glm-5.1 Verified
88 203K text Up to 40 RPM Apr 7, 2026 1.1T Online
inclusionAI: Ring-2.6-1T Paid Verified
88 262K text 200 req/day (free tier) May 8, 2026 8.4B Online
NVIDIA: Nemotron 3 Ultra (free)
88 1.0M text 200 req/day (free tier) Jun 4, 2026 744.4B Online
87 256K textimage Up to 40 RPM May 29, 2026 Online
deepseek-ai/deepseek-v4-pro
86 1.0M text Up to 40 RPM Apr 24, 2026 Online
85 205K text Up to 40 RPM Mar 18, 2026 Online
MiniMax: MiniMax M2.5 Paid Verified
84 205K text 200 req/day (free tier) Feb 12, 2026 1.3B Online
DeepSeek: DeepSeek V4 Flash
83 1.0M text Up to 40 RPM Apr 24, 2026 Online
Qwen3.6-27B
82 131K text 2 RPM (anonymous) Apr 22, 2026 Online
81 196K text ~200 req/hr Feb 12, 2026 Online
80 262K textreasoning 200 req/day (free tier) Apr 1, 2026 1.4B Online
Cohere: North Mini Code (free)
80 256K textcode 200 req/day (free tier) Jun 9, 2026 3.1B Online
Nex AGI: Nex-N2-Pro (free)
78 262K textimage 200 req/day (free tier) Jun 8, 2026 724.5B Online
77 131K textreasoning ~200 req/hr Apr 1, 2026 Online
Google: Gemma 4 31B (free)
77 262K textimage 200 req/day (free tier) Apr 2, 2026 40.7B Online
74 262K text Up to 40 RPM Feb 2, 2026 Online
Qwen/Qwen3.5-27B
74 131K text 2,000 RPD total; <=500 RPD/model (dynamic) Feb 24, 2026 Online
Google: Gemma 4 26B A4B (free)
74 262K textimage 200 req/day (free tier) Apr 2, 2026 3.6B Online
Qwen3.5-9B
73 131K text 2 RPM (anonymous) Mar 2, 2026 Online
Qwen3.5-397B-A17B
73 131K text 2 RPM (anonymous) Feb 16, 2026 Online
NVIDIA: Nemotron 3 Nano Omni (free)
72 256K textimageaudioreasoning 200 req/day (free tier) Apr 28, 2026 14.9B Online
MiniMax-M2.7
72 128K text 20 RPM, 20 RPD, 200K TPD Mar 18, 2026 Online
NVIDIA: Nemotron 3 Super (free)
72 1.0M text 200 req/day (free tier) Mar 11, 2026 360.0B Online
Owl Alpha Verified
70 1.0M text 200 req/day (free tier) Apr 28, 2026 2.4T Online
Qwen3.5-397B-A17B
70 256K textimage Up to 40 RPM Feb 16, 2026 34.7B Online
qwen/qwen3.5-122b-a10b
70 262K textimage Up to 40 RPM Feb 24, 2026 10.0B Online
69 262K text 200 req/day (free tier) Apr 28, 2026 94.7B Online
69 262K text 200 req/day (free tier) Apr 28, 2026 594.3B Online
Qwen/Qwen3.5-35B-A3B
69 131K text 2,000 RPD total; <=500 RPD/model (dynamic) Feb 24, 2026 Online
NVIDIA: Nemotron 3 Super (free)
68 262K text ~200 req/hr Mar 11, 2026 Online
67 131K text 200 req/day (free tier) Aug 5, 2025 204.4B Online
Gemini 3.1 Flash-Lite
67 1.0M text 30 RPM, 1,500 RPD Mar 3, 2026 Online
Google: Gemma 4 26B A4B (free)
66 256K text 10K neurons/day (shared) Apr 2, 2026 Online
65 256K text ~1 RPS, 500K TPM Mar 16, 2026 Online
64 128K text Session/weekly limits (unpublished) May 23, 2026 Online
64 128K textreasoning Session/weekly limits (unpublished) May 10, 2026 Online
GLM-4.7-Flash
64 200K text 1 concurrent request Jan 19, 2026 Online
NVIDIA: Nemotron 3 Nano 30B A3B (free)
63 256K text 200 req/day (free tier) Dec 14, 2025 29.2B Online
62 256K textcode ~200 req/hr Aug 28, 2025 Online
61 131K text ~200 req/hr May 10, 2026 Online
60 131K text 200 req/day (free tier) Aug 5, 2025 57.3B Online
60 131K text 200 req/day (free tier) Aug 16, 2024 91.3M Online
o4-mini
60 200K text 10 RPM, 50 RPD Apr 16, 2025 Online
Qwen: Qwen3 Coder 480B A35B (free)
60 1.0M textcode 200 req/day (free tier) Jul 23, 2025 0 Online
59 128K text Session/weekly limits (unpublished) May 23, 2026 Online
59 128K textcode Session/weekly limits (unpublished) May 23, 2026 Online
Z.ai: GLM 4.5 Air Paid Verified
59 131K text 200 req/day (free tier) Jul 28, 2025 1.5B Online
59 128K text 20 RPM, 20 RPD, 200K TPD Jun 17, 2026 Online
58 128K text 10 RPM, 100 RPD, 1M TPD May 10, 2026 Online
Gemini 2.5 Pro
58 2.0M text 5 RPM, 50 RPD Jun 5, 2025 Online
NVIDIA: Nemotron 3.5 Content Safety (free)
58 128K textimage 200 req/day (free tier) Jun 4, 2026 1.1B Online
gpt-4.1
57 1.0M text 10 RPM, 50 RPD Apr 14, 2025 Online
Google: Lyria 3 Pro Preview
57 1.0M textimage 200 req/day (free tier) Mar 30, 2026 5.5M Online
Google: Lyria 3 Clip Preview
57 1.0M textimage 200 req/day (free tier) Mar 30, 2026 3.6M Online
Qwen: Qwen3 Next 80B A3B Instruct (free)
57 262K text 200 req/day (free tier) Sep 11, 2025 523.9M Online
56 128K text 10K neurons/day (shared) Jun 17, 2026 Online
55 256K text ~1 RPS, 500K TPM Dec 2, 2025 Online
deepseek-r1-0528
55 131K textreasoning 30 RPM (120 with token) May 28, 2025 Online
gpt-4.1-mini
55 1.0M text 15 RPM, 150 RPD Apr 14, 2025 Online
gpt-5
55 200K text 10 RPM, 50 RPD Aug 7, 2025 Online
Qwen3-Coder-30B-A3B-Instruct
55 262K textcode 2 RPM (anonymous) Jul 31, 2025 Online
54 512K text 15 RPM, 150 RPD May 10, 2026 Online
NVIDIA: Nemotron Nano 12B 2 VL (free)
54 128K textimage 200 req/day (free tier) Oct 28, 2025 10.7B Online
gpt-oss-120b
54 128K text 30 RPM, 14,400 RPD, 1M TPD Aug 5, 2025 Online
53 128K text 2 RPM (anonymous) May 10, 2026 Online
53 256K text 10 RPM, 50 RPD May 10, 2026 Online
53 262K text Session/weekly limits (unpublished) May 23, 2026 Online
53 33K textreasoning 200 req/day (free tier) Jan 20, 2026 1.3B Online
52 128K text Session/weekly limits (unpublished) May 23, 2026 Online
52 33K text 200 req/day (free tier) Jul 9, 2025 Online
52 256K text ~1 RPS, 500K TPM Jun 17, 2026 Online
52 131K text 10K neurons/day (shared) Jun 17, 2026 Online
NVIDIA: Nemotron 3.5 Content Safety (free)
52 128K textimage Up to 40 RPM Jun 4, 2026 1.1B Online
deepseek-v3-0324
52 131K text 30 RPM (120 with token) Mar 25, 2025 Online
DeepSeek-V3.1
52 128K text 20 RPM, 20 RPD, 200K TPD Aug 21, 2025 Online
NVIDIA: Nemotron Nano 9B V2 (free)
52 128K text 200 req/day (free tier) Sep 5, 2025 11.1B Online
51 131K text 30 RPM, 1,000 RPD May 10, 2026 Online
51 33K text 200 req/day (free tier) Jan 5, 2026 1.2B Online
gpt-4o
51 128K text 10 RPM, 50 RPD May 13, 2024 Online
Gemini 2.5 Flash
51 1.0M text 15 RPM, 1,500 RPD May 20, 2025 Online
Aion 2.0
51 128K text 15 RPM, 20K TPD Feb 23, 2026 Online
50 10.0M text 10K neurons/day (shared) May 10, 2026 Online
Meta: Llama 3.3 70B Instruct (free)
50 131K text 200 req/day (free tier) Dec 6, 2024 389.0M Online
nvidia/llama-3.3-nemotron-super-49b-v1.5
50 131K text Up to 40 RPM Oct 10, 2025 228.8M Online
49 200K textimage 200 req/day (free tier) Feb 1, 2026 Online
49 128K text 1 concurrent request May 10, 2026 Online
49 262K textcode 10K neurons/day (shared) Jun 17, 2026 Online
49 256K textcode ~1 RPS, 500K TPM May 10, 2026 Online
Qwen2.5-VL-72B-Instruct
49 128K textimage 2 RPM (anonymous) Feb 1, 2025 Online
Mistral-Small-3.2-24B-Instruct
49 128K text 2 RPM (anonymous) Jun 20, 2025 Online
gemini-2.5-flash-lite
49 131K text 30 RPM (120 with token) Jun 17, 2025 Online
gpt-oss-20b
49 128K text 2 RPM (anonymous) Aug 5, 2025 Online
Meta: Llama 3.2 3B Instruct (free)
49 131K text 200 req/day (free tier) Sep 25, 2024 98.1M Online
mistral-small-3.1-24b
48 32K text 30 RPM (120 with token) Mar 17, 2025 Online
47 128K text ~1 RPS, 500K TPM May 10, 2026 Online
47 128K text 20 RPM, 20 RPD, 200K TPD Jun 17, 2026 Online
Meta: Llama 3.3 70B Instruct (free)
47 131K text 2 RPM (anonymous) Dec 6, 2024 Online
qwen2.5-coder-32b
47 131K textcode 30 RPM (120 with token) Nov 11, 2024 Online
gpt-4o-mini
47 131K text 30 RPM (120 with token) Jul 18, 2024 Online
qwen3-32b
47 131K text 30 RPM, 1,000 RPD Apr 28, 2025 Online
46 128K text 15 RPM, 150 RPD Mar 17, 2025 Online
46 131K text See provider page May 10, 2026 Online
Meta: Llama 3.3 70B Instruct (free)
46 131K text 15 RPM, 150 RPD Dec 6, 2024 Online
DeepSeek-R1
46 64K textreasoning 15 RPM, 150 RPD May 28, 2025 Online
45 32K text Credit-metered May 10, 2026 Online
Meta: Llama 3.3 70B Instruct (free)
45 131K text 30 RPM, 1,000 RPD Dec 6, 2024 Online
45 128K text 10K neurons/day (shared) Mar 17, 2025 Online
@cf/deepseek-ai/deepseek-r1-distill-qwen-32b
45 32K textreasoning 10K neurons/day (shared) Jan 20, 2025 Online
44 256K text 20 RPM May 10, 2026 Online
44 131K text 200 req/day (free tier) Oct 29, 2025 7.1B Online
44 131K textreasoning 30 RPM, 60K TPM May 10, 2026 Online
44 128K text Credit-metered May 10, 2026 Online
44 128K text 20 RPM Jun 17, 2026 Online
44 128K text 20 RPM May 10, 2026 Online
44 128K text 20 RPM May 10, 2026 Online
44 128K textimage ~1 RPS, 500K TPM Nov 18, 2024 Online
44 128K text 15 RPM, 20K TPD Jun 17, 2026 Online
Llama-3.1-8B-Instruct
44 131K text 2 RPM (anonymous) Jul 23, 2024 Online
43 32K text Credit-metered May 10, 2026 Online
Meta: Llama 3.2 3B Instruct (free)
43 131K text Up to 40 RPM Sep 25, 2024 Online
meta/llama-3.1-70b-instruct
43 131K text Up to 40 RPM Jul 23, 2024 Online
Meta: Llama 3.3 70B Instruct (free)
42 131K text 10K neurons/day (shared) Dec 6, 2024 Online
Llama-3.1-8B-Instruct
42 131K text 30 RPM, 1,000 RPD Jul 23, 2024 Online
41 32K text 15 RPM, 20K TPD Jun 17, 2026 Online
Qwen2.5-7B-Instruct
40 131K text Credit-metered Oct 16, 2024 Online
meta/llama-3.2-11b-vision-instruct
40 131K textimage Up to 40 RPM Sep 25, 2024 Online
meta/llama-3.2-1b-instruct
38 131K text Up to 40 RPM Sep 25, 2024 Online
Llama-3.1-8B-Instruct
36 128K text Credit-metered Jul 23, 2024 Online
meta/llama-guard-4-12b
34 164K textimage Up to 40 RPM Apr 30, 2025 Online

How to Use Free LLM API Resources

  1. Pick a free LLM model — Click any model name to see details, rate limits, and API key signup link.
  2. Get your API key — Sign up on the provider's website (most require no credit card).
  3. Copy the config — Go to the Config Generator, pick your tool and backend, copy the ready-to-use snippet.
  4. Test it — Use the Playground to test your API key before integrating.

New to LLM terminology? Check the 📖 Glossary — 22 terms explained in plain English →

FAQ: Common questions about free LLM APIs →