Free LLM API Directory (2026): Browse 350+ Models

Discover and filter 350+ free LLM models across 30 providers. Find APIs by capability (vision, reasoning), rate limits, or no-credit-card requirements, and get the perfect free AI model for your project.

258 models verified via live API · refreshed Jul 19, 2026 — how we verify

Provider	Model	Score	Context	Modality	Rate Limit	Released	Weekly Tokens	Status
NVIDIA NIM	z-ai/glm-5.2 Verified	96	1.0M	text🧠 reasoning	Up to 40 RPM	Jun 16, 2026	3.5T	Online
OpenRouter	MiniMax: MiniMax M3 Paid MiniMax: MiniMax M3	92	1.0M	text👁️ imagevideo🧠 reasoning	200 req/day (free tier)	Jun 1, 2026	3.9T	Online
OpenRouter	Tencent: Hy3 (free) Verified	91	262K	text🧠 reasoning	200 req/day (free tier)	Jul 6, 2026	11.1T	Online
NVIDIA NIM	minimaxai/minimax-m3 Verified MiniMax: MiniMax M3	89	1.0M	text👁️ imagevideo🧠 reasoning	Up to 40 RPM	Jun 1, 2026	—	Online
OpenRouter	Nex AGI: Nex-N2-Pro Paid Nex AGI: Nex-N2-Pro	89	262K	text👁️ image🧠 reasoning	200 req/day (free tier)	Jun 2, 2026	4.3B	Online
Google Gemini	Gemini 3.5 Flash Verified Gemini 3.5 Flash	89	1.0M	text👁️ imagevideoaudiopdf🧠 reasoning	15 RPM, 1,500 RPD	May 19, 2026	—	Online
NVIDIA NIM	deepseek-ai/deepseek-v4-pro Verified deepseek-ai/deepseek-v4-pro	87	1.0M	text🧠 reasoning	Up to 40 RPM	Apr 24, 2026	—	Online
OpenRouter	NVIDIA: Nemotron 3 Ultra (free) Verified NVIDIA: Nemotron 3 Ultra (free)	87	1.0M	text🧠 reasoning	200 req/day (free tier)	Jun 4, 2026	3.2T	Online
OpenRouter	DeepSeek: DeepSeek V4 Flash Paid DeepSeek: DeepSeek V4 Flash	86	1.0M	text🧠 reasoning	200 req/day (free tier)	Apr 24, 2026	5.3T	Online
OpenRouter	MoonshotAI: Kimi K2.6 Paid MoonshotAI: Kimi K2.6	85	262K	text👁️ imagevideo🧠 reasoning👁️ vision	200 req/day (free tier)	Apr 20, 2026	3.0B	Online
OpenRouter	Z.ai: GLM 5.1 Paid	84	203K	text🧠 reasoning	200 req/day (free tier)	Apr 7, 2026	190.4B	Online
NVIDIA NIM	deepseek-ai/deepseek-v4-flash Verified DeepSeek: DeepSeek V4 Flash	84	1.0M	text🧠 reasoning	Up to 40 RPM	Apr 24, 2026	—	Online
NVIDIA NIM	moonshotai/kimi-k2.6 Verified MoonshotAI: Kimi K2.6	84	262K	text👁️ imagevideo🧠 reasoning	Up to 40 RPM	Apr 20, 2026	2.9B	Online
Agnes AI	agnes-2.0-flash Verified	81	256K	text👁️ vision	30 RPM	Jun 30, 2026	—	Online
OVHcloud AI Endpoints	Qwen3.6-27B Qwen3.6-27B	80	131K	text👁️ imagevideoaudio🧠 reasoning	2 RPM (anonymous)	Apr 22, 2026	—	Online
OpenRouter	inclusionAI: Ring-2.6-1T Paid	78	262K	text🧠 reasoning	200 req/day (free tier)	May 8, 2026	7.7B	Online
NVIDIA NIM	stepfun-ai/step-3.7-flash Verified	77	256K	text👁️ imagevideo🧠 reasoning	Up to 40 RPM	May 29, 2026	—	Online
NVIDIA NIM	minimaxai/minimax-m2.7 Verified MiniMax: MiniMax M3	77	205K	text🧠 reasoning	Up to 40 RPM	Mar 18, 2026	—	Online
NVIDIA NIM	Nemotron 3 Ultra 550B A55B Verified NVIDIA: Nemotron 3 Ultra (free)	76	1.0M	🧠 reasoning		Jun 4, 2026	—	Online
OpenRouter	Google: Gemma 4 31B (free) Verified Google: Gemma 4 31B (free)	74	262K	text👁️ image🧠 reasoningvideo	200 req/day (free tier)	Apr 2, 2026	8.9B	Online
OpenRouter	MiniMax: MiniMax M2.5 Paid MiniMax: MiniMax M3	73	205K	text🧠 reasoning	200 req/day (free tier)	Feb 12, 2026	32.4B	Online
OpenRouter	Cohere: North Mini Code (free) Verified Cohere: North Mini Code (free)	73	256K	textcode🧠 reasoning	200 req/day (free tier)	Jun 9, 2026	120.8B	Online
ModelScope	deepseek-ai/DeepSeek-V4-Pro Verified deepseek-ai/deepseek-v4-pro	72	8K			Apr 24, 2026	—	Online
OpenRouter	Google: Gemma 4 26B A4B (free) Verified Google: Gemma 4 26B A4B (free)	72	262K	text👁️ image🧠 reasoningvideo	200 req/day (free tier)	Apr 2, 2026	8.6B	Online
OpenRouter	Arcee AI: Trinity Large Thinking Paid	71	262K	text🧠 reasoning	200 req/day (free tier)	Apr 1, 2026	1.1B	Online
NVIDIA NIM	qwen/qwen3.5-397b-a17b Verified qwen/qwen3.5-397b-a17b	71	262K	text👁️ imagevideoaudio🧠 reasoning	Up to 40 RPM	Feb 16, 2026	30.6B	Online
OVHcloud AI Endpoints	Qwen3.5-397B-A17B qwen/qwen3.5-397b-a17b	71	131K	text👁️ imagevideoaudio🧠 reasoning	2 RPM (anonymous)	Feb 16, 2026	—	Online
NVIDIA NIM	qwen/qwen3.5-122b-a10b Verified qwen/qwen3.5-122b-a10b	71	262K	text👁️ imagevideoaudio🧠 reasoning	Up to 40 RPM	Feb 24, 2026	16.7B	Online
ModelScope	MiniMax-M2.5-highspeed Verified MiniMax: MiniMax M3	70	205K	🧠 reasoning	See provider page	Feb 12, 2026	32.4B	Online
Kilo Code	minimax/minimax-m2.5:free MiniMax: MiniMax M3	70	196K	text🧠 reasoning	~200 req/hr	Feb 12, 2026	—	Online
SambaNova	MiniMax-M2.7 MiniMax-M2.7	70	128K	text🧠 reasoning	20 RPM, 20 RPD, 200K TPD	Mar 18, 2026	—	Online
OpenRouter	NVIDIA: Nemotron 3 Nano Omni (free) Verified NVIDIA: Nemotron 3 Nano Omni (free)	70	256K	text👁️ imageaudiovideo🧠 reasoning	200 req/day (free tier)	Apr 28, 2026	22.7B	Online
OpenRouter	NVIDIA: Nemotron 3 Super (free) Verified NVIDIA: Nemotron 3 Super (free)	70	1.0M	text🧠 reasoning	200 req/day (free tier)	Mar 11, 2026	253.4B	Online
ModelScope	deepseek-ai/DeepSeek-V4-Flash Verified DeepSeek: DeepSeek V4 Flash	69	8K			Apr 24, 2026	—	Online
Groq	qwen/qwen3.6-27b Paid Qwen3.6-27B	69	8K			Apr 27, 2026	—	Online
NVIDIA NIM	poolside/laguna-xs-2.1 Verified poolside/laguna-xs-2.1	68	262K	text🧠 reasoning	Up to 40 RPM	Jul 2, 2026	98.4B	Online
Kilo Code	arcee-ai/trinity-large-thinking:free	67	131K	text🧠 reasoning	~200 req/hr	Apr 1, 2026	—	Online
OpenRouter	Poolside: Laguna XS 2.1 (free) Verified	67	262K	text🧠 reasoning	200 req/day (free tier)	Apr 28, 2026	80.7B	Online
OpenRouter	Poolside: Laguna M.1 (free) Verified	67	262K	text🧠 reasoning	200 req/day (free tier)	Apr 28, 2026	767.9B	Online
Kilo Code	nvidia/nemotron-3-super-120b-a12b:free NVIDIA: Nemotron 3 Super (free)	66	262K	text🧠 reasoning	~200 req/hr	Mar 11, 2026	—	Online
Cloudflare Workers AI	@cf/google/gemma-4-26b-a4b-it Verified Google: Gemma 4 26B A4B (free)	65	256K	text👁️ image🧠 reasoning	10K neurons/day (shared)	Apr 2, 2026	—	Online
NVIDIA NIM	stepfun-ai/step-3.5-flash Verified	64	262K	text🧠 reasoning	Up to 40 RPM	Feb 2, 2026	—	Online
Cloudflare Workers AI	@cf/moonshotai/kimi-k2.7-code Verified MoonshotAI: Kimi K2.6	64	262K	textcode👁️ imagevideo🧠 reasoning	10K neurons/day (shared)	Jun 12, 2026	—	Online
NVIDIA NIM	Gemma 4 31B IT Verified Google: Gemma 4 31B (free)	64	262K	👁️ vision🧠 reasoning		Apr 2, 2026	—	Online
Z AI (Zhipu AI)	GLM-4.7-Flash Verified GLM-4.7-Flash	63	200K	text🧠 reasoning	1 concurrent request	Jan 19, 2026	—	Online
GitHub Models	o4-mini o4-mini	62	200K	text👁️ image🧠 reasoning	10 RPM, 50 RPD	Apr 16, 2025	—	Online
OVHcloud AI Endpoints	Qwen3.5-9B Qwen3.5-9B	62	131K	text🧠 reasoning👁️ image	2 RPM (anonymous)	Mar 2, 2026	—	Online
NVIDIA NIM	01-ai/yi-large Verified	61	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	meta/codellama-70b Verified	61	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	nvidia/llama3-chatqa-1.5-70b Verified	61	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	writer/palmyra-fin-70b-32k Verified	61	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	writer/palmyra-med-70b Verified	61	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	writer/palmyra-med-70b-32k Verified	61	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
Google Gemini	Gemini 3.1 Flash-Lite Verified Gemini 3.1 Flash-Lite	61	1.0M	text👁️ imagevideoaudiopdf🧠 reasoning	30 RPM, 1,500 RPD	Mar 3, 2026	—	Online
Kilo Code	bytedance-seed/dola-seed-2.0-pro:free	60	131K	text	~200 req/hr	May 10, 2026	—	Online
Google Gemini	Gemma 4 31B IT Verified Google: Gemma 4 31B (free)	60	262K	👁️ vision🧠 reasoning		Apr 2, 2026	—	Online
NVIDIA NIM	Nemotron 3 Nano Omni 30B A3B Reasoning Verified NVIDIA: Nemotron 3 Nano Omni (free)	60	256K	👁️ visionaudio🧠 reasoning		Apr 28, 2026	—	Online
Cerebras	zai-glm-4.7 Verified zai-glm-4.7	60	128K	text🧠 reasoning	10 RPM, 100 RPD, 1M TPD	Dec 22, 2025	—	Online
OpenRouter	Nous: Hermes 3 405B Instruct (free) Verified	59	131K	text	200 req/day (free tier)	Aug 16, 2024	53.6M	Online
Groq	Moonshot Kimi K2	59	131K	text	See provider page	Sep 5, 2025	2.9B	Online
Groq	Moonshot Kimi K2 0905	59	131K	text	See provider page	Sep 5, 2025	2.9B	Online
SambaNova	DeepSeek-V3.2 (Preview)	59	128K	text	20 RPM, 20 RPD, 200K TPD	Jun 17, 2026	—	Online
NVIDIA NIM	Nemotron 3 Super 120B A12B Verified NVIDIA: Nemotron 3 Super (free)	59	262K	🧠 reasoning		Mar 11, 2026	—	Online
Z AI (Zhipu AI)	GLM-4.6V-Flash Verified GLM-4.6V-Flash	59	128K	text👁️ imagevideo🧠 reasoning	1 concurrent request	Dec 8, 2025	—	Online
OpenRouter	Qwen: Qwen3 Coder 480B A35B (free) Verified Qwen: Qwen3 Coder 480B A35B (free)	59	1.0M	textcode	200 req/day (free tier)	Jul 23, 2025	0	Online
GitHub Models	gpt-4.1 gpt-4.1	59	1.0M	text👁️ imagepdf	10 RPM, 50 RPD	Apr 14, 2025	—	Online
ModelScope	Qwen/Qwen3.5-397B-A17B Verified qwen/qwen3.5-397b-a17b	58	8K			Feb 16, 2026	—	Online
ModelScope	Qwen/Qwen3.5-122B-A10B Verified qwen/qwen3.5-122b-a10b	58	8K			Feb 25, 2026	—	Online
OpenRouter	OpenAI: gpt-oss-120b Paid OpenAI: gpt-oss-120b	58	131K	text🧠 reasoning	200 req/day (free tier)	Aug 5, 2025	26.9B	Online
Mistral AI	Mistral Small 4	57	256K	text	~1 RPS, 500K TPM	Mar 16, 2026	—	Online
Google Gemini	Gemma 4 26B A4B IT Verified Google: Gemma 4 26B A4B (free)	57	262K	👁️ vision🧠 reasoning		Apr 3, 2026	—	Online
OpenRouter	Qwen: Qwen3 VL 235B A22B Instruct Paid Qwen: Qwen3 VL 235B A22B Instruct	57	131K	text👁️ image	200 req/day (free tier)	Sep 23, 2025	11.2B	Online
NVIDIA NIM	nvidia/llama-3.1-nemotron-ultra-253b-v1 Verified nvidia/llama-3.1-nemotron-ultra-253b-v1	57	131K	text🧠 reasoning	Up to 40 RPM	Apr 7, 2025	—	Online
OpenRouter	NVIDIA: Nemotron 3.5 Content Safety (free) Verified NVIDIA: Nemotron 3.5 Content Safety (free)	57	128K	text👁️ image🧠 reasoning	200 req/day (free tier)	Jun 4, 2026	1.4B	Online
NVIDIA NIM	ibm/granite-34b-code-instruct Verified	56	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
OpenRouter	Qwen: Qwen3 Coder 30B A3B Instruct Paid Qwen: Qwen3 Coder 30B A3B Instruct	56	160K	textcode	200 req/day (free tier)	Jul 31, 2025	4.6B	Online
Google Gemini	Gemini Flash Latest Verified Gemini Flash Latest	56	1.0M	👁️ visionaudio🧠 reasoning		Apr 27, 2026	—	Online
NVIDIA NIM	mistralai/mistral-large-3-675b-instruct-2512 Verified	55	8K			Jun 29, 2026	—	Online
Kilo Code	x-ai/grok-code-fast-1:free	55	256K	textcode	~200 req/hr	Aug 28, 2025	—	Online
GitHub Models	AI21 Jamba 1.5 Large	55	256K	text🧠 reasoning	See provider page	Aug 29, 2024	—	Online
Agnes AI	agnes-1.5-flash Verified	55	256K	text👁️ vision	30 RPM	Jun 30, 2026	—	Online
GitHub Models	gpt-5 gpt-5	55	200K	text👁️ image🧠 reasoning	10 RPM, 50 RPD	Aug 7, 2025	—	Online
OpenRouter	Qwen: Qwen3 Next 80B A3B Instruct (free) Verified Qwen: Qwen3 Next 80B A3B Instruct (free)	55	262K	text🧠 reasoning	200 req/day (free tier)	Sep 11, 2025	138.6M	Online
SambaNova	gemma-4-31B-it (Preview) gemma-4-31B-it (Preview)	55	128K	text👁️ image🧠 reasoning	20 RPM, 20 RPD, 200K TPD	Apr 2, 2026	—	Online
OpenRouter	Qwen: Qwen3 VL 8B Thinking Paid Qwen: Qwen3 VL 8B Thinking	55	256K	text👁️ image🧠 reasoning	200 req/day (free tier)	Oct 14, 2025	134.3M	Online
OpenRouter	Qwen: Qwen3 235B A22B Paid Qwen: Qwen3 235B A22B	55	131K	text🧠 reasoning	200 req/day (free tier)	Apr 28, 2025	4.8B	Online
Google Gemini	Gemini 3 Flash (Preview) Verified	54	1.0M	text	Preview limits	Dec 17, 2025	—	Online
NVIDIA NIM	bigcode/starcoder2-15b Verified	54	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	google/deplot Verified	54	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	google/gemma-2b Verified	54	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	google/recurrentgemma-2b Verified	54	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	microsoft/kosmos-2 Verified	54	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	microsoft/phi-3-vision-128k-instruct Verified	54	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	microsoft/phi-3.5-moe-instruct Verified	54	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	mistralai/codestral-22b-instruct-v0.1 Verified	54	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	mistralai/mixtral-8x22b-v0.1 Verified	54	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	nvidia/llama-3.1-nemotron-51b-instruct Verified	54	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	nvidia/nemoretriever-parse Verified	54	131K	rerank	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	nvidia/nemotron-4-340b-instruct Verified	54	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	nvidia/nemotron-4-340b-reward Verified	54	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	nvidia/nemotron-parse Verified	54	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	nvidia/neva-22b Verified	54	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	nvidia/nvclip Verified	54	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	nvidia/riva-translate-4b-instruct Verified	54	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	nvidia/vila Verified	54	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	writer/palmyra-creative-122b Verified	54	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	nvidia/nemotron-3.5-content-safety Verified NVIDIA: Nemotron 3.5 Content Safety (free)	54	128K	text👁️ image🧠 reasoning	Up to 40 RPM	Jun 4, 2026	1.4B	Online
OVHcloud AI Endpoints	Qwen3-Coder-30B-A3B-Instruct Qwen: Qwen3 Coder 30B A3B Instruct	54	262K	textcode	2 RPM (anonymous)	Jul 31, 2025	—	Online
OpenRouter	Google: Lyria 3 Pro Preview Verified Google: Lyria 3 Pro Preview	54	1.0M	text👁️ imageaudio	200 req/day (free tier)	Mar 30, 2026	11.1M	Online
OpenRouter	Google: Lyria 3 Clip Preview Verified Google: Lyria 3 Clip Preview	54	1.0M	text👁️ imageaudio	200 req/day (free tier)	Mar 30, 2026	9.7M	Online
OpenRouter	Qwen: Qwen3 VL 8B Instruct Paid Qwen: Qwen3 VL 8B Instruct	54	256K	text👁️ image	200 req/day (free tier)	Oct 14, 2025	9.3B	Online
OpenRouter	NVIDIA: Nemotron 3 Nano 30B A3B (free) Verified NVIDIA: Nemotron 3 Nano 30B A3B (free)	54	256K	text🧠 reasoning	200 req/day (free tier)	Dec 14, 2025	36.2B	Online
Groq	GPT OSS 120B Paid	53	131K	🧠 reasoningtext		Aug 5, 2025	—	Online
NVIDIA NIM	GPT OSS 120B Verified	53	131K	🧠 reasoning		Jul 10, 2026	—	Online
Cloudflare Workers AI	@cf/openai/gpt-oss-120b Verified	53	128K	text🧠 reasoning	10K neurons/day (shared)	Aug 5, 2025	—	Online
Google Gemini	Gemini 3.1 Flash Lite Verified Gemini 3.1 Flash-Lite	53	1.0M	👁️ visionaudio🧠 reasoning		May 7, 2026	—	Online
Cerebras	gpt-oss-120b Verified gpt-oss-120b	53	128K	text🧠 reasoning	30 RPM, 14,400 RPD, 1M TPD	Aug 5, 2025	—	Online
OpenRouter	Z.ai: GLM 4.5 Air Paid Z.ai: GLM 4.5 Air	53	131K	text🧠 reasoning	200 req/day (free tier)	Jul 28, 2025	1.3B	Online
OpenRouter	Qwen: Qwen3 235B A22B Thinking 2507 Paid Qwen: Qwen3 235B A22B Thinking 2507	53	262K	text🧠 reasoning	200 req/day (free tier)	Jul 25, 2025	4.8B	Online
OpenRouter	NVIDIA: Nemotron Nano 12B 2 VL (free) Verified NVIDIA: Nemotron Nano 12B 2 VL (free)	53	128K	text👁️ imagevideo🧠 reasoning	200 req/day (free tier)	Oct 28, 2025	8.3B	Online
NVIDIA NIM	adept/fuyu-8b Verified	52	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	aisingapore/sea-lion-7b-instruct Verified	52	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	deepseek-ai/deepseek-coder-6.7b-instruct Verified	52	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	google/codegemma-1.1-7b Verified	52	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	google/codegemma-7b Verified	52	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	ibm/granite-3.0-8b-instruct Verified	52	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	ibm/granite-8b-code-instruct Verified	52	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	mistralai/mistral-7b-instruct-v0.3 Verified	52	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	nv-mistralai/mistral-nemo-12b-instruct Verified	52	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	nvidia/cosmos-reason2-8b Verified	52	131K	text🧠 reasoning	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	nvidia/mistral-nemo-minitron-8b-8k-instruct Verified	52	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	zyphra/zamba2-7b-instruct Verified	52	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
Mistral AI	Mistral Medium 3.5 (128B)	52	256K	text	~1 RPS, 500K TPM	Jun 17, 2026	—	Online
Agnes AI	agnes-image-2.0-flash Verified	52	4K	👁️ image	30 RPM (1K)	Jun 30, 2026	—	Online
Agnes AI	agnes-image-2.1-flash Verified	52	4K	👁️ image	30 RPM (1K)	Jun 30, 2026	—	Online
ModelScope	deepseek-ai/DeepSeek-V3.2 Verified deepseek-ai/DeepSeek-V3.2	52	8K			Dec 1, 2025	—	Online
OpenRouter	Qwen: Qwen3 Next 80B A3B Thinking Paid Qwen: Qwen3 Next 80B A3B Thinking	52	262K	text🧠 reasoning	200 req/day (free tier)	Sep 11, 2025	402.8M	Online
Google Gemini	Gemini 2.5 Flash Verified Gemini 2.5 Flash	52	1.0M	text👁️ imageaudiovideopdf🧠 reasoning	15 RPM, 1,500 RPD	May 20, 2025	—	Online
Google Gemini	Gemini 2.5 Pro Verified Gemini 2.5 Pro	52	1.0M	text👁️ imageaudiovideopdf🧠 reasoning	5 RPM, 50 RPD	Jun 5, 2025	—	Online
SambaNova	DeepSeek-V3.1 DeepSeek-V3.1	52	128K	text	20 RPM, 20 RPD, 200K TPD	Aug 21, 2025	—	Online
OpenRouter	Venice: Uncensored (free) Verified	51	33K	text	200 req/day (free tier)	Jul 9, 2025	—	Online
NVIDIA NIM	abacusai/dracarys-llama-3.1-70b-instruct Verified	51	8K			Jun 29, 2026	—	Online
OpenRouter	OpenAI: gpt-oss-20b (free) Verified OpenAI: gpt-oss-120b	51	131K	text🧠 reasoning	200 req/day (free tier)	Aug 5, 2025	18.4B	Online
GitHub Models	Llama-4-Scout-17B-16E Llama-4-Scout-17B-16E	51	512K	text👁️ image	15 RPM, 150 RPD	Apr 5, 2025	—	Online
GitHub Models	Llama-4-Maverick-17B-128E Llama-4-Scout-17B-16E	51	256K	text👁️ image	10 RPM, 50 RPD	Apr 5, 2025	—	Online
GitHub Models	gpt-4o gpt-4o	51	128K	text👁️ imagepdfaudio	10 RPM, 50 RPD	May 13, 2024	—	Online
OpenRouter	NVIDIA: Nemotron Nano 9B V2 (free) Verified NVIDIA: Nemotron Nano 9B V2 (free)	51	128K	text🧠 reasoning	200 req/day (free tier)	Sep 5, 2025	12.4B	Online
GitHub Models	Mistral-Small-3.1	50	128K	text	15 RPM, 150 RPD	Mar 17, 2025	—	Online
NVIDIA NIM	baai/bge-m3 Verified	50	131K	text	Up to 40 RPM	Jan 30, 2024	12.0B	Online
OpenCode Zen	Nemotron 3 Ultra 550B A55B Verified	50	1.0M	🧠 reasoning		Jun 28, 2026	—	Online
OpenRouter	Qwen: Qwen3 30B A3B Paid Qwen: Qwen3 30B A3B	50	131K	text🧠 reasoning	200 req/day (free tier)	Apr 28, 2025	19.8B	Online
GitHub Models	DeepSeek-R1 DeepSeek-R1	50	64K	text🧠 reasoning	15 RPM, 150 RPD	May 28, 2025	—	Online
OVHcloud AI Endpoints	Qwen2.5-VL-72B-Instruct Qwen2.5-VL-72B-Instruct	50	128K	text👁️ image	2 RPM (anonymous)	Sep 1, 2024	—	Online
OVHcloud AI Endpoints	Mistral-Small-3.2-24B-Instruct Mistral-Small-3.2-24B-Instruct	50	128K	text	2 RPM (anonymous)	Jun 20, 2025	—	Online
OVHcloud AI Endpoints	Mistral-Nemo-Instruct-2407 Mistral-Nemo-Instruct-2407	50	128K	text	2 RPM (anonymous)	Jul 1, 2024	—	Online
Z AI (Zhipu AI)	GLM-4.5-Flash Verified GLM-4.5-Flash	50	128K	text	1 concurrent request	Jul 25, 2025	—	Online
GitHub Models	gpt-4.1-mini gpt-4.1-mini	50	1.0M	text👁️ imagepdf	15 RPM, 150 RPD	Apr 14, 2025	—	Online
Mistral AI	Codestral	49	256K	textcode	~1 RPS, 500K TPM	May 10, 2026	—	Online
Agnes AI	agnes-video-v2.0 Verified	49	4K	video	2 RPM	Jun 30, 2026	—	Online
Mistral AI	Mistral Large 3	49	256K	text	~1 RPS, 500K TPM	Dec 2, 2025	—	Online
OpenRouter	Qwen: Qwen3 30B A3B Thinking 2507 Paid Qwen: Qwen3 30B A3B Thinking 2507	49	131K	text🧠 reasoning	200 req/day (free tier)	Aug 28, 2025	1.2B	Online
NVIDIA NIM	nvidia/llama-3.3-nemotron-super-49b-v1.5 Verified nvidia/llama-3.3-nemotron-super-49b-v1.5	49	131K	text🧠 reasoning	Up to 40 RPM	Jul 25, 2025	229.1M	Online
OVHcloud AI Endpoints	gpt-oss-20b gpt-oss-20b	49	128K	text🧠 reasoning	2 RPM (anonymous)	Aug 5, 2025	—	Online
OpenRouter	Qwen: Qwen3 32B Paid Qwen: Qwen3 32B	48	131K	text🧠 reasoning	200 req/day (free tier)	Apr 28, 2025	13.8B	Online
Aion Labs	Aion 2.0 Aion 2.0	48	128K	text	15 RPM, 20K TPD	Feb 23, 2026	—	Online
OpenRouter	Qwen: Qwen3 14B Paid Qwen: Qwen3 14B	48	132K	text🧠 reasoning	200 req/day (free tier)	Apr 28, 2025	2.2B	Online
Cloudflare Workers AI	@cf/nvidia/nemotron-3-120b-a12b Verified	47	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/baai/bge-large-en-v1.5 Verified	47	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/meta/llama-4-scout-17b-16e-instruct Verified	47	10.0M	text👁️ image	10K neurons/day (shared)	Apr 5, 2025	—	Online
OpenRouter	Free Models Router Verified	47	200K	text👁️ image🧠 reasoning	200 req/day (free tier)	Feb 1, 2026	—	Online
NVIDIA NIM	mistralai/mistral-medium-3.5-128b Verified	47	8K			Jun 29, 2026	—	Online
NVIDIA NIM	ibm/granite-3.0-3b-a800m-instruct Verified	47	131K	text	Up to 40 RPM	Jun 17, 2026	—	Online
Glhf.chat	Mixtral 8x7B Verified	47	33K	text	Unlimited for free models	Jun 29, 2026	—	Online
Groq	GPT OSS 20B Paid	47	131K	🧠 reasoningtext		Aug 5, 2025	—	Online
NVIDIA NIM	nvidia/nemotron-nano-3-30b-a3b Verified nvidia/llama-3.1-nemotron-ultra-253b-v1	47	131K	text🧠 reasoning	Up to 40 RPM	Dec 15, 2025	—	Online
OpenRouter	Meta: Llama 3.3 70B Instruct (free) Verified Meta: Llama 3.3 70B Instruct (free)	47	131K	text	200 req/day (free tier)	Dec 6, 2024	214.6M	Online
NVIDIA NIM	Llama 3.3 Nemotron Super 49B v1 Verified	46	131K	🧠 reasoning		Jun 29, 2026	—	Online
NVIDIA NIM	GPT OSS 20B Verified	46	131K	🧠 reasoning		Jul 10, 2026	—	Online
ModelScope	GLM-4.7-FlashX Verified	46	200K	🧠 reasoning		Jun 29, 2026	—	Online
OpenRouter	Qwen: Qwen3 8B Paid Qwen: Qwen3 8B	46	131K	text🧠 reasoning	200 req/day (free tier)	Apr 28, 2025	3.2B	Online
Cloudflare Workers AI	@cf/mistralai/mistral-small-3.1-24b-instruct Verified @cf/mistralai/mistral-small-3.1-24b-instruct	46	128K	text	10K neurons/day (shared)	Mar 18, 2025	—	Online
OpenCode Zen	DeepSeek V4 Flash Verified	45	1.0M	🧠 reasoning		Jun 28, 2026	—	Online
NVIDIA NIM	mistralai/mistral-large-2-instruct Verified	45	131K	text	Up to 40 RPM	Nov 18, 2024	1.3B	Online
SiliconFlow	Abbreviation	45	131K	text	See provider page	May 10, 2026	—	Online
NVIDIA NIM	Nemotron Mini 4B Instruct Verified	45	128K	text		Jun 29, 2026	—	Online
NVIDIA NIM	nvidia/llama-nemotron-embed-1b-v2 Verified nvidia/llama-3.1-nemotron-ultra-253b-v1	45	131K	embeddingtext👁️ image	Up to 40 RPM	Feb 10, 2026	—	Online
NVIDIA NIM	nvidia/llama-nemotron-embed-vl-1b-v2 Verified nvidia/llama-3.1-nemotron-ultra-253b-v1	45	131K	embeddingtext👁️ image	Up to 40 RPM	Feb 10, 2026	8.4B	Online
Chutes.ai	DeepSeek-R1 DeepSeek-R1	45	131K	text🧠 reasoning	Community-powered, no hard cap	May 28, 2025	—	Online
Groq	llama-3.3-70b-versatile Paid Meta: Llama 3.3 70B Instruct (free)	45	131K	text	30 RPM, 1,000 RPD	Dec 6, 2024	—	Online
GitHub Models	Meta-Llama-3.3-70B Meta: Llama 3.3 70B Instruct (free)	45	131K	text	15 RPM, 150 RPD	Dec 6, 2024	—	Online
NVIDIA NIM	meta/llama-3.1-70b-instruct Verified meta/llama-3.1-70b-instruct	45	131K	text	Up to 40 RPM	Jul 23, 2024	—	Online
Hugging Face	Mixtral-8x7B-Instruct-v0.1	44	32K	text	Credit-metered	May 10, 2026	—	Online
Cohere	Command A (111B)	44	256K	text	20 RPM	May 10, 2026	—	Online
Cloudflare Workers AI	Mistral 7B Verified	44	33K	text	See provider page	Jun 27, 2026	—	Online
Cloudflare Workers AI	Qwen 1.5 7B	44	33K	text	See provider page	Jun 27, 2026	—	Online
ModelScope	GLM-5.2 Verified	44	1.0M	🧠 reasoning		Jun 29, 2026	—	Online
NVIDIA NIM	bytedance/seed-oss-36b-instruct Verified	44	8K			Jun 29, 2026	—	Online
NVIDIA NIM	google/diffusiongemma-26b-a4b-it Verified	44	8K			Jun 29, 2026	—	Online
NVIDIA NIM	google/gemma-2-2b-it Verified	44	8K			Jun 29, 2026	—	Online
NVIDIA NIM	google/gemma-3n-e2b-it Verified	44	8K			Jun 29, 2026	—	Online
NVIDIA NIM	meta/llama-3.2-90b-vision-instruct Verified	44	8K			Jun 29, 2026	—	Online
NVIDIA NIM	meta/llama-4-maverick-17b-128e-instruct Verified	44	8K			Jun 29, 2026	—	Online
NVIDIA NIM	mistralai/mistral-nemotron Verified	44	8K			Jun 29, 2026	—	Online
NVIDIA NIM	mistralai/mistral-small-4-119b-2603 Verified	44	8K			Jun 29, 2026	—	Online
NVIDIA NIM	nvidia/gliner-pii Verified	44	8K			Jun 29, 2026	—	Online
NVIDIA NIM	nvidia/ising-calibration-1-35b-a3b Verified	44	8K			Jun 29, 2026	—	Online
NVIDIA NIM	nvidia/riva-translate-4b-instruct-v1.1 Verified	44	8K			Jun 29, 2026	—	Online
NVIDIA NIM	sarvamai/sarvam-m Verified	44	8K			Jun 29, 2026	—	Online
NVIDIA NIM	mistralai/mixtral-8x7b-instruct-v0.1 Verified	44	8K			Jun 29, 2026	—	Online
Hugging Face	Phi-3.5-mini-instruct	44	128K	text	Credit-metered	May 10, 2026	—	Online
Mistral AI	Mixtral 8x7B	44	33K	text	See provider page	Dec 11, 2023	—	Online
Cohere	Command A+ (218B)	44	128K	text	20 RPM	Jun 17, 2026	—	Online
Cohere	Command R+	44	128K	text	20 RPM	May 10, 2026	—	Online
Cohere	Command R7B	44	128K	text	20 RPM	May 10, 2026	—	Online
ModelScope	MiniMax-M3 Verified	44	512K	👁️ vision🧠 reasoning		Jun 29, 2026	—	Online
Groq	whisper-large-v3 Verified	44	131K	text	20 RPM, 2,000 RPD	May 10, 2026	—	Online
Groq	whisper-large-v3-turbo Verified	44	131K	text	20 RPM, 2,000 RPD	May 10, 2026	—	Online
SiliconFlow	deepseek-ai/DeepSeek-R1-Distill-Qwen-7B	44	131K	text🧠 reasoning	30 RPM, 60K TPM	May 10, 2026	—	Online
NVIDIA NIM	nvidia/llama-3.1-nemotron-70b-instruct Verified nvidia/llama-3.1-nemotron-ultra-253b-v1	44	131K	text	Up to 40 RPM	Oct 15, 2024	—	Online
NVIDIA NIM	Nemotron 3 Nano 30B A3B Verified NVIDIA: Nemotron 3 Nano 30B A3B (free)	44	262K	🧠 reasoning		Dec 14, 2025	—	Online
OVHcloud AI Endpoints	Meta-Llama-3_3-70B-Instruct Verified Meta: Llama 3.3 70B Instruct (free)	44	131K	text	2 RPM (anonymous)	Dec 6, 2024	—	Online
OpenRouter	Meta: Llama 3.2 3B Instruct (free) Verified Meta: Llama 3.2 3B Instruct (free)	44	131K	text	200 req/day (free tier)	Sep 25, 2024	57.1M	Online
GitHub Models	Phi-4 Phi-4	44	131K	text🧠 reasoning	See provider page	Dec 12, 2024	—	Online
ModelScope	opencompass/CompassJudger-1-32B-Instruct Verified	43	8K			Jun 29, 2026	—	Online
ModelScope	XGenerationLab/XiYanSQL-QwenCoder-32B-2412 Verified	43	8K			Jun 29, 2026	—	Online
ModelScope	XGenerationLab/XiYanSQL-QwenCoder-32B-2504 Verified	43	8K			Jun 29, 2026	—	Online
ModelScope	stepfun-ai/Step-3.5-Flash Verified	43	8K			Jun 29, 2026	—	Online
ModelScope	stepfun-ai/Step-3.7-Flash Verified	43	8K			Jun 29, 2026	—	Online
Aion Labs	Aion 2.5	43	128K	text	15 RPM, 20K TPD	Jun 17, 2026	—	Online
ModelScope	Kimi K2.5 Verified	43	262K	👁️ vision🧠 reasoning		Jun 29, 2026	—	Online
OpenRouter	OpenAI: gpt-oss-safeguard-20b Paid	43	131K	text🧠 reasoning	200 req/day (free tier)	Oct 29, 2025	6.0B	Online
ModelScope	GLM-5.1 Verified	43	200K	🧠 reasoning		Jun 29, 2026	—	Online
ModelScope	GLM-5.1 Verified	43	200K	🧠 reasoning		Jun 29, 2026	—	Online
Hugging Face	Mistral-7B-Instruct-v0.3	43	32K	text	Credit-metered	May 10, 2026	—	Online
Cloudflare Workers AI	@cf/zai-org/glm-4.7-flash Verified	43	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/qwen/qwq-32b Verified	43	8K			Jun 29, 2026	—	Online
NVIDIA NIM	upstage/solar-10.7b-instruct Verified	43	8K			Jun 29, 2026	—	Online
OpenCode Zen	MiMo-V2.5 Verified	43	1.0M	👁️ visionaudio🧠 reasoning		Jun 28, 2026	—	Online
NVIDIA NIM	qwen/qwen3-next-80b-a3b-instruct Verified Qwen: Qwen3 Next 80B A3B Instruct (free)	43	8K			Sep 11, 2025	—	Online
Mistral AI	Mistral Nemo (12B) Mistral-Nemo-Instruct-2407	43	128K	text	~1 RPS, 500K TPM	Jul 1, 2024	—	Online
Groq	llama-3.1-8b-instant Paid llama-3.1-8b-instant	43	131K	text	30 RPM, 1,000 RPD	Jul 23, 2024	—	Online
NVIDIA NIM	meta/llama-3.2-11b-vision-instruct Verified meta/llama-3.2-11b-vision-instruct	43	131K	text👁️ image	Up to 40 RPM	Sep 25, 2024	—	Online
Mistral AI	Mistral 7B	42	33K	text	See provider page	Sep 27, 2023	—	Online
NVIDIA NIM	ai21labs/jamba-1.5-large-instruct Verified	42	131K	text	Up to 40 RPM	Aug 22, 2024	—	Online
Groq	groq/compound Verified	42	8K			Jul 10, 2026	—	Online
Google Gemini	Gemini Flash-Lite Latest Verified	42	1.0M	👁️ visionaudio🧠 reasoning		Jun 29, 2026	—	Online
Google Gemini	gemini-robotics-er-1.6-preview Verified	42	131K			Jun 29, 2026	—	Online
Cerebras	Gemma 4 31B IT Verified	42	262K	👁️ vision🧠 reasoning		Jul 10, 2026	—	Online
NVIDIA NIM	Nemotron Nano 12B v2 VL Verified NVIDIA: Nemotron Nano 12B 2 VL (free)	42	128K	👁️ vision🧠 reasoning		Oct 28, 2025	—	Online
NVIDIA NIM	meta/llama-3.2-3b-instruct Verified Meta: Llama 3.2 3B Instruct (free)	42	131K	text	Up to 40 RPM	Sep 25, 2024	—	Online
OVHcloud AI Endpoints	Llama-3.1-8B-Instruct llama-3.1-8b-instant	42	131K	text	2 RPM (anonymous)	Jul 23, 2024	—	Online
ModelScope	Qwen/Qwen3-235B-A22B-Instruct-2507 Verified Qwen/Qwen3-235B-A22B-Instruct-2507	42	8K			Jul 21, 2025	—	Online
Z AI (Zhipu AI)	GLM-4.5-Air Verified GLM-4.5-Air	42	131K	🧠 reasoning		Jul 25, 2025	—	Online
NVIDIA NIM	nvidia/embed-qa-4 Verified	41	131K	embedding	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	nvidia/llama-3.2-nemoretriever-1b-vlm-embed-v1 Verified	41	131K	embeddingrerank	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	nvidia/llama-3.2-nv-embedqa-1b-v1 Verified	41	131K	embedding	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	nvidia/nemotron-3-embed-1b Verified	41	131K	embedding	Up to 40 RPM	Jul 17, 2026	96.4M	Online
NVIDIA NIM	nvidia/nv-embedqa-e5-v5 Verified	41	131K	embedding	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	nvidia/nv-embedqa-mistral-7b-v2 Verified	41	131K	embedding	Up to 40 RPM	Jun 17, 2026	—	Online
NVIDIA NIM	snowflake/arctic-embed-l Verified	41	131K	embedding	Up to 40 RPM	Jun 17, 2026	—	Online
Groq	allam-2-7b Verified	41	8K			Jul 10, 2026	—	Online
Groq	groq/compound-mini Verified	41	8K			Jul 10, 2026	—	Online
Aion Labs	Aion-RP 1.0 (8B)	41	32K	text	15 RPM, 20K TPD	Jun 17, 2026	—	Online
NVIDIA NIM	databricks/dbrx-instruct Verified	41	131K	text	Up to 40 RPM	Mar 27, 2024	—	Online
NVIDIA NIM	meta/llama2-70b Verified	41	131K	text	Up to 40 RPM	Jul 18, 2023	—	Online
ModelScope	LLM-Research/Llama-4-Maverick-17B-128E-Instruct Verified	41	8K			Jun 29, 2026	—	Online
ModelScope	MedAIBase/AntAngelMed Verified	41	8K			Jun 29, 2026	—	Online
ModelScope	MiniMax/MiniMax-M1-80k Verified	41	8K			Jun 29, 2026	—	Online
ModelScope	MusePublic/Qwen-Image-Edit Verified	41	8K			Jun 29, 2026	—	Online
ModelScope	OpenGVLab/InternVL3_5-241B-A28B Verified	41	8K			Jun 29, 2026	—	Online
ModelScope	PaddlePaddle/ERNIE-4.5-21B-A3B-PT Verified	41	8K			Jun 29, 2026	—	Online
ModelScope	PaddlePaddle/ERNIE-4.5-300B-A47B-PT Verified	41	8K			Jun 29, 2026	—	Online
ModelScope	PaddlePaddle/ERNIE-4.5-VL-28B-A3B-PT Verified	41	8K			Jun 29, 2026	—	Online
ModelScope	Qwen/Qwen-Image-Edit Verified	41	8K			Jun 29, 2026	—	Online
ModelScope	Qwen/Qwen3-4B Verified	41	8K			Jun 29, 2026	—	Online
ModelScope	Shanghai_AI_Laboratory/Intern-S1 Verified	41	8K			Jun 29, 2026	—	Online
ModelScope	Shanghai_AI_Laboratory/Intern-S2-Preview Verified	41	8K			Jun 29, 2026	—	Online
ModelScope	Tencent-Hunyuan/Hy3 Verified	41	8K			Jul 10, 2026	—	Online
ModelScope	Qwen/Qwen3-VL-235B-A22B-Instruct Verified Qwen: Qwen3 VL 235B A22B Instruct	41	8K			Sep 23, 2025	—	Online
Cerebras	Llama 3.1 70B meta/llama-3.1-70b-instruct	41	131K	text	See provider page	Jul 23, 2024	—	Online
Cloudflare Workers AI	@cf/deepseek-ai/deepseek-r1-distill-qwen-32b Verified @cf/deepseek-ai/deepseek-r1-distill-qwen-32b	41	32K	text🧠 reasoning	10K neurons/day (shared)	Jan 20, 2025	—	Online
NVIDIA NIM	meta/llama-3.2-1b-instruct Verified meta/llama-3.2-1b-instruct	41	131K	text	Up to 40 RPM	Sep 25, 2024	—	Online
OpenCode Zen	North Mini Code Verified	40	256K	🧠 reasoning		Jun 28, 2026	—	Online
Grok (xAI)	Grok-2 Mini	40	131K	text	$25/month free credits, resets monthly	Jun 27, 2026	—	Online
Cloudflare Workers AI	@cf/baai/bge-m3 Verified	40	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/google/gemma-2b-it-lora Verified	40	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/moonshotai/kimi-k2.6 Verified	40	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/ibm-granite/granite-4.0-h-micro Verified	40	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/baai/bge-small-en-v1.5 Verified	40	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/zai-org/glm-5.2 Verified	40	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/baai/bge-base-en-v1.5 Verified	40	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/aisingapore/gemma-sea-lion-v4-27b-it Verified	40	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/openai/gpt-oss-20b Verified	40	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/moondream/moondream3.1-9B-A2B Verified	40	8K			Jul 10, 2026	—	Online
LLM7.io	Codestral (latest) Verified	40	256K	text		Jun 29, 2026	—	Online
ModelScope	Qwen/Qwen3-Coder-30B-A3B-Instruct Verified Qwen: Qwen3 Coder 30B A3B Instruct	40	8K			Jul 31, 2025	—	Online
Cloudflare Workers AI	@cf/meta/llama-3.3-70b-instruct-fp8-fast Verified Meta: Llama 3.3 70B Instruct (free)	40	131K	text	10K neurons/day (shared)	Dec 6, 2024	—	Online
Chutes.ai	Llama 3.1 70B meta/llama-3.1-70b-instruct	40	131K	text	Community-powered, no hard cap	Jul 23, 2024	—	Online
NVIDIA NIM	mistralai/ministral-14b-instruct-2512 Verified mistralai/ministral-14b-instruct-2512	40	8K			Dec 2, 2025	—	Online
Hugging Face	Qwen2.5-7B-Instruct Qwen2.5-7B-Instruct	40	131K	text	Credit-metered	Oct 16, 2024	—	Online
Google Gemini	Gemini 2.5 Flash-Lite Verified Gemini 2.5 Flash-Lite	40	1.0M	👁️ visionaudio🧠 reasoning		Jul 22, 2025	—	Online
Mistral AI	Pixtral Large Pixtral Large	40	128K	text👁️ image	~1 RPS, 500K TPM	Nov 18, 2024	—	Online
GitHub Models	Mistral Large (24.11) Mistral Large (24.11)	40	131K	text👁️ image🧠 reasoning	See provider page	Feb 26, 2024	—	Online
ModelScope	meituan-longcat/LongCat-Flash-Lite Verified	39	8K			Jun 29, 2026	—	Online
ModelScope	Shanghai_AI_Laboratory/Intern-S1-mini Verified	39	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/mistral/mistral-7b-instruct-v0.2-lora Verified	39	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/meta-llama/llama-2-7b-chat-hf-lora Verified	39	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/google/gemma-7b-it-lora Verified	39	8K			Jun 29, 2026	—	Online
Groq	GPT-OSS Safeguard 20B Paid	39	131K	text🧠 reasoning		Oct 29, 2025	—	Online
OpenCode Zen	big-pickle Verified	39	N/A			Jun 28, 2026	—	Online
ModelScope	Qwen/Qwen3-Next-80B-A3B-Instruct Verified Qwen: Qwen3 Next 80B A3B Instruct (free)	39	8K			Sep 11, 2025	—	Online
ModelScope	Qwen/Qwen3-VL-8B-Thinking Verified Qwen: Qwen3 VL 8B Thinking	39	8K	🧠 reasoning		Oct 14, 2025	—	Online
ModelScope	Qwen/Qwen3-235B-A22B Verified Qwen: Qwen3 235B A22B	39	8K			Apr 28, 2025	—	Online
Glhf.chat	Llama 3.1 70B meta/llama-3.1-70b-instruct	39	131K	text	Unlimited for free models	Jul 23, 2024	—	Online
Cloudflare Workers AI	@cf/qwen/qwen2.5-coder-32b-instruct Verified @cf/qwen/qwen2.5-coder-32b-instruct	39	8K			Nov 11, 2024	—	Online
ModelScope	Qwen/Qwen3-VL-8B-Instruct Verified Qwen: Qwen3 VL 8B Instruct	38	8K			Oct 14, 2025	—	Online
NVIDIA NIM	nvidia/llama-3.1-nemotron-nano-vl-8b-v1 Verified	37	8K			Jun 29, 2026	—	Online
NVIDIA NIM	nvidia/nvidia-nemotron-nano-9b-v2 Verified	37	8K			Jun 29, 2026	—	Online
NVIDIA NIM	nvidia/llama-3.1-nemotron-nano-8b-v1 Verified	37	8K			Jul 10, 2026	—	Online
NVIDIA NIM	nvidia/nv-embed-v1 Verified	37	131K	embeddingtext	Up to 40 RPM	Jun 7, 2024	—	Online
NVIDIA NIM	nvidia/nv-embedcode-7b-v1 Verified	37	131K	embeddingtext	Up to 40 RPM	Mar 17, 2025	—	Online
OpenCode Zen	Hy3 preview Verified	37	256K	🧠 reasoning		Jul 10, 2026	—	Online
ModelScope	Qwen/Qwen3-235B-A22B-Thinking-2507 Verified Qwen: Qwen3 235B A22B Thinking 2507	37	8K	🧠 reasoning		Jul 25, 2025	—	Online
NVIDIA NIM	meta/llama-guard-4-12b Verified meta/llama-guard-4-12b	37	164K	text👁️ image	Up to 40 RPM	Apr 5, 2025	—	Online
ModelScope	Qwen/Qwen3-Next-80B-A3B-Thinking Verified Qwen: Qwen3 Next 80B A3B Thinking	36	8K	🧠 reasoning		Sep 11, 2025	—	Online
NVIDIA NIM	Llama-3.3-70B-Instruct Verified Meta: Llama 3.3 70B Instruct (free)	36	128K	text		Dec 6, 2024	—	Online
ModelScope	Qwen/Qwen3-30B-A3B Verified Qwen: Qwen3 30B A3B	35	8K			Apr 28, 2025	—	Online
ModelScope	mistralai/Mistral-Large-Instruct-2407 Verified mistralai/Mistral-Large-Instruct-2407	35	8K			Nov 19, 2024	—	Online
ModelScope	PaddlePaddle/ERNIE-4.5-0.3B-PT Verified	34	8K			Jun 29, 2026	—	Online
NVIDIA NIM	Llama 3.1 Nemotron Safety Guard 8B v3 Verified	34	128K	text		Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/qwen/qwen3-30b-a3b-fp8 Verified Qwen: Qwen3 30B A3B	34	8K			Apr 28, 2025	—	Online
ModelScope	Qwen/Qwen3-30B-A3B-Thinking-2507 Verified Qwen: Qwen3 30B A3B Thinking 2507	34	8K	🧠 reasoning		Aug 28, 2025	—	Online
Hugging Face	Meta-Llama-3.1-8B-Instruct Verified llama-3.1-8b-instant	34	128K	text	Credit-metered	Jul 23, 2024	—	Online
ModelScope	Qwen/Qwen3-32B Verified Qwen: Qwen3 32B	33	8K			Apr 28, 2025	—	Online
Grok (xAI)	Grok-2	32	131K	text	$25/month free credits, resets monthly	Dec 12, 2024	—	Online
NVIDIA NIM	nvidia/llama-3.1-nemoguard-8b-content-safety Verified	32	8K			Jun 29, 2026	—	Online
NVIDIA NIM	nvidia/llama-3.1-nemoguard-8b-topic-control Verified	32	8K			Jun 29, 2026	—	Online
ModelScope	Qwen/Qwen3-14B Verified Qwen: Qwen3 14B	32	8K			Apr 28, 2025	—	Online
NVIDIA NIM	meta/llama-3.1-8b-instruct Verified llama-3.1-8b-instant	32	8K			Jul 23, 2024	—	Online
NVIDIA NIM	google/gemma-3n-e4b-it Verified google/gemma-3n-e4b-it	32	8K			May 20, 2025	—	Online
Groq	meta-llama/llama-prompt-guard-2-22m Paid	30	8K			Jul 13, 2026	—	Online
Groq	meta-llama/llama-prompt-guard-2-86m Paid	30	8K			Jul 13, 2026	—	Online
ModelScope	Qwen/Qwen3-8B Verified Qwen: Qwen3 8B	30	8K			Apr 28, 2025	—	Online
Cloudflare Workers AI	@cf/meta/llama-3.2-3b-instruct Verified Meta: Llama 3.2 3B Instruct (free)	29	8K			Sep 25, 2024	—	Online
Cloudflare Workers AI	@cf/meta/llama-3.1-8b-instruct-fp8 Verified llama-3.1-8b-instant	28	8K			Jul 23, 2024	—	Online
Cloudflare Workers AI	@cf/meta/llama-3.2-1b-instruct Verified meta/llama-3.2-1b-instruct	28	8K			Sep 25, 2024	—	Online
Cloudflare Workers AI	@cf/meta/llama-guard-3-8b Verified	27	8K			Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/qwen/qwen3-embedding-0.6b Verified	27	8K	embedding		Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/pfnet/plamo-embedding-1b Verified	27	8K	embedding		Jun 29, 2026	—	Online
Cloudflare Workers AI	@cf/google/embeddinggemma-300m Verified	27	8K	embedding		Jun 29, 2026	—	Online

How to Get Started with Free LLM APIs

Pick a free LLM model — Click any model name to see details, rate limits, and API key signup link.
Get your API key — Sign up on the provider's website (most require no credit card).
Copy the config — Go to the Config Generator, pick your tool and backend, copy the ready-to-use snippet.
Test it — Use the Playground to test your API key before integrating.

New to LLM terminology? Check the 📖 Glossary — 22 terms explained in plain English →

FAQ: Common questions about free LLM APIs →

About This Free LLM API Directory

Finding reliable free LLM API resources online can be frustrating. Many developers traditionally rely on static GitHub repositories to find endpoints. While those lists are a good starting point, they often become outdated quickly, leaving you with dead links, expired API keys, and unverified rate limits.

That's why we built this dynamic, auto-updating directory. If you are looking for a reliable alternative to GitHub free LLM API lists, this page tracks over 350 free LLM models online in real-time. Whether you need a free API key for text generation, vision, or coding tasks, you can compare context windows, capabilities, and strict rate limit data side-by-side.

Our goal is to be the most accurate and comprehensive list of free AI APIs for developers. Use the filters above to find providers that don't require credit cards or phone verification, and grab your free API keys to start building immediately.

Directory of Free LLM APIs: Compare 350+ Models

How to Get Started with Free LLM APIs

About This Free LLM API Directory