freellmpool

Name: freellmpool
Author: 0xzr

A free, open-source, OpenAI-compatible gateway that pools the free tiers of 24 cataloged LLM providers.

freellmpool is a free, open-source Python tool that pools the free tiers of 24 LLM providers — Groq, Cerebras, NVIDIA, Google Gemini, OpenRouter, Cloudflare, Mistral, Cohere and more — behind one OpenAI-compatible endpoint. It runs as a command-line tool, a Python library, a local proxy, and an MCP server, with automatic failover when a provider is rate-limited and per-day quota tracking across tiers. Several providers (Pollinations, OVHcloud, and Kilo Gateway) need no API key, and LLM7 works without one, so pip install freellmpool can answer without signup when a keyless provider is up. The proxy speaks the OpenAI API and includes an experimental Anthropic-compatible path, so existing apps and coding agents such as Codex, Claude Code, and aider can run on pooled free models.

pip install freellmpool
freellmpool ask "Explain the CAP theorem in one sentence."   # keyless when routes are available

GitHub · PyPI · Changelog · MIT license

Release status. Latest release: 0.11.4, available from both GitHub and PyPI. Current main includes unreleased changes, including the Hermes profile, the readiness/provider operations APIs, refreshed catalog data, and registry-readiness hardening for the existing repository-local OpenCode plugins. Released 0.11.4 already includes spread routing. To test the main-only additions explicitly:

python -m pip install --force-reinstall 'git+https://github.com/0xzr/freellmpool.git@main'

Registry publication status: pending. opencode-freellmpool and opencode-freellmpool-tui are CI-tested but were not published on npm as of 2026-07-19; use their repository-local installation paths.

How it compares

Tool	What it is	Install	Keyless start	CLI / library / proxy / MCP
freellmpool	Pools many providers' free tiers	`pip install`	Yes (keyless routes)	All four
OpenRouter	Hosted paid aggregator (some free models)	API key	No (account + key)	API only
LiteLLM	Multi-provider SDK/proxy (bring your own keys)	`pip install`	No	Library + proxy
Self-hosted free-API servers	A server you deploy	Docker + config	No (deploy first)	Server only

freellmpool's niche: the keyless, pip-installable client for squeezing the hosted free tiers — not a server you deploy and not a paid aggregator.

What you can do with it

Use free LLMs from the command line: freellmpool ask "...", pipe stdin, pin a provider/model.
OpenAI-compatible proxy: point OPENAI_BASE_URL at freellmpool proxy for most text, embedding, and transcription calls.
Run coding agents for free: freellmpool code claude (also codex, aider, cline, continue, cursor, Hermes, and OpenCode) routes them to pooled free models — see the step-by-step guide. The Hermes profile is currently main-only.
Automate proxy operations: current main adds public /livez and /readyz probes, authenticated /v1/providers, and /v1/models?ready=true for locally ready targets.
Use OpenCode deeply: choose freellmpool/spread for quota-aware agent loops and add the repository-local dashboard/tools plugins while npm publication is pending.
Add model-pool review to metaswarm: use the experimental metaswarm adapter as a review-only external tool.
Async + library: from freellmpool import Pool, AsyncPool.
MCP server: freellmpool mcp lets Claude Desktop/Code/Cursor offload subtasks to free models.
Pick the fastest tier: freellmpool benchmark + latency-aware routing.
See what's usable right now: freellmpool capacity status and freellmpool providers health show which free tiers are healthy, near quota, or missing a key — see the capacity guide.

Guides

Per-provider free tiers: Groq · Gemini · Cerebras · NVIDIA · Mistral

FAQ

Is there a free, OpenAI-compatible LLM API gateway?

Yes — freellmpool is a free, open-source, MIT-licensed gateway that exposes one OpenAI-compatible endpoint backed by the free tiers of 24 cataloged providers. Install it with pip install freellmpool and point any OpenAI client at the local proxy.

How do I use multiple free LLM APIs at once?

freellmpool pools them: each request goes to a provider you have access to, and if that one is rate-limited or down it automatically fails over to the next. It tracks per-day usage so load spreads evenly across the free tiers, and an optional fast routing mode prefers the lowest-latency provider.

Can I run Claude Code or Codex on free models?

Yes. The proxy implements the OpenAI API and an experimental Anthropic-compatible path, so you set OPENAI_BASE_URL=http://localhost:8080/v1 or ANTHROPIC_BASE_URL=http://localhost:8080 and run Codex, Claude Code, aider, Cline, Continue, Cursor, or OpenCode on pooled free models. Run freellmpool code claude for the exact setup. (The Claude Code path is experimental: text and tool use, no vision yet.)

Do I need an API key?

No key is required to start: Pollinations, OVHcloud, and Kilo Gateway need no API key, and LLM7 is key-optional, so a fresh install can answer without signup when a keyless provider is up. Add free keys for the other providers to unlock more models and higher limits.

Which free LLM providers does it support?

24 cataloged providers: Pollinations, OVHcloud, LLM7, Kilo Gateway, OpenCode Zen (disabled by default), Groq, Cerebras, NVIDIA NIM, OpenRouter, Google Gemini, GitHub Models, Cloudflare Workers AI, Mistral, Cohere, SambaNova, Z.ai/Zhipu, Ollama Cloud, LongCat, Hugging Face (router), Aion Labs, ModelScope API Inference, Morph, Vercel AI Gateway, and SiliconFlow — 247 enabled chat routes, 405 cataloged chat models, plus free embeddings and audio transcription (Whisper).

How do I know which free providers are usable right now?

Run freellmpool capacity status: it labels each provider healthy, low_quota, exhausted, invalid_key, or missing from your local catalog, environment, and per-day usage, and flags when you're below a target number of healthy providers. freellmpool providers health sends one tiny request to each configured provider and reports latency or failure, and freellmpool keys add walks you through configuring more. See the capacity guide.

How is freellmpool different from OpenRouter?

OpenRouter is a hosted aggregator you pay per token (with some rate-limited free models) and access through its API. freellmpool is a local, open-source tool you pip install that pools the providers' own free tiers directly, can start keyless when routes are available, and ships a CLI, a Python library, a proxy, and an MCP server.

Is freellmpool free and open source?

Yes — it is MIT-licensed and free to use. Source is on GitHub; the package is on PyPI.

What are the limitations?

Free-tier models are smaller than frontier models — good for drafting, classification, triage, and everyday coding, not GPT-class reasoning on the hardest problems. Limits reset at UTC midnight and capacity varies through the day. The proxy is meant for local/single-user use.

Where do prompts go?

freellmpool is not a privacy layer: prompts go to the selected upstream provider. See the FAQ for provider routing, ToS, and privacy notes.

Free and open source (MIT). Latest release: 0.11.4. Page updated 2026-07-19. Project: github.com/0xzr/freellmpool