Open-source inference,
20% below the rest.
The most popular open-source models — DeepSeek V3, DeepSeek R1, Qwen3.5-35B-A3B — through an OpenAI-compatible API. Cheaper than every other reseller. Change one line of code.
or try all three models live on HuggingFace — no signup required.
# One line change. That's it. from openai import OpenAI client = OpenAI( base_url="https://api.quicksilverpro.io/v1", api_key="your-api-key", )
Cheapest open-source inference
Per million tokens · compared against OpenRouter, Together AI, Fireworks.
| Model | Context | Input | Output | Savings |
|---|---|---|---|---|
Qwen3.5-35B-A3B qwen3.5-35b Best for long-context RAG, summarization | 262K | $0.13$0.16 | $1.00$1.25 | −20% |
DeepSeek V3Default deepseek-v3 Best for chat, coding, structured output | 128K | $0.24$0.30 | $0.70$0.88 | −20% |
DeepSeek R1Reasoning deepseek-r1 Best for math, multi-step reasoning, logic | 128K | $0.40$0.50 | $1.70$2.15 | −20% |
Compared against OpenRouter, Together AI, and Fireworks AI. Prices as of April 2026.
How much would you save?
Plug in your monthly usage — see the cost on QSP vs every competitor.
QuickSilver Pro | $0.45cheapest |
OpenRouter | $0.56+25% |
DeepInfra | $0.54+21% |
Together AI | $0.60+33% |
Fireworks AI | $0.57+27% |
OpenAIclosed model analog | $5.50+12.2× |
qspBuilt for terminals and AI agents. --json output with stable exit codes — Claude Code, Cursor, Aider can call it without parsing HTML.
Common questions
What is QuickSilver Pro?
An OpenAI-compatible HTTP API for the top open-source LLMs — DeepSeek V3, DeepSeek R1, and Qwen3.5-35B-A3B. Point the official OpenAI SDK at our base URL and get the same chat-completions interface, 20% below competing resellers.
How much cheaper than OpenRouter / OpenAI?
20% below the public per-token rates at OpenRouter, Together AI, Fireworks AI, and DeepInfra on the same open-source models. DeepSeek V3: $0.24 / $0.70. DeepSeek R1: $0.40 / $1.70. Qwen3.5-35B-A3B: $0.13 / $1.00. We don't serve closed models (GPT-4, Claude).
Is it really a drop-in OpenAI replacement?
Yes. Change base_url to https://api.quicksilverpro.io/v1 in the official openai Python / Node / Swift SDKs. Streaming, tool calling, json_schema strict mode, and usage.cost accounting all work out of the box.
Is there a free tier?
New accounts get $1 in free credits on registration — enough for ~500-700 real DeepSeek V3 calls to evaluate the service. After that it's pay-as-you-go starting at $5, no subscription.
Start saving on inference today
Create an account, buy credits, get your API key in 30 seconds.
Get API Key