Question 1

How much cheaper is QuickSilver Pro than DeepInfra?

Accepted Answer

On DeepSeek V3, QuickSilver Pro is ~43% cheaper on input and ~30% cheaper on output: $0.16 / $0.616 vs DeepInfra's $0.28 / $0.88 per 1M tokens. On DeepSeek R1, input is roughly at parity ($0.56 on QSP vs $0.55 on DeepInfra — QSP marginally higher) while output is ~9% cheaper: $2.00 vs $2.19 per 1M tokens.

Question 2

How do I migrate from DeepInfra to QuickSilver Pro?

Accepted Answer

Both are OpenAI-compatible. Change base_url from https://api.deepinfra.com/v1/openai to https://api.quicksilverpro.io/v1 and swap your API key. Model IDs: deepseek-ai/DeepSeek-V3 becomes deepseek-v3, deepseek-ai/DeepSeek-R1 becomes deepseek-r1.

Question 3

When should I stay on DeepInfra?

Accepted Answer

Stay on DeepInfra if you use Llama-family models, embeddings, image generation, Whisper transcription, or their dedicated inference deployments. QuickSilver Pro focuses on 9 open-source LLMs and does not offer non-chat modalities.

Question 4

What about cached input pricing?

Accepted Answer

Both expose cached-input pricing. DeepInfra discounts cached input on DeepSeek V3 and V3.1, and QuickSilver Pro bills cached-input tokens at a separate, lower cache-read rate on DeepSeek V3/V4 (and the Qwen/Kimi models). For workloads with >50% cache-hit, compare the effective per-request cost including the cache rate, not the list price alone.

特性	QuickSilver Pro	deepinfra
目录侧重点	9 个开源 LLM	60+ 开源模型、视觉、音频
DeepSeek V3 输出价格	$0.616 / 1M	$0.88 / 1M
DeepSeek R1 输出价格	$2.00 / 1M	$2.19 / 1M
缓存输入折扣	暂未提供	是（DeepSeek V3/V3.1）
Embeddings / 音频 / 图像	否	是
专用部署	否	是
OpenAI-compatible chat	是	是
最小充值金额	$5	$20

QuickSilver Pro vs DeepInfra

快速概览

价格（每百万 tokens，USD）

迁移：只改两行

常见问题

首次充值双倍 — 最高 $50 免费

模型	QSP 输入	QSP 输出	deepinfra 输入	deepinfra 输出	节省
DeepSeek V3	$0.16	$0.616	$0.28	$0.88	~30%
DeepSeek R1	$0.56	$2.00	$0.55	$2.19	~9% output
Qwen3.5-35B-A3B	$0.111	$0.80	Comparable	Comparable	—