Question 1

How much cheaper is QuickSilver Pro than Azure OpenAI?

Accepted Answer

On direct quality-equivalent maps: GPT-4o-mini→DeepSeek V4 Flash ~73% cheaper output; GPT-4o→V3 ~16x cheaper output; o3-mini→V4 Pro ~6x cheaper output; o1→R1 ~30x cheaper output. Real bills usually land at 10–20% of Azure OpenAI for traffic that re-routes cleanly to open-source.

Question 2

Does the OpenAI SDK still work?

Accepted Answer

Yes — use the plain openai client (not AzureOpenAI). Change base_url to https://api.quicksilverpro.io/v1 and drop the deployment-name indirection; pass model IDs directly. Streaming, tool calling, json_schema strict mode, and usage accounting are preserved.

Question 3

When should I stay on Azure OpenAI?

Accepted Answer

When Entra ID, Private Link, Sentinel logging, or Microsoft Compliance Manager mappings are load-bearing. Also when you need vision, real-time audio, DALL-E, or the Assistants API. QuickSilver Pro covers the chat / coding / reasoning slice where open-source is quality-equivalent.

Feature	QuickSilver Pro	azure-openai
Model catalog	9 open-source LLMs (V4 Flash + Pro, V3, R1, Qwen 3.7 Max + 3.6 Plus + 3.6 + 3.5, Kimi K2.6)	GPT-4o, o1, o3-mini, GPT-4o-mini (closed)
Model weights	Open (MIT / Apache)	Closed
Cheap chat output (GPT-4o-mini / V4 Flash)	$0.16 / 1M	$0.60 / 1M
General chat output (GPT-4o / V3)	$0.616 / 1M	$10.00 / 1M
Top reasoning output (o1 / R1)	$2.00 / 1M	$60.00 / 1M
API setup	Sign up, paste key	Provision resource, request quota, AAD
Private Link / Entra ID / Sentinel	No	Yes

Model	QSP input	QSP output	azure-openai input	azure-openai output	Savings
deepseek-v4-flash vs gpt-4o-mini	$0.08	$0.16	$0.15	$0.60	~73%
deepseek-v3 vs gpt-4o	$0.16	$0.616	$2.50	$10.00	~94%
deepseek-v4-pro vs o3-mini	$0.348	$0.696	$1.10	$4.40	~84%
deepseek-r1 vs o1	$0.56	$2.00	$15.00	$60.00	~97%

QuickSilver Pro vs Azure OpenAI Service

At a glance

Pricing (per million tokens, USD)

Migration - two lines

FAQ

Try it with double credits — up to $50 free