Reference · Models
The lineup, at the token level.
-
GPT-5.5 Azure OpenAIS G P DPremium · hardest only · keep narrow$5.00 / $30.00
-
Claude Opus 4.7 BedrockS G P DPremium · hardest queries · ambiguous multi-doc$5.00 / $25.00
-
Gemini 3 GoogleS G P DMultimodal · chart synthesis · screenshot reasoning$3.50 / $15.00
-
Claude Sonnet 4.6 BedrockS G P DDefault · English reports · charts · the workhorse$3.00 / $15.00
-
DeepSeek V4 Pro AlibabaS G P DDefault + math + long-context · the bet of this plan$1.74 / $3.48
-
Qwen 3.6 Max AlibabaS G P DChinese frontier · mainland-specific only$1.30 / $7.80
-
GPT-5 Azure OpenAIS G P DDefault · English reports · charts$1.25 / $10.00
-
o4-mini Azure OpenAIS G P DAgentic chains · OpenAI tool format$1.10 / $4.40
-
Claude Haiku 4.5 BedrockS G P DFast English lookups · cheap path · classification$1.00 / $5.00
-
Gemma 4 GoogleS G P DOpen weights · multilingual utility · keeps Google in the mix$0.40 / $1.20
-
GPT-5 mini Azure OpenAIS G P DEnglish lookups · structured-output reliability$0.25 / $2.00
-
DeepSeek V4 Flash AlibabaS G P DBulk cheap path · multilingual utility$0.14 / $0.28
May 2026 list price · cache-aware blended cost. S · G · P · D = Silver · Gold · Platinum · Diamond — the highlighted letter is the currently active plan. Routing % is the share of queries this lane wins under that plan.