Model Pricing
Pricing details for AI models supported by QCode
QCode supports three major AI model families: Claude, GPT/Codex, and Gemini. Pricing is based on each provider's official rates, with a service rate multiplier applied.
Pricing Formula¶
QCode Price = Official API Price × Service Rate
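As a minimal sketch of the formula, the snippet below multiplies an official price by a service rate. The function name `qcode_price` and both input values are illustrative assumptions, not actual QCode rates. `Decimal` is used so the currency arithmetic avoids floating-point rounding.

```python
from decimal import Decimal


def qcode_price(official_price: Decimal, service_rate: Decimal) -> Decimal:
    """Apply the formula: QCode Price = Official API Price x Service Rate."""
    return official_price * service_rate


# Hypothetical inputs: an official price of $3.00 per million tokens
# and a service rate of 1.2. Neither value is a real QCode rate.
print(qcode_price(Decimal("3.00"), Decimal("1.2")))  # prints 3.600
```

A service rate of 1 would leave the official price unchanged. Check the rate cards for the actual multiplier of each provider.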
Service Rates¶
The service rate is a multiplier that covers routing, reliability, and infrastructure costs. Rates may vary by provider, reflecting differences in access costs.
Current rates are listed in the rate cards section of the Models & Pricing page. QCode administrators manage the rates and can adjust them at any time.
Supported Model Families¶
Anthropic Claude¶
Includes Claude Sonnet, Opus, and Haiku series. Pricing covers input/output tokens and prompt caching (cache write/read).
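To illustrate how the four billed token types could combine into one request cost, here is a rough sketch. The per-million-token prices, the token counts, the service rate, and the names `PRICES_PER_MILLION` and `request_cost` are all placeholders chosen for this example. The way QCode itself meters a request is not documented here, so treat this as an assumption.

```python
from decimal import Decimal

# Hypothetical official prices in USD per million tokens (placeholders only).
PRICES_PER_MILLION = {
    "input": Decimal("3"),
    "output": Decimal("15"),
    "cache_write": Decimal("3.75"),
    "cache_read": Decimal("0.30"),
}


def request_cost(usage: dict, prices: dict, service_rate: Decimal) -> Decimal:
    """Sum the official cost of each token type, then apply the service rate."""
    official = sum(
        (Decimal(tokens) * prices[kind] for kind, tokens in usage.items()),
        Decimal(0),
    ) / Decimal(1_000_000)
    return official * service_rate


# Hypothetical token counts for a single request.
usage = {"input": 1000, "output": 500, "cache_write": 2000, "cache_read": 10000}
cost = request_cost(usage, PRICES_PER_MILLION, Decimal("1.5"))
print(cost)
```

With these placeholder numbers the official cost is $0.021, and a service rate of 1.5 brings the total to $0.0315.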
OpenAI / Codex¶
Includes GPT and Codex series models. These models can be used with Claude Code, Codex CLI, and other tools.
Google Gemini¶
Includes Gemini Pro, Flash, and other variants. Accessed via Vertex AI.
View Detailed Pricing¶
Visit qcode.cc/en/models for real-time pricing, context windows, capability badges, and usage popularity for all models.
Pricing data is automatically synced from the LiteLLM open-source pricing database every 10 minutes.
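The sketch below shows how an entry in the style of LiteLLM's pricing file could be converted into per-million-token prices with a service rate applied. The field names (`input_cost_per_token`, `output_cost_per_token`, `cache_creation_input_token_cost`, `cache_read_input_token_cost`) follow LiteLLM's published JSON. The model name `example-model`, its prices, the service rate, and the helper `to_qcode_prices` are made up for illustration, and the way QCode actually consumes the synced data is an assumption.

```python
import json
from decimal import Decimal

# Hypothetical entry shaped like LiteLLM's pricing JSON (values are placeholders).
SAMPLE = """
{
  "example-model": {
    "input_cost_per_token": 3e-06,
    "output_cost_per_token": 1.5e-05,
    "cache_creation_input_token_cost": 3.75e-06,
    "cache_read_input_token_cost": 3e-07
  }
}
"""

COST_FIELDS = (
    "input_cost_per_token",
    "output_cost_per_token",
    "cache_creation_input_token_cost",
    "cache_read_input_token_cost",
)


def to_qcode_prices(entry: dict, service_rate: Decimal) -> dict:
    """Convert per-token costs to per-million prices and apply the service rate."""
    return {
        field: entry[field] * Decimal(1_000_000) * service_rate
        for field in COST_FIELDS
        if field in entry  # not every model defines cache prices
    }


# parse_float=Decimal keeps the JSON numbers exact instead of using floats.
models = json.loads(SAMPLE, parse_float=Decimal)
prices = to_qcode_prices(models["example-model"], Decimal("1.2"))
for field, price in prices.items():
    print(f"{field}: ${price.normalize():f} per million tokens")
```

Fields that a model does not define are skipped, so the same helper works for models without prompt caching prices.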