Model Pricing

Pricing details for AI models supported by QCode

QCode supports three major AI model families: Claude, GPT/Codex, and Gemini. Pricing is based on each provider's official rates, with a service rate multiplier applied.

Pricing Formula

QCode Price = Official API Price × Service Rate
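As a minimal sketch of the formula above, the multiplication can be written as a small helper. The function name `qcode_price` and the numbers used are illustrative assumptions, not actual QCode rates or code; `Decimal` is used only to avoid floating-point rounding in currency arithmetic.

```python
from decimal import Decimal


def qcode_price(official_price: Decimal, service_rate: Decimal) -> Decimal:
    """Apply the service-rate multiplier to a provider's official price."""
    if official_price < 0 or service_rate < 0:
        raise ValueError("price and rate must be non-negative")
    return official_price * service_rate


# Hypothetical numbers for illustration only: an official price of
# 3.00 (per million tokens) and a service rate of 1.2.
example = qcode_price(Decimal("3.00"), Decimal("1.2"))
print(example)
```

A service rate of exactly 1 would leave the official price unchanged. The real rate for each provider is whatever the rate cards currently show.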

Service Rates

The service rate is a multiplier that covers routing, reliability, and infrastructure costs. Rates may vary by provider, reflecting differences in access costs.

Current rates are listed in the rate cards section of the Models & Pricing page. Rates are managed by QCode administrators and can be adjusted at any time.

Supported Model Families

Anthropic Claude

Includes Claude Sonnet, Opus, and Haiku series. Pricing covers input/output tokens and prompt caching (cache write/read).
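To illustrate how the four billed token categories might combine into one request cost, here is a rough sketch. It assumes each category (input, output, cache write, cache read) is billed separately at its own per-million-token price and that a single service rate applies to the total. The class names, field names, and every number are hypothetical.

```python
from dataclasses import dataclass

TOKENS_PER_MILLION = 1_000_000


@dataclass(frozen=True)
class TokenPrices:
    """Official prices per million tokens (hypothetical values only)."""
    input: float
    output: float
    cache_write: float
    cache_read: float


@dataclass(frozen=True)
class Usage:
    """Token counts for one request, split by billing category."""
    input_tokens: int = 0
    output_tokens: int = 0
    cache_write_tokens: int = 0
    cache_read_tokens: int = 0


def request_cost(usage: Usage, prices: TokenPrices, service_rate: float) -> float:
    """Sum the official cost of each token category, then apply the service rate."""
    official = (
        usage.input_tokens * prices.input
        + usage.output_tokens * prices.output
        + usage.cache_write_tokens * prices.cache_write
        + usage.cache_read_tokens * prices.cache_read
    ) / TOKENS_PER_MILLION
    return official * service_rate


# Made-up prices and usage, for illustration only.
prices = TokenPrices(input=2.0, output=10.0, cache_write=2.5, cache_read=0.2)
usage = Usage(
    input_tokens=1_000_000,
    output_tokens=100_000,
    cache_write_tokens=200_000,
    cache_read_tokens=500_000,
)
print(round(request_cost(usage, prices, service_rate=1.5), 4))
```

Keeping cache reads as their own category matters because they are typically priced well below regular input tokens, so folding them into the input count would overstate the cost.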

OpenAI / Codex

Includes GPT and Codex series models. Used with Claude Code, Codex CLI, and other tools.

Google Gemini

Includes Gemini Pro, Flash, and other variants. Accessed via Vertex AI.

View Detailed Pricing

Visit qcode.cc/en/models for real-time pricing, context windows, capability badges, and usage popularity for all models.

Pricing data is automatically synced from the LiteLLM open-source pricing database every 10 minutes.
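For readers who want to work with the same upstream data, the sketch below converts one LiteLLM-style pricing entry into per-million-token prices with a service rate applied. It assumes the per-token field names `input_cost_per_token` and `output_cost_per_token` used in LiteLLM's published pricing JSON. The model name `example-model`, the sample values, and the helper `per_million_prices` are hypothetical, and this is not a description of how QCode's own sync is implemented.

```python
import json

# Inline sample shaped like a LiteLLM pricing entry; the values are made up.
SAMPLE = json.loads("""
{
  "example-model": {
    "input_cost_per_token": 0.000003,
    "output_cost_per_token": 0.000015
  }
}
""")


def per_million_prices(entry: dict, service_rate: float) -> dict:
    """Convert per-token costs to per-million-token prices, scaled by the rate."""
    result = {}
    for field in ("input_cost_per_token", "output_cost_per_token"):
        if field in entry:  # some entries may omit a field
            label = field.replace("_cost_per_token", "")
            result[label] = entry[field] * 1_000_000 * service_rate
    return result


prices = per_million_prices(SAMPLE["example-model"], service_rate=1.0)
print(prices)
```

With a service rate of 1.0 the result is simply the official price per million tokens. Passing the provider's current rate from the rate cards gives the corresponding QCode price.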

  • Billing — Plan billing rules and quota details
  • FAQ — Frequently asked questions