This page applies to both Verdent Desktop and Verdent for VS Code.

Overview

Verdent integrates state-of-the-art large language models from the world’s leading AI labs, including Anthropic (Claude), OpenAI (GPT), Google (Gemini), Moonshot (Kimi), Zhipu AI (GLM), and MiniMax. To help users understand the cost behind every AI interaction, we fully disclose the provider pricing for all available models in this document. All prices listed are the official published prices from each model provider, denominated in US dollars ($) per one million tokens (1M tokens).

Key Concepts

Tokens

A token is the fundamental unit of text processing for large language models. One token is approximately 4 English characters or 1-2 Chinese characters. Model pricing is based on the number of input and output tokens consumed, billed separately.
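The 4-characters-per-token rule of thumb can be turned into a rough estimator. This is only a sketch: `estimate_tokens` and `CHARS_PER_TOKEN_EN` are illustrative names, not part of Verdent, and real tokenizers differ by model and language, so treat the result as an approximation rather than a billing figure.

```python
import math

# Rule of thumb from the text above: about 4 English characters per token.
CHARS_PER_TOKEN_EN = 4


def estimate_tokens(text: str, chars_per_token: float = CHARS_PER_TOKEN_EN) -> int:
    """Return a rough token estimate for `text` (hypothetical helper)."""
    if not text:
        return 0
    # Round up so that short, non-empty strings count as at least one token.
    return math.ceil(len(text) / chars_per_token)


prompt = "Refactor this function to use async I/O."
print(f"~{estimate_tokens(prompt)} tokens")
```

For Chinese text, passing `chars_per_token` somewhere between 1 and 2 would match the range given above.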

Billing Model

All models on Verdent currently use a per-token billing model, meaning you are charged based on the actual number of input and output tokens consumed during each interaction.

Price Components

Each model’s pricing consists of the following dimensions:
  • Input Price: The per-token cost for the prompt (user message and context) sent to the model.
  • Output Price: The per-token cost for the response generated by the model. Typically higher than input price.
  • Cache Write Price: Some models support prompt caching. This is the per-token cost when creating a cache entry for the first time.
  • Cache Read Price: The per-token cost when hitting an existing cache entry. Typically much lower than the standard input price, effectively reducing costs for repeated contexts.
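The four price components combine into a single per-interaction cost. The sketch below is illustrative only: `ModelPrice` and `interaction_cost` are hypothetical names, and it assumes the four token counts are disjoint (uncached input, cache writes, and cache reads are counted separately), which this page does not specify. The example prices are the Claude Sonnet 4.6 figures listed on this page.

```python
from dataclasses import dataclass

TOKENS_PER_PRICE_UNIT = 1_000_000  # prices are quoted per 1M tokens


@dataclass(frozen=True)
class ModelPrice:
    """Prices in USD per 1M tokens (hypothetical container)."""
    input: float
    output: float
    cache_write: float = 0.0
    cache_read: float = 0.0


def interaction_cost(
    price: ModelPrice,
    input_tokens: int,
    output_tokens: int,
    cache_write_tokens: int = 0,
    cache_read_tokens: int = 0,
) -> float:
    """Return the USD cost of one interaction.

    Assumes the four token counts do not overlap.
    """
    total = (
        input_tokens * price.input
        + output_tokens * price.output
        + cache_write_tokens * price.cache_write
        + cache_read_tokens * price.cache_read
    )
    return total / TOKENS_PER_PRICE_UNIT


# Claude Sonnet 4.6 prices as listed on this page.
sonnet = ModelPrice(input=3.00, output=15.00, cache_write=3.75, cache_read=0.30)

cost = interaction_cost(
    sonnet,
    input_tokens=10_000,
    output_tokens=2_000,
    cache_read_tokens=50_000,
)
print(f"${cost:.4f}")
```

Here the 50,000 cached tokens cost $0.015 at the cache read price. At the standard input price the same tokens would cost $0.15, which is the saving that caching provides for repeated contexts.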

Model Pricing Details

Below are the provider prices for all models currently available on Verdent, organized by provider. All prices are in USD per 1M tokens.

Anthropic (Claude Series)

| Model | Input ($/1M) | Output ($/1M) | Cache Write ($/1M) | Cache Read ($/1M) |
| --- | --- | --- | --- | --- |
| Claude Opus 4.6 | $5.00 | $25.00 | $6.25 | $0.50 |
| Claude Opus 4.5 | $5.00 | $25.00 | $6.25 | $0.50 |
| Claude Sonnet 4.6 | $3.00 | $15.00 | $3.75 | $0.30 |
| Claude Sonnet 4.5 | $3.00 | $15.00 | $3.75 | $0.30 |
| Claude Sonnet 4 | $3.00 | $15.00 | $3.75 | $0.30 |
| Claude Haiku 4.5 | $1.00 | $5.00 | $1.25 | $0.10 |

  • Claude Opus 4.6 (claude-opus-4-6): The latest flagship model in the Opus series. Best-in-class reasoning, ideal for complex code architecture, deep analysis, and difficult problem-solving tasks.
  • Claude Opus 4.5 (claude-opus-4-5@20251101): Flagship-tier model with top-tier reasoning and creative capabilities.
  • Claude Sonnet 4.6 (claude-sonnet-4-6): The latest balanced model. Excellent performance-to-cost ratio; recommended for everyday development.
  • Claude Sonnet 4.5 (claude-sonnet-4-5@20250929): Balanced model with strong performance and competitive pricing.
  • Claude Sonnet 4 (claude-sonnet-4@20250514): Classic Sonnet release. Stable and reliable for general-purpose use.
  • Claude Haiku 4.5 (claude-haiku-4-5@20251001): Fast and lightweight model with the quickest response times. Best for simple conversations, quick lookups, and low-latency scenarios.

OpenAI (GPT Series)

| Model | Input ($/1M) | Output ($/1M) | Cache Write ($/1M) | Cache Read ($/1M) |
| --- | --- | --- | --- | --- |
| GPT-5.4 | $2.50 | $15.00 | Free | $0.25 |
| GPT-5.3 Codex | $1.75 | $14.00 | Free | $0.17 |
| GPT-5.2 | $1.75 | $14.00 | Free | $0.17 |
| GPT-5.2 Codex | $1.75 | $14.00 | Free | $0.17 |
| GPT-5.1 | $1.25 | $10.00 | Free | $0.12 |
| GPT-5.1 Codex | $1.25 | $10.00 | Free | $0.12 |
| GPT-5 | $1.25 | $10.00 | Free | $0.12 |
| GPT-5 Codex | $1.25 | $10.00 | Free | $0.12 |

  • GPT-5.4 (gpt-5.4): The latest flagship model with comprehensive upgrades in reasoning and code generation quality.
  • GPT-5.3 Codex (gpt-5.3-codex): The latest code-specialized model, deeply optimized for programming tasks.
  • GPT-5.2 (gpt-5.2): Powerful reasoning model for complex logic and deep analysis.
  • GPT-5.2 Codex (gpt-5.2-codex): Code-specialized variant, ideal for large-scale code generation and refactoring.
  • GPT-5.1 (gpt-5.1): Stable and efficient model with well-balanced capabilities.
  • GPT-5.1 Codex (gpt-5.1-codex): Code-optimized variant, recommended for programming scenarios.
  • GPT-5 (gpt-5): Classic baseline model with reliable general-purpose capabilities.
  • GPT-5 Codex (gpt-5-codex): Code-optimized variant with outstanding cost-effectiveness.

Google (Gemini Series)

| Model | Input ($/1M) | Output ($/1M) | Cache Write ($/1M) | Cache Read ($/1M) |
| --- | --- | --- | --- | --- |
| Gemini 3.1 Pro | $2.00 | $12.00 | - | $0.20 |
| Gemini 3 Flash | $0.50 | $3.00 | - | $0.050 |

  • Gemini 3.1 Pro (gemini-3.1-pro-preview): Professional-grade model with strong reasoning, suitable for complex analysis and deep thinking tasks.
  • Gemini 3 Flash (gemini-3-flash-preview): Ultra-fast model with excellent cost efficiency. Ideal for high-volume batch processing.

Moonshot (Kimi Series)

| Model | Input ($/1M) | Output ($/1M) | Cache Write ($/1M) | Cache Read ($/1M) |
| --- | --- | --- | --- | --- |
| Kimi K2.5 | $0.60 | $3.00 | - | $0.10 |
| Kimi K2 Thinking | $1.15 | $8.00 | - | $0.15 |
| Kimi K2 Turbo | $1.15 | $8.00 | - | $0.15 |

  • Kimi K2.5 (kimi-k2.5): The latest version with significantly improved bilingual capabilities and enhanced reasoning quality.
  • Kimi K2 Thinking (kimi-k2-thinking-turbo): Thinking-enhanced variant that excels at complex reasoning and logical analysis.
  • Kimi K2 Turbo (kimi-k2-turbo-preview): High-speed variant with fast response times, ideal for everyday conversations. Currently at 50% off promotional pricing.

Zhipu AI (GLM Series)

| Model | Input ($/1M) | Output ($/1M) | Cache Write ($/1M) | Cache Read ($/1M) |
| --- | --- | --- | --- | --- |
| GLM-5 | $1.00 | $3.20 | - | $0.20 |

  • GLM-5 (glm-5): The latest generation general-purpose model with outstanding Chinese-language performance.

MiniMax

| Model | Input ($/1M) | Output ($/1M) | Cache Write ($/1M) | Cache Read ($/1M) |
| --- | --- | --- | --- | --- |
| MiniMax M2.7 | $0.30 | $1.20 | $0.38 | $0.060 |
| MiniMax M2.5 | $0.30 | $1.20 | $0.38 | $0.030 |

  • MiniMax M2.7 (MiniMax-M2.7): Latest version with comprehensive performance improvements, maintaining highly competitive pricing.
  • MiniMax M2.5 (MiniMax-M2.5): High cost-efficiency model, ideal for cost-sensitive, high-volume processing scenarios.

Pricing Overview

The table below summarizes the core pricing for all models, sorted by output price descending for easy comparison:
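
| Model | Provider | Input ($/1M) | Output ($/1M) | Cache Write ($/1M) | Cache Read ($/1M) |
| --- | --- | --- | --- | --- | --- |
| Opus 4.5 | Anthropic | $5.00 | $25.00 | $6.25 | $0.50 |
| Opus 4.6 | Anthropic | $5.00 | $25.00 | $6.25 | $0.50 |
| Sonnet 4 | Anthropic | $3.00 | $15.00 | $3.75 | $0.30 |
| GPT-5.4 | OpenAI | $2.50 | $15.00 | Free | $0.25 |
| Sonnet 4.5 | Anthropic | $3.00 | $15.00 | $3.75 | $0.30 |
| Sonnet 4.6 | Anthropic | $3.00 | $15.00 | $3.75 | $0.30 |
| GPT-5.3 Codex | OpenAI | $1.75 | $14.00 | Free | $0.17 |
| GPT-5.2 Codex | OpenAI | $1.75 | $14.00 | Free | $0.17 |
| GPT-5.2 | OpenAI | $1.75 | $14.00 | Free | $0.17 |
| Gemini 3.1 Pro | Google | $2.00 | $12.00 | - | $0.20 |
| GPT-5.1 Codex | OpenAI | $1.25 | $10.00 | Free | $0.12 |
| GPT-5 | OpenAI | $1.25 | $10.00 | Free | $0.12 |
| GPT-5.1 | OpenAI | $1.25 | $10.00 | Free | $0.12 |
| GPT-5 Codex | OpenAI | $1.25 | $10.00 | Free | $0.12 |
| Kimi K2 Thinking | Moonshot | $1.15 | $8.00 | - | $0.15 |
| Kimi K2 Turbo | Moonshot | $1.15 | $8.00 | - | $0.15 |
| Haiku 4.5 | Anthropic | $1.00 | $5.00 | $1.25 | $0.10 |
| GLM-5 | Zhipu AI | $1.00 | $3.20 | - | $0.20 |
| Gemini 3 Flash | Google | $0.50 | $3.00 | - | $0.050 |
| Kimi K2.5 | Moonshot | $0.60 | $3.00 | - | $0.10 |
| MiniMax M2.5 | MiniMax | $0.30 | $1.20 | $0.38 | $0.030 |
| MiniMax M2.7 | MiniMax | $0.30 | $1.20 | $0.38 | $0.060 |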
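
Output price alone does not determine what a given workload costs, since the ratio of input to output tokens also matters. As a rough illustration, the sketch below ranks a few models from the table above by the cost of one hypothetical workload (200,000 input tokens and 20,000 output tokens, no caching). The names `PRICES`, `workload_cost`, and `rank_by_cost` are illustrative only, and the workload size is an assumption chosen for the example.

```python
# (input, output) prices in USD per 1M tokens, copied from the table above.
PRICES = {
    "Opus 4.6": (5.00, 25.00),
    "Sonnet 4.6": (3.00, 15.00),
    "GPT-5.4": (2.50, 15.00),
    "Gemini 3.1 Pro": (2.00, 12.00),
    "Haiku 4.5": (1.00, 5.00),
    "MiniMax M2.7": (0.30, 1.20),
}


def workload_cost(input_price: float, output_price: float,
                  input_tokens: int, output_tokens: int) -> float:
    """USD cost of a workload, ignoring prompt caching."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def rank_by_cost(prices: dict, input_tokens: int, output_tokens: int) -> list:
    """Return (model, cost) pairs sorted from cheapest to most expensive."""
    costs = {
        name: workload_cost(inp, out, input_tokens, output_tokens)
        for name, (inp, out) in prices.items()
    }
    return sorted(costs.items(), key=lambda item: item[1])


# Hypothetical workload: 200k input tokens, 20k output tokens.
ranked = rank_by_cost(PRICES, input_tokens=200_000, output_tokens=20_000)
for name, cost in ranked:
    print(f"{name:<16} ${cost:.3f}")
```

For this input-heavy workload, Sonnet 4.6 comes to $0.90 and GPT-5.4 to $0.80, even though both share the same $15.00 output price.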