This page applies to both Verdent Desktop and Verdent for VS Code.
Overview
Verdent integrates state-of-the-art large language models from the world’s leading AI labs, including Anthropic (Claude), OpenAI (GPT), Google (Gemini), Moonshot (Kimi), MiniMax, Qwen, and Z.AI (GLM). To help users understand the cost behind every AI interaction, we disclose provider token pricing for Verdent’s current built-in model lineup in this document.
All prices listed are provider-side published prices in US dollars ($) per one million tokens (1M tokens). The lineup below follows the currently enabled models from Verdent’s model configuration.
Key Concepts
Tokens
A token is the fundamental unit of text processing for large language models. One token is approximately 4 English characters or 1-2 Chinese characters. Pricing is based on the number of tokens consumed, with input tokens and output tokens billed at separate rates.
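As a rough illustration of the character-to-token ratio above, the sketch below estimates a token count from text length. The function name `estimate_tokens` and its `chars_per_token` parameter are hypothetical, and the result is only a heuristic: each provider's tokenizer produces different counts, and the billed figure always comes from the provider.

```python
import math


def estimate_tokens(text: str, chars_per_token: float = 4.0) -> int:
    """Estimate a token count from character length.

    Heuristic only: uses the ~4 characters per token rule of thumb for
    English. It is not a real tokenizer and will differ from billed counts.
    """
    if not text:
        return 0
    return math.ceil(len(text) / chars_per_token)


# English text: roughly 4 characters per token.
print(estimate_tokens("Refactor this function to use async I/O."))

# Chinese text: roughly 1-2 characters per token, so pass a smaller ratio
# (1.5 is an assumed midpoint, not a published figure).
print(estimate_tokens("你好世界", chars_per_token=1.5))
```

For budgeting, an estimate like this is usually enough to compare models; for exact figures, use the token counts reported with each response.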
Billing Model
All models on Verdent currently use a per-token billing model, meaning you are charged based on the actual number of input and output tokens consumed during each interaction.
Price Components
Each model’s pricing consists of the following dimensions:
- Input Price: The per-token cost for the prompt (user message and context) sent to the model.
- Output Price: The per-token cost for the response generated by the model. Typically higher than the input price.
- Cache Write Price: The per-token cost of writing prompt content to the cache for the first time. Applies only to models that support prompt caching.
- Cache Read Price: The per-token cost when hitting an existing cache entry. Typically much lower than the standard input price, effectively reducing costs for repeated contexts.
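The four price components combine into a single per-interaction cost. The sketch below shows one way to compute it, using the Claude Sonnet 4.6 prices from the Anthropic table on this page. The `Pricing` class and `interaction_cost` function are hypothetical names, and the code assumes the four token counts are reported as separate, non-overlapping buckets; providers differ in how they report cached tokens relative to input tokens, so check the usage data you actually receive.

```python
from dataclasses import dataclass

TOKENS_PER_MILLION = 1_000_000


@dataclass(frozen=True)
class Pricing:
    """Prices in USD per 1M tokens, as listed in the tables on this page."""
    input: float
    output: float
    cache_write: float = 0.0
    cache_read: float = 0.0


def interaction_cost(pricing, input_tokens, output_tokens,
                     cache_write_tokens=0, cache_read_tokens=0):
    """Return the USD cost of one interaction.

    Assumption: the four counts are disjoint, i.e. input_tokens excludes
    any tokens that were written to or read from the cache.
    """
    total = (
        input_tokens * pricing.input
        + output_tokens * pricing.output
        + cache_write_tokens * pricing.cache_write
        + cache_read_tokens * pricing.cache_read
    )
    return total / TOKENS_PER_MILLION


# Claude Sonnet 4.6 prices from the Anthropic table.
sonnet = Pricing(input=3.00, output=15.00, cache_write=3.75, cache_read=0.30)

# 2,000 uncached input tokens, 500 output tokens, 10,000 tokens read from cache.
cost = interaction_cost(sonnet, input_tokens=2_000, output_tokens=500,
                        cache_read_tokens=10_000)
print(f"${cost:.4f}")  # prints $0.0165
```

In this example the 10,000 cached tokens cost $0.0030; at the standard input price the same tokens would cost $0.0300.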
Model Pricing Details
Below are the provider prices for all models currently available on Verdent, organized by provider. All prices are in USD per 1M tokens.
Anthropic (Claude Series)
| Model | Input ($/1M) | Output ($/1M) | Cache Write ($/1M) | Cache Read ($/1M) |
|---|---|---|---|---|
| Claude Opus 4.8 | $5.00 | $25.00 | $6.25 | $0.50 |
| Claude Opus 4.7 | $5.00 | $25.00 | $6.25 | $0.50 |
| Claude Opus 4.6 | $5.00 | $25.00 | $6.25 | $0.50 |
| Claude Sonnet 4.6 | $3.00 | $15.00 | $3.75 | $0.30 |
| Claude Haiku 4.5 | $1.00 | $5.00 | $1.25 | $0.10 |
Claude Opus 4.8 (claude-opus-4-8)
Latest Opus model available on Verdent. It is designed for complex coding, agentic tasks, enterprise workflows, and deep multi-step reasoning.
Claude Opus 4.7 (claude-opus-4-7)
High-capability Opus model for long-running agents, multi-step coding tasks, and complex knowledge work.
Claude Opus 4.6 (claude-opus-4-6)
Flagship-tier Opus model for complex code architecture, deep analysis, and difficult problem-solving tasks.
Claude Sonnet 4.6 (claude-sonnet-4-6)
Balanced model with strong performance and competitive pricing; recommended for everyday development.
Claude Haiku 4.5 (claude-haiku-4-5@20251001)
Fast and lightweight model with low latency. Best for simple conversations, quick lookups, and cost-sensitive tasks.
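Because Claude models charge more for a cache write than for standard input, caching pays off only when the cached content is reused. The sketch below compares the cost of sending the same prompt prefix repeatedly with and without caching, using the Claude Sonnet 4.6 prices from the table above. The function name `prefix_cost` is hypothetical, and the code assumes the first request writes the cache and every later request hits it; real cache entries can expire, which this sketch does not model.

```python
def prefix_cost(prefix_tokens, requests, input_price,
                cache_write_price, cache_read_price):
    """Return (uncached_cost, cached_cost) in USD for a repeated prefix.

    Prices are in USD per 1M tokens. Assumes one cache write on the first
    request and a cache hit on each of the remaining requests.
    """
    if requests < 1:
        raise ValueError("requests must be at least 1")
    millions = prefix_tokens / 1_000_000
    uncached = requests * millions * input_price
    cached = millions * (cache_write_price + (requests - 1) * cache_read_price)
    return uncached, cached


# Claude Sonnet 4.6: input $3.00, cache write $3.75, cache read $0.30.
for n in (1, 2, 5):
    uncached, cached = prefix_cost(100_000, n, 3.00, 3.75, 0.30)
    print(f"{n} request(s): uncached ${uncached:.3f}, cached ${cached:.3f}")
```

Under these assumptions a single request is slightly more expensive with caching ($0.375 versus $0.300 for a 100,000-token prefix), and caching becomes cheaper from the second request onward.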
OpenAI (GPT Series)
| Model | Input ($/1M) | Output ($/1M) | Cache Write ($/1M) | Cache Read ($/1M) |
|---|---|---|---|---|
| GPT-5.5 | $5.00 | $30.00 | - | $0.50 |
| GPT-5.4 | $2.50 | $15.00 | - | $0.25 |
| GPT-5.3 Codex | $1.75 | $14.00 | - | $0.175 |
| GPT-5.4 mini | $0.75 | $4.50 | - | $0.075 |
GPT-5.5 (gpt-5.5)
The latest GPT model available on Verdent, suited to frontier reasoning, code generation, and multi-step analysis workloads.
GPT-5.4 (gpt-5.4)
Flagship GPT model with strong reasoning and code generation quality.
GPT-5.3 Codex (gpt-5.3-codex)
Code-specialized model optimized for programming tasks, including large-scale code generation and refactoring.
GPT-5.4 mini (gpt-5.4-mini)
Efficient GPT model for high-volume workloads, subagents, and cost-sensitive coding tasks.
Google (Gemini Series)
| Model | Input ($/1M) | Output ($/1M) | Cache Write ($/1M) | Cache Read ($/1M) |
|---|---|---|---|---|
| Gemini 3.5 Flash | $1.50 | $9.00 | - | $0.15 |
| Gemini 3.1 Pro | $2.00 | $12.00 | - | $0.20 |
| Gemini 3 Flash | $0.50 | $3.00 | - | $0.050 |
Gemini 3.5 Flash (gemini-3.5-flash)
Fast frontier Gemini model for agentic execution, coding, and long-horizon tasks at scale.
Gemini 3.1 Pro (gemini-3.1-pro-preview)
Professional-grade model with strong reasoning, suitable for complex analysis and deep thinking tasks.
Gemini 3 Flash (gemini-3-flash-preview)
Ultra-fast model with excellent cost efficiency. Ideal for high-volume batch processing.
Moonshot (Kimi Series)
| Model | Input ($/1M) | Output ($/1M) | Cache Write ($/1M) | Cache Read ($/1M) |
|---|---|---|---|---|
| Kimi K2.7 Code | $0.95 | $4.00 | - | $0.19 |
Kimi K2.7 Code (kimi-k2.7-code)
Moonshot’s current coding-focused Kimi model, built for long-context programming tasks and agentic code workflows.
MiniMax
| Model | Input ($/1M) | Output ($/1M) | Cache Write ($/1M) | Cache Read ($/1M) |
|---|---|---|---|---|
| MiniMax M3 | $0.60 | $2.40 | $0.00 | $0.12 |
MiniMax M3 (MiniMax-M3)
Cost-efficient long-context model for reliable multi-skill agent workflows, engineering tasks, and office automation.
Qwen
| Model | Input ($/1M) | Output ($/1M) | Cache Write ($/1M) | Cache Read ($/1M) |
|---|---|---|---|---|
| Qwen3.7-Plus | $0.1794 | $0.7157 | $0.00 | $0.0364 |
Qwen3.7-Plus (qwen3.7-plus)
Cost-effective multimodal model for text, image, and video inputs, with strong agentic and vision-language capabilities.
Z.AI (GLM Series)
| Model | Input ($/1M) | Output ($/1M) | Cache Write ($/1M) | Cache Read ($/1M) |
|---|---|---|---|---|
| GLM-5.2 | $1.40 | $4.40 | - | $0.26 |
GLM-5.2 (glm-5.2)
Long-context engineering model for project-scale code understanding, stable long-task execution, and standards-aware implementation work.
Pricing Overview
The table below summarizes the core pricing for all currently enabled models, sorted by output price descending for easy comparison:
| Model | Provider | Input ($/1M) | Output ($/1M) | Cache Write ($/1M) | Cache Read ($/1M) |
|---|---|---|---|---|---|
| GPT-5.5 | OpenAI | $5.00 | $30.00 | - | $0.50 |
| Opus 4.8 | Anthropic | $5.00 | $25.00 | $6.25 | $0.50 |
| Opus 4.7 | Anthropic | $5.00 | $25.00 | $6.25 | $0.50 |
| Opus 4.6 | Anthropic | $5.00 | $25.00 | $6.25 | $0.50 |
| GPT-5.4 | OpenAI | $2.50 | $15.00 | - | $0.25 |
| Sonnet 4.6 | Anthropic | $3.00 | $15.00 | $3.75 | $0.30 |
| GPT-5.3 Codex | OpenAI | $1.75 | $14.00 | - | $0.175 |
| Gemini 3.1 Pro | Google | $2.00 | $12.00 | - | $0.20 |
| Gemini 3.5 Flash | Google | $1.50 | $9.00 | - | $0.15 |
| Haiku 4.5 | Anthropic | $1.00 | $5.00 | $1.25 | $0.10 |
| GPT-5.4 mini | OpenAI | $0.75 | $4.50 | - | $0.075 |
| GLM-5.2 | Z.AI | $1.40 | $4.40 | - | $0.26 |
| Kimi K2.7 Code | Moonshot | $0.95 | $4.00 | - | $0.19 |
| Gemini 3 Flash | Google | $0.50 | $3.00 | - | $0.050 |
| MiniMax M3 | MiniMax | $0.60 | $2.40 | $0.00 | $0.12 |
| Qwen3.7-Plus | Qwen | $0.1794 | $0.7157 | $0.00 | $0.0364 |
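To compare models for a specific workload rather than by output price alone, the input and output prices can be combined for a fixed token budget. The sketch below ranks a few rows copied from the table above for a hypothetical workload of 1,000,000 input tokens and 200,000 output tokens. The names `PRICES`, `workload_cost`, and `rank_by_cost` are illustrative, and cache pricing is ignored for simplicity.

```python
# (input, output) prices in USD per 1M tokens, copied from the overview table.
PRICES = {
    "GPT-5.5": (5.00, 30.00),
    "Sonnet 4.6": (3.00, 15.00),
    "Haiku 4.5": (1.00, 5.00),
    "Gemini 3 Flash": (0.50, 3.00),
    "Qwen3.7-Plus": (0.1794, 0.7157),
}


def workload_cost(input_price, output_price, input_tokens, output_tokens):
    """Return the USD cost of a workload, ignoring any cache pricing."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def rank_by_cost(prices, input_tokens, output_tokens):
    """Return (model, cost) pairs sorted from cheapest to most expensive."""
    costs = {
        model: workload_cost(inp, out, input_tokens, output_tokens)
        for model, (inp, out) in prices.items()
    }
    return sorted(costs.items(), key=lambda item: item[1])


# Hypothetical workload: 1M input tokens and 200K output tokens.
for model, cost in rank_by_cost(PRICES, 1_000_000, 200_000):
    print(f"{model:<16} ${cost:.2f}")
```

The ranking depends on the input-to-output ratio of the workload, so it is worth rerunning the comparison with token counts that match your own usage.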